Warning: Permanently added '52.203.151.153' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/10036347-rhel+epel-10-x86_64 --chroot rhel+epel-10-x86_64 Version: 1.6 PID: 8758 Logging PID: 8760 Task: {'allow_user_ssh': False, 'appstream': False, 'background': True, 'bootstrap': 'off', 'build_id': 10036347, 'buildroot_pkgs': [], 'chroot': 'rhel+epel-10-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': '35b2dc61da272ff0022b11a6371c3ad02ab41dda', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'llama-cpp', 'package_version': 'b6153-1', 'project_dirname': 'el-pkgs', 'project_name': 'el-pkgs', 'project_owner': 'dchen', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/dchen/el-pkgs/rhel+epel-10-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': 'dchen/el-pkgs--https://src.fedoraproject.org/user/trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'https://src.fedoraproject.org/user/trix', 'tags': [], 'task_id': '10036347-rhel+epel-10-x86_64', 'timeout': 18000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp', '/var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp'... Running: git checkout 35b2dc61da272ff0022b11a6371c3ad02ab41dda -- cmd: ['git', 'checkout', '35b2dc61da272ff0022b11a6371c3ad02ab41dda', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp rc: 0 stdout: stderr: Note: switching to '35b2dc61da272ff0022b11a6371c3ad02ab41dda'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 35b2dc6 automatic import of llama-cpp Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading llama.cpp-b6153.tar.gz INFO: Reading stdout from command: curl --help all INFO: Calling: curl -H Pragma: -o llama.cpp-b6153.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/dchen/el-pkgs/llama-cpp/llama.cpp-b6153.tar.gz/md5/e7eae951975b13b8eed5bb4264c632cc/llama.cpp-b6153.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 24.3M 100 24.3M 0 0 16.9M 0 0:00:01 0:00:01 --:--:-- 16.9M INFO: Reading stdout from command: md5sum llama.cpp-b6153.tar.gz Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1768866066.175986 -r /var/lib/copr-rpmbuild/results/configs/child.cfg tail: /var/lib/copr-rpmbuild/main.log: file truncated INFO: mock.py version 6.6 starting (python version = 3.13.7, NVR = mock-6.6-1.fc42), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1768866066.175986 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp/llama-cpp.spec) Config(rhel+epel-10-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.6 INFO: Mock Version: 6.6 Start: chroot init INFO: mounting tmpfs at /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf4 detected and used (fallback) INFO: Buildroot is handled by package management from host and used with --installroot: rpm-4.20.1-1.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 python3-dnf-4.24.0-1.fc42.noarch python3-dnf-plugins-core-4.10.1-1.fc42.noarch dnf5-5.2.17.0-1.fc42.x86_64 dnf5-plugins-5.2.17.0-1.fc42.x86_64 Start: installing minimal buildroot with dnf No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 727 kB/s | 71 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 51 MB/s | 42 MB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 6.6 MB/s | 4.2 MB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 2.1 MB/s | 911 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 38 MB/s | 5.6 MB 00:00 Dependencies resolved. ======================================================================================= Package Arch Version Repo Size ======================================================================================= Installing: bash x86_64 5.2.26-6.el10 baseos 1.8 M bzip2 x86_64 1.0.8-25.el10 baseos 59 k coreutils x86_64 9.5-6.el10 baseos 1.1 M cpio x86_64 2.15-3.el10 baseos 296 k diffutils x86_64 3.10-8.el10 baseos 413 k epel-rpm-macros noarch 10-6.el10_1 epel 8.3 k findutils x86_64 1:4.10.0-5.el10 baseos 555 k gawk x86_64 5.3.0-6.el10 baseos 1.1 M glibc-minimal-langpack x86_64 2.39-58.el10_1.2 baseos 45 k grep x86_64 3.11-10.el10 baseos 305 k gzip x86_64 1.13-3.el10 baseos 174 k info x86_64 7.1-6.el10 baseos 187 k patch x86_64 2.7.6-26.el10 appstream 134 k redhat-release x86_64 10.1-18.el10 baseos 61 k redhat-rpm-config noarch 293-1.el10 appstream 77 k rpm-build x86_64 4.19.1.1-20.el10 appstream 75 k sed x86_64 4.9-3.el10 baseos 322 k shadow-utils x86_64 2:4.15.0-8.el10 baseos 1.3 M tar x86_64 2:1.35-9.el10_1 baseos 866 k unzip x86_64 6.0-69.el10 baseos 190 k util-linux x86_64 2.40.2-13.el10 baseos 1.3 M which x86_64 2.21-44.el10_0 baseos 42 k xz x86_64 1:5.6.2-4.el10_0 baseos 481 k Installing dependencies: alternatives x86_64 1.30-2.el10 baseos 45 k ansible-srpm-macros noarch 1-16.1.el10_0 epel 20 k audit-libs x86_64 4.0.3-4.el10 baseos 133 k authselect x86_64 1.5.0-8.el10 baseos 148 k authselect-libs x86_64 1.5.0-8.el10 baseos 227 k basesystem noarch 11-22.el10 baseos 8.3 k binutils x86_64 2.41-58.el10_1.2 baseos 6.4 M binutils-gold x86_64 2.41-58.el10_1.2 baseos 797 k bzip2-libs x86_64 1.0.8-25.el10 baseos 43 k ca-certificates noarch 2025.2.80_v9.0.305-102.el10_1 baseos 1.1 M coreutils-common x86_64 9.5-6.el10 baseos 2.2 M cracklib x86_64 2.9.11-8.el10 baseos 100 k cracklib-dicts x86_64 2.9.11-8.el10 baseos 3.7 M crypto-policies noarch 20250905-2.gitc7eb7b2.el10_1 baseos 98 k curl x86_64 8.12.1-2.el10 baseos 219 k cyrus-sasl-lib x86_64 2.1.28-29.el10 baseos 106 k debugedit x86_64 5.1-8.el10 appstream 80 k dwz x86_64 0.16-1.el10 appstream 140 k ed x86_64 1.20-5.el10 baseos 86 k efi-srpm-macros noarch 6-6.el10 appstream 25 k elfutils x86_64 0.193-1.el10 baseos 573 k elfutils-debuginfod-client x86_64 0.193-1.el10 baseos 47 k elfutils-default-yama-scope noarch 0.193-1.el10 baseos 13 k elfutils-libelf x86_64 0.193-1.el10 baseos 208 k elfutils-libs x86_64 0.193-1.el10 baseos 270 k file x86_64 5.45-8.el10 baseos 49 k file-libs x86_64 5.45-8.el10 baseos 764 k filesystem x86_64 3.18-17.el10 baseos 4.8 M fonts-srpm-macros noarch 1:2.0.5-18.el10 appstream 29 k forge-srpm-macros noarch 0.4.0-6.el10 appstream 23 k fpc-srpm-macros noarch 1.3-7.el10_1 epel 7.8 k gdb-minimal x86_64 16.3-2.el10 appstream 4.4 M gdbm x86_64 1:1.23-12.el10_0 baseos 156 k gdbm-libs x86_64 1:1.23-12.el10_0 baseos 60 k ghc-srpm-macros noarch 1.9.2-1.el10_0 epel 9.1 k glibc x86_64 2.39-58.el10_1.2 baseos 2.1 M glibc-common x86_64 2.39-58.el10_1.2 baseos 339 k glibc-gconv-extra x86_64 2.39-58.el10_1.2 baseos 1.7 M gmp x86_64 1:6.2.1-12.el10 baseos 318 k go-srpm-macros noarch 3.6.0-4.el10 appstream 29 k jansson x86_64 2.14-3.el10 baseos 48 k json-c x86_64 0.18-3.el10 baseos 47 k kernel-srpm-macros noarch 1.0-25.el10 appstream 11 k keyutils-libs x86_64 1.6.3-5.el10 baseos 35 k krb5-libs x86_64 1.21.3-8.el10_0 baseos 767 k libacl x86_64 2.3.2-4.el10 baseos 27 k libarchive x86_64 3.7.7-4.el10_0 baseos 414 k libattr x86_64 2.5.2-5.el10 baseos 20 k libblkid x86_64 2.40.2-13.el10 baseos 124 k libbrotli x86_64 1.1.0-6.el10 baseos 349 k libcap x86_64 2.69-7.el10 baseos 95 k libcap-ng x86_64 0.8.4-6.el10 baseos 36 k libcom_err x86_64 1.47.1-4.el10 baseos 27 k libcurl x86_64 8.12.1-2.el10 baseos 371 k libeconf x86_64 0.6.2-4.el10 baseos 36 k libevent x86_64 2.1.12-16.el10 baseos 265 k libfdisk x86_64 2.40.2-13.el10 baseos 159 k libffi x86_64 3.4.4-10.el10 baseos 41 k libgcc x86_64 14.3.1-2.1.el10 baseos 145 k libgomp x86_64 14.3.1-2.1.el10 baseos 368 k libidn2 x86_64 2.3.7-3.el10 baseos 122 k libmount x86_64 2.40.2-13.el10 baseos 155 k libnghttp2 x86_64 1.64.0-2.el10 baseos 80 k libpkgconf x86_64 2.1.0-3.el10 baseos 41 k libpsl x86_64 0.21.5-6.el10 baseos 67 k libpwquality x86_64 1.4.5-12.el10 baseos 127 k libselinux x86_64 3.9-1.el10 baseos 97 k libsemanage x86_64 3.9-1.el10 baseos 122 k libsepol x86_64 3.9-1.el10 baseos 348 k libsmartcols x86_64 2.40.2-13.el10 baseos 83 k libssh x86_64 0.11.1-5.el10_1 baseos 233 k libssh-config noarch 0.11.1-5.el10_1 baseos 8.6 k libstdc++ x86_64 14.3.1-2.1.el10 baseos 924 k libtasn1 x86_64 4.20.0-1.el10 baseos 78 k libunistring x86_64 1.1-10.el10 baseos 550 k libutempter x86_64 1.2.1-15.el10 baseos 30 k libuuid x86_64 2.40.2-13.el10 baseos 28 k libverto x86_64 0.3.2-10.el10 baseos 24 k libxcrypt x86_64 4.4.36-10.el10 baseos 124 k libxml2 x86_64 2.12.5-9.el10_0 baseos 692 k libzstd x86_64 1.5.5-9.el10 baseos 294 k lua-libs x86_64 5.4.6-7.el10 baseos 134 k lua-srpm-macros noarch 1-15.el10 appstream 10 k lz4-libs x86_64 1.9.4-8.el10 baseos 70 k mpfr x86_64 4.2.1-5.el10 baseos 349 k ncurses-base noarch 6.4-14.20240127.el10 baseos 104 k ncurses-libs x86_64 6.4-14.20240127.el10 baseos 342 k ocaml-srpm-macros noarch 10-4.el10 appstream 10 k openblas-srpm-macros noarch 2-19.el10 appstream 9.0 k openldap x86_64 2.6.9-1.el10 baseos 240 k openssl-fips-provider x86_64 3.0.7-8.el10 baseos 9.2 k openssl-fips-provider-so x86_64 3.0.7-8.el10 baseos 576 k openssl-libs x86_64 1:3.5.1-5.el10_1 baseos 2.3 M p11-kit x86_64 0.25.5-7.el10 baseos 501 k p11-kit-trust x86_64 0.25.5-7.el10 baseos 137 k package-notes-srpm-macros noarch 0.5-13.el10 appstream 11 k pam x86_64 1.6.1-8.el10 baseos 586 k pam-libs x86_64 1.6.1-8.el10 baseos 58 k pcre2 x86_64 10.44-1.el10.3 baseos 250 k pcre2-syntax noarch 10.44-1.el10.3 baseos 155 k perl-srpm-macros noarch 1-57.el10 appstream 9.7 k pkgconf x86_64 2.1.0-3.el10 baseos 48 k pkgconf-m4 noarch 2.1.0-3.el10 baseos 15 k pkgconf-pkg-config x86_64 2.1.0-3.el10 baseos 12 k popt x86_64 1.19-8.el10 baseos 70 k publicsuffix-list-dafsa noarch 20240107-5.el10 baseos 60 k pyproject-srpm-macros noarch 1.16.2-1.el10 appstream 16 k python-srpm-macros noarch 3.12-10.el10 appstream 24 k qt6-srpm-macros noarch 6.9.1-1.el10 appstream 11 k readline x86_64 8.2-11.el10 baseos 217 k rpm x86_64 4.19.1.1-20.el10 baseos 560 k rpm-build-libs x86_64 4.19.1.1-20.el10 baseos 93 k rpm-libs x86_64 4.19.1.1-20.el10 baseos 309 k rpm-sequoia x86_64 1.9.0.3-1.el10_1 baseos 968 k rust-toolset-srpm-macros noarch 1.88.0-1.el10 appstream 13 k setup noarch 2.14.5-7.el10 baseos 153 k sqlite-libs x86_64 3.46.1-5.el10_1 baseos 745 k systemd-libs x86_64 257-13.el10 baseos 823 k util-linux-core x86_64 2.40.2-13.el10 baseos 550 k xz-libs x86_64 1:5.6.2-4.el10_0 baseos 113 k zip x86_64 3.0-45.el10 baseos 270 k zlib-ng-compat x86_64 2.2.3-2.el10 baseos 79 k zstd x86_64 1.5.5-9.el10 baseos 468 k Transaction Summary ======================================================================================= Install 146 Packages Total download size: 61 M Installed size: 187 M Downloading Packages: (1/146): authselect-libs-1.5.0-8.el10.x86_64.rp 1.4 MB/s | 227 kB 00:00 (2/146): authselect-1.5.0-8.el10.x86_64.rpm 893 kB/s | 148 kB 00:00 (3/146): alternatives-1.30-2.el10.x86_64.rpm 261 kB/s | 45 kB 00:00 (4/146): basesystem-11-22.el10.noarch.rpm 312 kB/s | 8.3 kB 00:00 (5/146): bzip2-libs-1.0.8-25.el10.x86_64.rpm 892 kB/s | 43 kB 00:00 (6/146): bzip2-1.0.8-25.el10.x86_64.rpm 516 kB/s | 59 kB 00:00 (7/146): bash-5.2.26-6.el10.x86_64.rpm 11 MB/s | 1.8 MB 00:00 (8/146): coreutils-9.5-6.el10.x86_64.rpm 12 MB/s | 1.1 MB 00:00 (9/146): cracklib-2.9.11-8.el10.x86_64.rpm 3.3 MB/s | 100 kB 00:00 (10/146): cpio-2.15-3.el10.x86_64.rpm 6.5 MB/s | 296 kB 00:00 (11/146): cracklib-dicts-2.9.11-8.el10.x86_64.r 42 MB/s | 3.7 MB 00:00 (12/146): coreutils-common-9.5-6.el10.x86_64.rp 12 MB/s | 2.2 MB 00:00 (13/146): ed-1.20-5.el10.x86_64.rpm 3.1 MB/s | 86 kB 00:00 (14/146): diffutils-3.10-8.el10.x86_64.rpm 3.0 MB/s | 413 kB 00:00 (15/146): gawk-5.3.0-6.el10.x86_64.rpm 27 MB/s | 1.1 MB 00:00 (16/146): findutils-4.10.0-5.el10.x86_64.rpm 8.0 MB/s | 555 kB 00:00 (17/146): gdbm-1.23-12.el10_0.x86_64.rpm 3.9 MB/s | 156 kB 00:00 (18/146): gdbm-libs-1.23-12.el10_0.x86_64.rpm 1.5 MB/s | 60 kB 00:00 (19/146): grep-3.11-10.el10.x86_64.rpm 8.2 MB/s | 305 kB 00:00 (20/146): gzip-1.13-3.el10.x86_64.rpm 4.7 MB/s | 174 kB 00:00 (21/146): jansson-2.14-3.el10.x86_64.rpm 996 kB/s | 48 kB 00:00 (22/146): info-7.1-6.el10.x86_64.rpm 2.8 MB/s | 187 kB 00:00 (23/146): json-c-0.18-3.el10.x86_64.rpm 893 kB/s | 47 kB 00:00 (24/146): libacl-2.3.2-4.el10.x86_64.rpm 1.0 MB/s | 27 kB 00:00 (25/146): keyutils-libs-1.6.3-5.el10.x86_64.rpm 1.1 MB/s | 35 kB 00:00 (26/146): libattr-2.5.2-5.el10.x86_64.rpm 523 kB/s | 20 kB 00:00 (27/146): libcap-2.69-7.el10.x86_64.rpm 2.5 MB/s | 95 kB 00:00 (28/146): libeconf-0.6.2-4.el10.x86_64.rpm 1.1 MB/s | 36 kB 00:00 (29/146): libbrotli-1.1.0-6.el10.x86_64.rpm 4.4 MB/s | 349 kB 00:00 (30/146): libcap-ng-0.8.4-6.el10.x86_64.rpm 621 kB/s | 36 kB 00:00 (31/146): libevent-2.1.12-16.el10.x86_64.rpm 8.8 MB/s | 265 kB 00:00 (32/146): libidn2-2.3.7-3.el10.x86_64.rpm 4.1 MB/s | 122 kB 00:00 (33/146): libpkgconf-2.1.0-3.el10.x86_64.rpm 1.4 MB/s | 41 kB 00:00 (34/146): libpsl-0.21.5-6.el10.x86_64.rpm 2.0 MB/s | 67 kB 00:00 (35/146): libpwquality-1.4.5-12.el10.x86_64.rpm 1.8 MB/s | 127 kB 00:00 (36/146): libtasn1-4.20.0-1.el10.x86_64.rpm 1.0 MB/s | 78 kB 00:00 (37/146): libunistring-1.1-10.el10.x86_64.rpm 17 MB/s | 550 kB 00:00 (38/146): libnghttp2-1.64.0-2.el10.x86_64.rpm 544 kB/s | 80 kB 00:00 (39/146): libutempter-1.2.1-15.el10.x86_64.rpm 884 kB/s | 30 kB 00:00 (40/146): libverto-0.3.2-10.el10.x86_64.rpm 856 kB/s | 24 kB 00:00 (41/146): lua-libs-5.4.6-7.el10.x86_64.rpm 2.5 MB/s | 134 kB 00:00 (42/146): libzstd-1.5.5-9.el10.x86_64.rpm 2.8 MB/s | 294 kB 00:00 (43/146): libxcrypt-4.4.36-10.el10.x86_64.rpm 1.0 MB/s | 124 kB 00:00 (44/146): lz4-libs-1.9.4-8.el10.x86_64.rpm 845 kB/s | 70 kB 00:00 (45/146): mpfr-4.2.1-5.el10.x86_64.rpm 6.7 MB/s | 349 kB 00:00 (46/146): ncurses-base-6.4-14.20240127.el10.noa 1.6 MB/s | 104 kB 00:00 (47/146): p11-kit-trust-0.25.5-7.el10.x86_64.rp 4.7 MB/s | 137 kB 00:00 (48/146): ncurses-libs-6.4-14.20240127.el10.x86 4.1 MB/s | 342 kB 00:00 (49/146): p11-kit-0.25.5-7.el10.x86_64.rpm 3.2 MB/s | 501 kB 00:00 (50/146): pcre2-syntax-10.44-1.el10.3.noarch.rp 1.6 MB/s | 155 kB 00:00 (51/146): pcre2-10.44-1.el10.3.x86_64.rpm 1.9 MB/s | 250 kB 00:00 (52/146): pkgconf-m4-2.1.0-3.el10.noarch.rpm 334 kB/s | 15 kB 00:00 (53/146): pkgconf-pkg-config-2.1.0-3.el10.x86_6 277 kB/s | 12 kB 00:00 (54/146): popt-1.19-8.el10.x86_64.rpm 1.4 MB/s | 70 kB 00:00 (55/146): pkgconf-2.1.0-3.el10.x86_64.rpm 413 kB/s | 48 kB 00:00 (56/146): publicsuffix-list-dafsa-20240107-5.el 1.0 MB/s | 60 kB 00:00 (57/146): readline-8.2-11.el10.x86_64.rpm 2.2 MB/s | 217 kB 00:00 (58/146): sed-4.9-3.el10.x86_64.rpm 3.0 MB/s | 322 kB 00:00 (59/146): zstd-1.5.5-9.el10.x86_64.rpm 4.2 MB/s | 468 kB 00:00 (60/146): krb5-libs-1.21.3-8.el10_0.x86_64.rpm 18 MB/s | 767 kB 00:00 (61/146): libarchive-3.7.7-4.el10_0.x86_64.rpm 5.7 MB/s | 414 kB 00:00 (62/146): libxml2-2.12.5-9.el10_0.x86_64.rpm 11 MB/s | 692 kB 00:00 (63/146): which-2.21-44.el10_0.x86_64.rpm 815 kB/s | 42 kB 00:00 (64/146): audit-libs-4.0.3-4.el10.x86_64.rpm 4.6 MB/s | 133 kB 00:00 (65/146): crypto-policies-20250905-2.gitc7eb7b2 3.3 MB/s | 98 kB 00:00 (66/146): xz-5.6.2-4.el10_0.x86_64.rpm 4.7 MB/s | 481 kB 00:00 (67/146): cyrus-sasl-lib-2.1.28-29.el10.x86_64. 1.8 MB/s | 106 kB 00:00 (68/146): xz-libs-5.6.2-4.el10_0.x86_64.rpm 633 kB/s | 113 kB 00:00 (69/146): elfutils-debuginfod-client-0.193-1.el 763 kB/s | 47 kB 00:00 (70/146): elfutils-0.193-1.el10.x86_64.rpm 3.6 MB/s | 573 kB 00:00 (71/146): elfutils-libelf-0.193-1.el10.x86_64.r 1.5 MB/s | 208 kB 00:00 (72/146): elfutils-default-yama-scope-0.193-1.e 11 kB/s | 13 kB 00:01 (73/146): file-5.45-8.el10.x86_64.rpm 1.6 MB/s | 49 kB 00:00 (74/146): curl-8.12.1-2.el10.x86_64.rpm 159 kB/s | 219 kB 00:01 (75/146): elfutils-libs-0.193-1.el10.x86_64.rpm 259 kB/s | 270 kB 00:01 (76/146): filesystem-3.18-17.el10.x86_64.rpm 34 MB/s | 4.8 MB 00:00 (77/146): file-libs-5.45-8.el10.x86_64.rpm 5.2 MB/s | 764 kB 00:00 (78/146): gmp-6.2.1-12.el10.x86_64.rpm 2.2 MB/s | 318 kB 00:00 (79/146): libblkid-2.40.2-13.el10.x86_64.rpm 1.8 MB/s | 124 kB 00:00 (80/146): libcom_err-1.47.1-4.el10.x86_64.rpm 383 kB/s | 27 kB 00:00 (81/146): libfdisk-2.40.2-13.el10.x86_64.rpm 5.8 MB/s | 159 kB 00:00 (82/146): libgcc-14.3.1-2.1.el10.x86_64.rpm 5.1 MB/s | 145 kB 00:00 (83/146): libffi-3.4.4-10.el10.x86_64.rpm 591 kB/s | 41 kB 00:00 (84/146): libmount-2.40.2-13.el10.x86_64.rpm 5.2 MB/s | 155 kB 00:00 (85/146): libgomp-14.3.1-2.1.el10.x86_64.rpm 5.2 MB/s | 368 kB 00:00 (86/146): libcurl-8.12.1-2.el10.x86_64.rpm 2.4 MB/s | 371 kB 00:00 (87/146): libsemanage-3.9-1.el10.x86_64.rpm 3.9 MB/s | 122 kB 00:00 (88/146): libselinux-3.9-1.el10.x86_64.rpm 1.3 MB/s | 97 kB 00:00 (89/146): libsmartcols-2.40.2-13.el10.x86_64.rp 2.9 MB/s | 83 kB 00:00 (90/146): libsepol-3.9-1.el10.x86_64.rpm 4.2 MB/s | 348 kB 00:00 (91/146): libstdc++-14.3.1-2.1.el10.x86_64.rpm 11 MB/s | 924 kB 00:00 (92/146): libuuid-2.40.2-13.el10.x86_64.rpm 316 kB/s | 28 kB 00:00 (93/146): openssl-fips-provider-3.0.7-8.el10.x8 318 kB/s | 9.2 kB 00:00 (94/146): pam-1.6.1-8.el10.x86_64.rpm 14 MB/s | 586 kB 00:00 (95/146): openssl-fips-provider-so-3.0.7-8.el10 5.1 MB/s | 576 kB 00:00 (96/146): openldap-2.6.9-1.el10.x86_64.rpm 1.2 MB/s | 240 kB 00:00 (97/146): pam-libs-1.6.1-8.el10.x86_64.rpm 479 kB/s | 58 kB 00:00 (98/146): rpm-build-libs-4.19.1.1-20.el10.x86_6 1.2 MB/s | 93 kB 00:00 (99/146): rpm-4.19.1.1-20.el10.x86_64.rpm 5.0 MB/s | 560 kB 00:00 (100/146): rpm-libs-4.19.1.1-20.el10.x86_64.rpm 5.3 MB/s | 309 kB 00:00 (101/146): shadow-utils-4.15.0-8.el10.x86_64.rp 13 MB/s | 1.3 MB 00:00 (102/146): setup-2.14.5-7.el10.noarch.rpm 1.3 MB/s | 153 kB 00:00 (103/146): rpm-sequoia-1.9.0.3-1.el10_1.x86_64. 6.4 MB/s | 968 kB 00:00 (104/146): sqlite-libs-3.46.1-5.el10_1.x86_64.r 13 MB/s | 745 kB 00:00 (105/146): systemd-libs-257-13.el10.x86_64.rpm 14 MB/s | 823 kB 00:00 (106/146): util-linux-2.40.2-13.el10.x86_64.rpm 17 MB/s | 1.3 MB 00:00 (107/146): util-linux-core-2.40.2-13.el10.x86_6 5.7 MB/s | 550 kB 00:00 (108/146): unzip-6.0-69.el10.x86_64.rpm 1.1 MB/s | 190 kB 00:00 (109/146): zlib-ng-compat-2.2.3-2.el10.x86_64.r 1.5 MB/s | 79 kB 00:00 (110/146): glibc-common-2.39-58.el10_1.2.x86_64 11 MB/s | 339 kB 00:00 (111/146): glibc-gconv-extra-2.39-58.el10_1.2.x 32 MB/s | 1.7 MB 00:00 (112/146): glibc-minimal-langpack-2.39-58.el10_ 1.5 MB/s | 45 kB 00:00 (113/146): ca-certificates-2025.2.80_v9.0.305-1 31 MB/s | 1.1 MB 00:00 (114/146): zip-3.0-45.el10.x86_64.rpm 1.1 MB/s | 270 kB 00:00 (115/146): glibc-2.39-58.el10_1.2.x86_64.rpm 10 MB/s | 2.1 MB 00:00 (116/146): redhat-release-10.1-18.el10.x86_64.r 1.2 MB/s | 61 kB 00:00 (117/146): libssh-0.11.1-5.el10_1.x86_64.rpm 4.2 MB/s | 233 kB 00:00 (118/146): binutils-2.41-58.el10_1.2.x86_64.rpm 61 MB/s | 6.4 MB 00:00 (119/146): tar-1.35-9.el10_1.x86_64.rpm 15 MB/s | 866 kB 00:00 (120/146): binutils-gold-2.41-58.el10_1.2.x86_6 4.7 MB/s | 797 kB 00:00 (121/146): openssl-libs-3.5.1-5.el10_1.x86_64.r 42 MB/s | 2.3 MB 00:00 (122/146): libssh-config-0.11.1-5.el10_1.noarch 38 kB/s | 8.6 kB 00:00 (123/146): fonts-srpm-macros-2.0.5-18.el10.noar 558 kB/s | 29 kB 00:00 (124/146): lua-srpm-macros-1-15.el10.noarch.rpm 113 kB/s | 10 kB 00:00 (125/146): package-notes-srpm-macros-0.5-13.el1 200 kB/s | 11 kB 00:00 (126/146): perl-srpm-macros-1-57.el10.noarch.rp 59 kB/s | 9.7 kB 00:00 (127/146): openblas-srpm-macros-2-19.el10.noarc 120 kB/s | 9.0 kB 00:00 (128/146): efi-srpm-macros-6-6.el10.noarch.rpm 99 kB/s | 25 kB 00:00 (129/146): go-srpm-macros-3.6.0-4.el10.noarch.r 459 kB/s | 29 kB 00:00 (130/146): ocaml-srpm-macros-10-4.el10.noarch.r 63 kB/s | 10 kB 00:00 (131/146): patch-2.7.6-26.el10.x86_64.rpm 3.1 MB/s | 134 kB 00:00 (132/146): pyproject-srpm-macros-1.16.2-1.el10. 519 kB/s | 16 kB 00:00 (133/146): python-srpm-macros-3.12-10.el10.noar 532 kB/s | 24 kB 00:00 (134/146): qt6-srpm-macros-6.9.1-1.el10.noarch. 353 kB/s | 11 kB 00:00 (135/146): forge-srpm-macros-0.4.0-6.el10.noarc 245 kB/s | 23 kB 00:00 (136/146): kernel-srpm-macros-1.0-25.el10.noarc 56 kB/s | 11 kB 00:00 (137/146): redhat-rpm-config-293-1.el10.noarch. 1.8 MB/s | 77 kB 00:00 (138/146): rpm-build-4.19.1.1-20.el10.x86_64.rp 1.2 MB/s | 75 kB 00:00 (139/146): debugedit-5.1-8.el10.x86_64.rpm 1.5 MB/s | 80 kB 00:00 (140/146): dwz-0.16-1.el10.x86_64.rpm 3.6 MB/s | 140 kB 00:00 (141/146): ansible-srpm-macros-1-16.1.el10_0.no 1.6 MB/s | 20 kB 00:00 (142/146): epel-rpm-macros-10-6.el10_1.noarch.r 2.2 MB/s | 8.3 kB 00:00 (143/146): fpc-srpm-macros-1.3-7.el10_1.noarch. 1.7 MB/s | 7.8 kB 00:00 (144/146): ghc-srpm-macros-1.9.2-1.el10_0.noarc 2.5 MB/s | 9.1 kB 00:00 (145/146): gdb-minimal-16.3-2.el10.x86_64.rpm 45 MB/s | 4.4 MB 00:00 (146/146): rust-toolset-srpm-macros-1.88.0-1.el 53 kB/s | 13 kB 00:00 -------------------------------------------------------------------------------- Total 12 MB/s | 61 MB 00:05 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 3.6 MB/s | 3.7 kB 00:00 Importing GPG key 0x5A6340B3: Userid : "Red Hat, Inc. (auxiliary key 3) " Fingerprint: 7E46 2425 8C40 6535 D56D 6F13 5054 E4A4 5A63 40B3 From : /usr/share/distribution-gpg-keys/redhat/RPM-GPG-KEY-redhat10-release Key imported successfully Importing GPG key 0xFD431D51: Userid : "Red Hat, Inc. (release key 2) " Fingerprint: 567E 347A D004 4ADE 55BA 8A5F 199E 2F91 FD43 1D51 From : /usr/share/distribution-gpg-keys/redhat/RPM-GPG-KEY-redhat10-release Key imported successfully Extra Packages for Enterprise Linux 10 - x86_64 1.6 MB/s | 1.6 kB 00:00 Importing GPG key 0xE37ED158: Userid : "Fedora (epel10) " Fingerprint: 7D8D 15CB FC4E 6268 8591 FB26 33D9 8517 E37E D158 From : /usr/share/distribution-gpg-keys/epel/RPM-GPG-KEY-EPEL-10 Key imported successfully Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Running scriptlet: filesystem-3.18-17.el10.x86_64 1/1 Preparing : 1/1 Installing : libgcc-14.3.1-2.1.el10.x86_64 1/146 Running scriptlet: libgcc-14.3.1-2.1.el10.x86_64 1/146 Installing : redhat-release-10.1-18.el10.x86_64 2/146 Running scriptlet: setup-2.14.5-7.el10.noarch 3/146 Creating group 'adm' with GID 4. Creating group 'audio' with GID 63. Creating group 'bin' with GID 1. Creating group 'cdrom' with GID 11. Creating group 'clock' with GID 103. Creating group 'daemon' with GID 2. Creating group 'dialout' with GID 18. Creating group 'disk' with GID 6. Creating group 'floppy' with GID 19. Creating group 'ftp' with GID 50. Creating group 'games' with GID 20. Creating group 'kmem' with GID 9. Creating group 'lock' with GID 54. Creating group 'lp' with GID 7. Creating group 'mail' with GID 12. Creating group 'man' with GID 15. Creating group 'mem' with GID 8. Creating group 'nobody' with GID 65534. Creating group 'root' with GID 0. Creating group 'sys' with GID 3. Creating group 'tape' with GID 33. Creating group 'tty' with GID 5. Creating group 'users' with GID 100. Creating group 'video' with GID 39. Creating group 'wheel' with GID 10. Creating user 'adm' (adm) with UID 3 and GID 4. Creating user 'bin' (bin) with UID 1 and GID 1. Creating user 'daemon' (daemon) with UID 2 and GID 2. Creating user 'ftp' (FTP User) with UID 14 and GID 50. Creating user 'games' (games) with UID 12 and GID 20. Creating user 'halt' (halt) with UID 7 and GID 0. Creating user 'lp' (lp) with UID 4 and GID 7. Creating user 'mail' (mail) with UID 8 and GID 12. Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. Creating user 'operator' (operator) with UID 11 and GID 0. Creating user 'root' (Super User) with UID 0 and GID 0. Creating user 'shutdown' (shutdown) with UID 6 and GID 0. Creating user 'sync' (sync) with UID 5 and GID 0. Installing : setup-2.14.5-7.el10.noarch 3/146 warning: /etc/hosts created as /etc/hosts.rpmnew Running scriptlet: setup-2.14.5-7.el10.noarch 3/146 Installing : filesystem-3.18-17.el10.x86_64 4/146 Installing : basesystem-11-22.el10.noarch 5/146 Installing : ghc-srpm-macros-1.9.2-1.el10_0.noarch 6/146 Installing : fpc-srpm-macros-1.3-7.el10_1.noarch 7/146 Installing : ansible-srpm-macros-1-16.1.el10_0.noarch 8/146 Installing : rust-toolset-srpm-macros-1.88.0-1.el10.noarch 9/146 Installing : qt6-srpm-macros-6.9.1-1.el10.noarch 10/146 Installing : kernel-srpm-macros-1.0-25.el10.noarch 11/146 Installing : openblas-srpm-macros-2-19.el10.noarch 12/146 Installing : ocaml-srpm-macros-10-4.el10.noarch 13/146 Installing : package-notes-srpm-macros-0.5-13.el10.noarch 14/146 Installing : perl-srpm-macros-1-57.el10.noarch 15/146 Installing : libssh-config-0.11.1-5.el10_1.noarch 16/146 Installing : publicsuffix-list-dafsa-20240107-5.el10.noarch 17/146 Installing : pkgconf-m4-2.1.0-3.el10.noarch 18/146 Installing : pcre2-syntax-10.44-1.el10.3.noarch 19/146 Installing : ncurses-base-6.4-14.20240127.el10.noarch 20/146 Installing : bash-5.2.26-6.el10.x86_64 21/146 Running scriptlet: bash-5.2.26-6.el10.x86_64 21/146 Installing : ncurses-libs-6.4-14.20240127.el10.x86_64 22/146 Installing : glibc-common-2.39-58.el10_1.2.x86_64 23/146 Installing : glibc-gconv-extra-2.39-58.el10_1.2.x86_64 24/146 Running scriptlet: glibc-gconv-extra-2.39-58.el10_1.2.x86_64 24/146 Installing : glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 25/146 Running scriptlet: glibc-2.39-58.el10_1.2.x86_64 26/146 Installing : glibc-2.39-58.el10_1.2.x86_64 26/146 Running scriptlet: glibc-2.39-58.el10_1.2.x86_64 26/146 Installing : zlib-ng-compat-2.2.3-2.el10.x86_64 27/146 Installing : bzip2-libs-1.0.8-25.el10.x86_64 28/146 Installing : xz-libs-1:5.6.2-4.el10_0.x86_64 29/146 Installing : popt-1.19-8.el10.x86_64 30/146 Installing : readline-8.2-11.el10.x86_64 31/146 Installing : libstdc++-14.3.1-2.1.el10.x86_64 32/146 Installing : libuuid-2.40.2-13.el10.x86_64 33/146 Installing : libblkid-2.40.2-13.el10.x86_64 34/146 Installing : libattr-2.5.2-5.el10.x86_64 35/146 Installing : libacl-2.3.2-4.el10.x86_64 36/146 Installing : libxcrypt-4.4.36-10.el10.x86_64 37/146 Installing : libzstd-1.5.5-9.el10.x86_64 38/146 Installing : elfutils-libelf-0.193-1.el10.x86_64 39/146 Installing : gmp-1:6.2.1-12.el10.x86_64 40/146 Installing : gdbm-libs-1:1.23-12.el10_0.x86_64 41/146 Installing : libeconf-0.6.2-4.el10.x86_64 42/146 Installing : mpfr-4.2.1-5.el10.x86_64 43/146 Installing : gawk-5.3.0-6.el10.x86_64 44/146 Installing : dwz-0.16-1.el10.x86_64 45/146 Installing : unzip-6.0-69.el10.x86_64 46/146 Installing : file-libs-5.45-8.el10.x86_64 47/146 Installing : file-5.45-8.el10.x86_64 48/146 Installing : alternatives-1.30-2.el10.x86_64 49/146 Installing : jansson-2.14-3.el10.x86_64 50/146 Installing : libcap-ng-0.8.4-6.el10.x86_64 51/146 Installing : audit-libs-4.0.3-4.el10.x86_64 52/146 Installing : pam-libs-1.6.1-8.el10.x86_64 53/146 Installing : libcap-2.69-7.el10.x86_64 54/146 Installing : systemd-libs-257-13.el10.x86_64 55/146 Installing : libtasn1-4.20.0-1.el10.x86_64 56/146 Installing : libunistring-1.1-10.el10.x86_64 57/146 Installing : libidn2-2.3.7-3.el10.x86_64 58/146 Installing : lua-libs-5.4.6-7.el10.x86_64 59/146 Installing : lz4-libs-1.9.4-8.el10.x86_64 60/146 Installing : pcre2-10.44-1.el10.3.x86_64 61/146 Installing : grep-3.11-10.el10.x86_64 62/146 Installing : xz-1:5.6.2-4.el10_0.x86_64 63/146 Installing : libffi-3.4.4-10.el10.x86_64 64/146 Installing : libsepol-3.9-1.el10.x86_64 65/146 Installing : libselinux-3.9-1.el10.x86_64 66/146 Installing : sed-4.9-3.el10.x86_64 67/146 Installing : findutils-1:4.10.0-5.el10.x86_64 68/146 Installing : libmount-2.40.2-13.el10.x86_64 69/146 Installing : libsmartcols-2.40.2-13.el10.x86_64 70/146 Running scriptlet: crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Installing : crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Running scriptlet: crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Installing : util-linux-core-2.40.2-13.el10.x86_64 72/146 Installing : libsemanage-3.9-1.el10.x86_64 73/146 Installing : shadow-utils-2:4.15.0-8.el10.x86_64 74/146 Running scriptlet: libutempter-1.2.1-15.el10.x86_64 75/146 Installing : libutempter-1.2.1-15.el10.x86_64 75/146 Installing : tar-2:1.35-9.el10_1.x86_64 76/146 Installing : p11-kit-0.25.5-7.el10.x86_64 77/146 Installing : p11-kit-trust-0.25.5-7.el10.x86_64 78/146 Running scriptlet: p11-kit-trust-0.25.5-7.el10.x86_64 78/146 Installing : zstd-1.5.5-9.el10.x86_64 79/146 Installing : libpsl-0.21.5-6.el10.x86_64 80/146 Installing : zip-3.0-45.el10.x86_64 81/146 Installing : gdbm-1:1.23-12.el10_0.x86_64 82/146 Installing : cyrus-sasl-lib-2.1.28-29.el10.x86_64 83/146 Installing : libfdisk-2.40.2-13.el10.x86_64 84/146 Installing : libxml2-2.12.5-9.el10_0.x86_64 85/146 Installing : bzip2-1.0.8-25.el10.x86_64 86/146 Installing : sqlite-libs-3.46.1-5.el10_1.x86_64 87/146 Installing : cpio-2.15-3.el10.x86_64 88/146 Installing : diffutils-3.10-8.el10.x86_64 89/146 Installing : ed-1.20-5.el10.x86_64 90/146 Installing : patch-2.7.6-26.el10.x86_64 91/146 Installing : json-c-0.18-3.el10.x86_64 92/146 Installing : keyutils-libs-1.6.3-5.el10.x86_64 93/146 Installing : libbrotli-1.1.0-6.el10.x86_64 94/146 Installing : libnghttp2-1.64.0-2.el10.x86_64 95/146 Installing : libpkgconf-2.1.0-3.el10.x86_64 96/146 Installing : pkgconf-2.1.0-3.el10.x86_64 97/146 Installing : pkgconf-pkg-config-2.1.0-3.el10.x86_64 98/146 Installing : libverto-0.3.2-10.el10.x86_64 99/146 Installing : libcom_err-1.47.1-4.el10.x86_64 100/146 Installing : libgomp-14.3.1-2.1.el10.x86_64 101/146 Installing : elfutils-default-yama-scope-0.193-1.el10.noarch 102/146 Running scriptlet: elfutils-default-yama-scope-0.193-1.el10.noarch 102/146 Installing : elfutils-libs-0.193-1.el10.x86_64 103/146 Installing : coreutils-common-9.5-6.el10.x86_64 104/146 Installing : openssl-fips-provider-so-3.0.7-8.el10.x86_64 105/146 Installing : openssl-fips-provider-3.0.7-8.el10.x86_64 106/146 Installing : openssl-libs-1:3.5.1-5.el10_1.x86_64 107/146 Installing : coreutils-9.5-6.el10.x86_64 108/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Installing : ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Installing : authselect-libs-1.5.0-8.el10.x86_64 110/146 Installing : gzip-1.13-3.el10.x86_64 111/146 Installing : cracklib-2.9.11-8.el10.x86_64 112/146 Installing : krb5-libs-1.21.3-8.el10_0.x86_64 113/146 Installing : libarchive-3.7.7-4.el10_0.x86_64 114/146 Installing : libssh-0.11.1-5.el10_1.x86_64 115/146 Installing : cracklib-dicts-2.9.11-8.el10.x86_64 116/146 Installing : libpwquality-1.4.5-12.el10.x86_64 117/146 Installing : pam-1.6.1-8.el10.x86_64 118/146 Installing : libevent-2.1.12-16.el10.x86_64 119/146 Installing : openldap-2.6.9-1.el10.x86_64 120/146 Installing : libcurl-8.12.1-2.el10.x86_64 121/146 Installing : elfutils-debuginfod-client-0.193-1.el10.x86_64 122/146 Installing : binutils-gold-2.41-58.el10_1.2.x86_64 123/146 Running scriptlet: binutils-gold-2.41-58.el10_1.2.x86_64 123/146 Installing : binutils-2.41-58.el10_1.2.x86_64 124/146 Running scriptlet: binutils-2.41-58.el10_1.2.x86_64 124/146 Installing : elfutils-0.193-1.el10.x86_64 125/146 Installing : gdb-minimal-16.3-2.el10.x86_64 126/146 Installing : debugedit-5.1-8.el10.x86_64 127/146 Installing : curl-8.12.1-2.el10.x86_64 128/146 Installing : rpm-sequoia-1.9.0.3-1.el10_1.x86_64 129/146 Installing : rpm-libs-4.19.1.1-20.el10.x86_64 130/146 Running scriptlet: rpm-4.19.1.1-20.el10.x86_64 131/146 Installing : rpm-4.19.1.1-20.el10.x86_64 131/146 Installing : efi-srpm-macros-6-6.el10.noarch 132/146 Installing : lua-srpm-macros-1-15.el10.noarch 133/146 Installing : rpm-build-libs-4.19.1.1-20.el10.x86_64 134/146 Installing : go-srpm-macros-3.6.0-4.el10.noarch 135/146 Installing : fonts-srpm-macros-1:2.0.5-18.el10.noarch 136/146 Installing : forge-srpm-macros-0.4.0-6.el10.noarch 137/146 Installing : python-srpm-macros-3.12-10.el10.noarch 138/146 Installing : redhat-rpm-config-293-1.el10.noarch 139/146 Installing : rpm-build-4.19.1.1-20.el10.x86_64 140/146 Installing : pyproject-srpm-macros-1.16.2-1.el10.noarch 141/146 Installing : util-linux-2.40.2-13.el10.x86_64 142/146 Running scriptlet: util-linux-2.40.2-13.el10.x86_64 142/146 Installing : authselect-1.5.0-8.el10.x86_64 143/146 Installing : which-2.21-44.el10_0.x86_64 144/146 Installing : info-7.1-6.el10.x86_64 145/146 Installing : epel-rpm-macros-10-6.el10_1.noarch 146/146 Running scriptlet: filesystem-3.18-17.el10.x86_64 146/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 146/146 Running scriptlet: authselect-libs-1.5.0-8.el10.x86_64 146/146 Running scriptlet: rpm-4.19.1.1-20.el10.x86_64 146/146 Running scriptlet: epel-rpm-macros-10-6.el10_1.noarch 146/146 Installed products updated. Installed: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-58.el10_1.2.x86_64 binutils-gold-2.41-58.el10_1.2.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2025.2.80_v9.0.305-102.el10_1.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250905-2.gitc7eb7b2.el10_1.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-29.el10.x86_64 debugedit-5.1-8.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.16-1.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-6.el10_1.noarch file-5.45-8.el10.x86_64 file-libs-5.45-8.el10.x86_64 filesystem-3.18-17.el10.x86_64 findutils-1:4.10.0-5.el10.x86_64 fonts-srpm-macros-1:2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch fpc-srpm-macros-1.3-7.el10_1.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-16.3-2.el10.x86_64 gdbm-1:1.23-12.el10_0.x86_64 gdbm-libs-1:1.23-12.el10_0.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-58.el10_1.2.x86_64 glibc-common-2.39-58.el10_1.2.x86_64 glibc-gconv-extra-2.39-58.el10_1.2.x86_64 glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 gmp-1:6.2.1-12.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-8.el10_0.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-4.el10_0.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-13.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-4.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-13.el10.x86_64 libffi-3.4.4-10.el10.x86_64 libgcc-14.3.1-2.1.el10.x86_64 libgomp-14.3.1-2.1.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-13.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.9-1.el10.x86_64 libsemanage-3.9-1.el10.x86_64 libsepol-3.9-1.el10.x86_64 libsmartcols-2.40.2-13.el10.x86_64 libssh-0.11.1-5.el10_1.x86_64 libssh-config-0.11.1-5.el10_1.noarch libstdc++-14.3.1-2.1.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-13.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-9.el10_0.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.9-1.el10.x86_64 openssl-fips-provider-3.0.7-8.el10.x86_64 openssl-fips-provider-so-3.0.7-8.el10.x86_64 openssl-libs-1:3.5.1-5.el10_1.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-8.el10.x86_64 pam-libs-1.6.1-8.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.9.1-1.el10.noarch readline-8.2-11.el10.x86_64 redhat-release-10.1-18.el10.x86_64 redhat-rpm-config-293-1.el10.noarch rpm-4.19.1.1-20.el10.x86_64 rpm-build-4.19.1.1-20.el10.x86_64 rpm-build-libs-4.19.1.1-20.el10.x86_64 rpm-libs-4.19.1.1-20.el10.x86_64 rpm-sequoia-1.9.0.3-1.el10_1.x86_64 rust-toolset-srpm-macros-1.88.0-1.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-7.el10.noarch shadow-utils-2:4.15.0-8.el10.x86_64 sqlite-libs-3.46.1-5.el10_1.x86_64 systemd-libs-257-13.el10.x86_64 tar-2:1.35-9.el10_1.x86_64 unzip-6.0-69.el10.x86_64 util-linux-2.40.2-13.el10.x86_64 util-linux-core-2.40.2-13.el10.x86_64 which-2.21-44.el10_0.x86_64 xz-1:5.6.2-4.el10_0.x86_64 xz-libs-1:5.6.2-4.el10_0.x86_64 zip-3.0-45.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Complete! Finish: installing minimal buildroot with dnf Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-58.el10_1.2.x86_64 binutils-gold-2.41-58.el10_1.2.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2025.2.80_v9.0.305-102.el10_1.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250905-2.gitc7eb7b2.el10_1.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-29.el10.x86_64 debugedit-5.1-8.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.16-1.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-6.el10_1.noarch file-5.45-8.el10.x86_64 file-libs-5.45-8.el10.x86_64 filesystem-3.18-17.el10.x86_64 findutils-4.10.0-5.el10.x86_64 fonts-srpm-macros-2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch fpc-srpm-macros-1.3-7.el10_1.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-16.3-2.el10.x86_64 gdbm-1.23-12.el10_0.x86_64 gdbm-libs-1.23-12.el10_0.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-58.el10_1.2.x86_64 glibc-common-2.39-58.el10_1.2.x86_64 glibc-gconv-extra-2.39-58.el10_1.2.x86_64 glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 gmp-6.2.1-12.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch gpg-pubkey-5a6340b3-6229229e gpg-pubkey-e37ed158-65785fa9 gpg-pubkey-fd431d51-4ae0493b grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-8.el10_0.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-4.el10_0.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-13.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-4.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-13.el10.x86_64 libffi-3.4.4-10.el10.x86_64 libgcc-14.3.1-2.1.el10.x86_64 libgomp-14.3.1-2.1.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-13.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.9-1.el10.x86_64 libsemanage-3.9-1.el10.x86_64 libsepol-3.9-1.el10.x86_64 libsmartcols-2.40.2-13.el10.x86_64 libssh-0.11.1-5.el10_1.x86_64 libssh-config-0.11.1-5.el10_1.noarch libstdc++-14.3.1-2.1.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-13.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-9.el10_0.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.9-1.el10.x86_64 openssl-fips-provider-3.0.7-8.el10.x86_64 openssl-fips-provider-so-3.0.7-8.el10.x86_64 openssl-libs-3.5.1-5.el10_1.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-8.el10.x86_64 pam-libs-1.6.1-8.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.9.1-1.el10.noarch readline-8.2-11.el10.x86_64 redhat-release-10.1-18.el10.x86_64 redhat-rpm-config-293-1.el10.noarch rpm-4.19.1.1-20.el10.x86_64 rpm-build-4.19.1.1-20.el10.x86_64 rpm-build-libs-4.19.1.1-20.el10.x86_64 rpm-libs-4.19.1.1-20.el10.x86_64 rpm-sequoia-1.9.0.3-1.el10_1.x86_64 rust-toolset-srpm-macros-1.88.0-1.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-7.el10.noarch shadow-utils-4.15.0-8.el10.x86_64 sqlite-libs-3.46.1-5.el10_1.x86_64 systemd-libs-257-13.el10.x86_64 tar-1.35-9.el10_1.x86_64 unzip-6.0-69.el10.x86_64 util-linux-2.40.2-13.el10.x86_64 util-linux-core-2.40.2-13.el10.x86_64 which-2.21-44.el10_0.x86_64 xz-5.6.2-4.el10_0.x86_64 xz-libs-5.6.2-4.el10_0.x86_64 zip-3.0-45.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1768780800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.rpm.log /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.librepo.log /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-8q3yy9c3/llama-cpp/llama-cpp.spec) Config(child) 0 minutes 23 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.el10.src.rpm) Config(rhel+epel-10-x86_64) Start: chroot init INFO: mounting tmpfs at /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management from host and used with --installroot: rpm-4.20.1-1.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 python3-dnf-4.24.0-1.fc42.noarch python3-dnf-plugins-core-4.10.1-1.fc42.noarch dnf5-5.2.17.0-1.fc42.x86_64 dnf5-plugins-5.2.17.0-1.fc42.x86_64 Finish: chroot init Start: build phase for llama-cpp-b6153-1.el10.src.rpm Start: build setup for llama-cpp-b6153-1.el10.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1768780800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 84 kB/s | 1.5 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 31 kB/s | 4.1 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 13 kB/s | 4.1 kB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 37 kB/s | 4.0 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 137 kB/s | 34 kB 00:00 Package curl-8.12.1-2.el10.x86_64 is already installed. Dependencies resolved. ============================================================================================= Package Arch Version Repo Size ============================================================================================= Installing: cmake x86_64 3.30.5-3.el10_0 appstream 12 M gcc-c++ x86_64 14.3.1-2.1.el10 appstream 15 M git x86_64 2.47.3-1.el10_0 appstream 51 k hipblas-devel x86_64 6.4.1-2.el10_1 epel 106 k langpacks-en noarch 4.1-3.el10 appstream 12 k libcurl-devel x86_64 8.12.1-2.el10 appstream 948 k openmpi x86_64 2:5.0.2-5.el10 appstream 2.1 M pthreadpool-devel x86_64 0.0^git20230829.4fe0e1e-7.el10_1 epel 15 k rocblas-devel x86_64 6.4.2-7.el10_1 epel 108 k rocm-comgr-devel x86_64 20-13.rocm7.1.1.el10 copr_base 33 k rocm-hip-devel x86_64 6.4.2-1.el10_1 epel 233 k rocm-rpm-macros noarch 6.4.2-1.el10_1 epel 16 k rocm-runtime-devel x86_64 6.4.2-1.el10_1 epel 93 k wget x86_64 1.24.5-5.el10 appstream 807 k xxd x86_64 2:9.1.083-6.el10_1 appstream 31 k Installing dependencies: annobin-docs noarch 12.99-1.el10 appstream 88 k annobin-plugin-gcc x86_64 12.99-1.el10 appstream 996 k brotli x86_64 1.1.0-6.el10 appstream 22 k brotli-devel x86_64 1.1.0-6.el10 appstream 39 k cmake-data noarch 3.30.5-3.el10_0 appstream 2.5 M cmake-filesystem x86_64 3.30.5-3.el10_0 appstream 24 k cmake-rpm-macros noarch 3.30.5-3.el10_0 appstream 16 k cpp x86_64 14.3.1-2.1.el10 appstream 13 M dbus x86_64 1:1.14.10-5.el10 baseos 8.5 k dbus-broker x86_64 36-4.el10 baseos 174 k dbus-common noarch 1:1.14.10-5.el10 baseos 19 k default-fonts-core-sans noarch 4.1-3.el10 baseos 34 k emacs-filesystem noarch 1:29.4-12.el10 appstream 10 k environment-modules x86_64 5.3.1-8.el10 baseos 711 k expat x86_64 2.7.1-1.el10_1.3 baseos 119 k fonts-filesystem noarch 1:2.0.5-18.el10 baseos 9.9 k gcc x86_64 14.3.1-2.1.el10 appstream 38 M gcc-plugin-annobin x86_64 14.3.1-2.1.el10 appstream 68 k git-core x86_64 2.47.3-1.el10_0 appstream 4.9 M git-core-doc noarch 2.47.3-1.el10_0 appstream 3.1 M glibc-devel x86_64 2.39-58.el10_1.2 appstream 602 k gnutls x86_64 3.8.10-2.el10 baseos 1.5 M google-noto-fonts-common noarch 20240401-5.el10 baseos 19 k google-noto-sans-mono-vf-fonts noarch 20240401-5.el10 baseos 282 k google-noto-sans-vf-fonts noarch 20240401-5.el10 baseos 596 k google-noto-serif-vf-fonts noarch 20240401-5.el10 baseos 648 k groff-base x86_64 1.23.0-10.el10 baseos 1.1 M hipblas x86_64 6.4.1-2.el10_1 epel 163 k hipblas-common-devel noarch 6.4.0-1.el10_1 epel 13 k hipcc x86_64 20-13.rocm7.1.1.el10 copr_base 135 k hwdata noarch 0.379-10.6.el10 baseos 1.7 M hwloc-libs x86_64 2.11.1-3.el10 baseos 2.1 M jsoncpp x86_64 1.9.5-9.el10 appstream 104 k kernel-headers x86_64 6.12.0-124.28.1.el10_1 appstream 3.2 M keyutils-libs-devel x86_64 1.6.3-5.el10 appstream 65 k krb5-devel x86_64 1.21.3-8.el10_0 appstream 145 k langpacks-core-en noarch 4.1-3.el10 appstream 12 k langpacks-fonts-en noarch 4.1-3.el10 appstream 12 k less x86_64 661-3.el10 baseos 195 k libcbor x86_64 0.11.0-3.el10 baseos 36 k libcom_err-devel x86_64 1.47.1-4.el10 appstream 17 k libdrm x86_64 2.4.123-1.el10 appstream 167 k libedit x86_64 3.1-52.20230828cvs.el10 baseos 108 k libfabric x86_64 2.1.0-1.el10 appstream 662 k libfido2 x86_64 1.14.0-7.el10 baseos 101 k libgfortran x86_64 14.3.1-2.1.el10 baseos 828 k libibverbs x86_64 57.0-2.el10 baseos 457 k libidn2-devel x86_64 2.3.7-3.el10 appstream 75 k libkadm5 x86_64 1.21.3-8.el10_0 baseos 78 k libmpc x86_64 1.3.1-7.el10 appstream 74 k libnghttp2-devel x86_64 1.64.0-2.el10 appstream 58 k libnl3 x86_64 3.11.0-1.el10 baseos 365 k libpciaccess x86_64 0.16-16.el10 baseos 30 k libpipeline x86_64 1.5.7-7.el10 baseos 55 k libpsl-devel x86_64 0.21.5-6.el10 appstream 39 k libquadmath x86_64 14.3.1-2.1.el10 baseos 216 k librdmacm x86_64 57.0-2.el10 baseos 72 k libseccomp x86_64 2.5.6-1.el10 baseos 71 k libselinux-devel x86_64 3.9-1.el10 appstream 161 k libsepol-devel x86_64 3.9-1.el10 appstream 48 k libssh-devel x86_64 0.11.1-5.el10_1 appstream 42 k libstdc++-devel x86_64 14.3.1-2.1.el10 appstream 2.8 M libuv x86_64 1:1.51.0-1.el10_0 appstream 262 k libverto-devel x86_64 0.3.2-10.el10 appstream 16 k libxcrypt-devel x86_64 4.4.36-10.el10 appstream 33 k logrotate x86_64 3.22.0-4.el10 baseos 81 k make x86_64 1:4.4.1-9.el10 baseos 591 k man-db x86_64 2.12.0-10.el10 baseos 1.3 M mpdecimal x86_64 2.5.1-12.el10 baseos 92 k munge x86_64 0.5.15-10.el10 appstream 139 k munge-libs x86_64 0.5.15-10.el10 appstream 23 k ncurses x86_64 6.4-14.20240127.el10 baseos 427 k numactl-libs x86_64 2.0.19-2.el10 baseos 31 k ocl-icd x86_64 2.3.2-8.el10 baseos 69 k openssh x86_64 9.9p1-12.el10_1 baseos 351 k openssh-clients x86_64 9.9p1-12.el10_1 baseos 761 k openssl-devel x86_64 1:3.5.1-5.el10_1 appstream 4.2 M pcre2-devel x86_64 10.44-1.el10.3 appstream 536 k pcre2-utf16 x86_64 10.44-1.el10.3 appstream 228 k pcre2-utf32 x86_64 10.44-1.el10.3 appstream 216 k perl-AutoLoader noarch 5.74-512.2.el10_0 appstream 22 k perl-B x86_64 1.89-512.2.el10_0 appstream 178 k perl-Carp noarch 1.54-511.el10 appstream 31 k perl-Class-Struct noarch 0.68-512.2.el10_0 appstream 23 k perl-Data-Dumper x86_64 2.189-512.el10 appstream 60 k perl-Digest noarch 1.20-511.el10 appstream 28 k perl-Digest-MD5 x86_64 2.59-6.el10 appstream 40 k perl-DynaLoader x86_64 1.56-512.2.el10_0 appstream 27 k perl-Encode x86_64 4:3.21-511.el10 appstream 1.1 M perl-Errno x86_64 1.38-512.2.el10_0 appstream 16 k perl-Error noarch 1:0.17029-18.el10 appstream 46 k perl-Exporter noarch 5.78-511.el10 appstream 34 k perl-Fcntl x86_64 1.18-512.2.el10_0 appstream 31 k perl-File-Basename noarch 2.86-512.2.el10_0 appstream 18 k perl-File-Find noarch 1.44-512.2.el10_0 appstream 26 k perl-File-Path noarch 2.18-511.el10 appstream 37 k perl-File-Temp noarch 1:0.231.100-512.el10 appstream 63 k perl-File-stat noarch 1.14-512.2.el10_0 appstream 18 k perl-FileHandle noarch 2.05-512.2.el10_0 appstream 16 k perl-Getopt-Long noarch 1:2.58-3.el10 appstream 68 k perl-Getopt-Std noarch 1.14-512.2.el10_0 appstream 16 k perl-Git noarch 2.47.3-1.el10_0 appstream 38 k perl-HTTP-Tiny noarch 0.088-512.el10 appstream 60 k perl-IO x86_64 1.55-512.2.el10_0 appstream 81 k perl-IO-Socket-IP noarch 0.42-512.el10 appstream 45 k perl-IO-Socket-SSL noarch 2.085-3.el10 appstream 231 k perl-IPC-Open3 noarch 1.22-512.2.el10_0 appstream 23 k perl-MIME-Base64 x86_64 3.16-511.el10 appstream 34 k perl-Mozilla-CA noarch 20231213-5.el10 appstream 16 k perl-Net-SSLeay x86_64 1.94-8.el10 appstream 380 k perl-POSIX x86_64 2.20-512.2.el10_0 appstream 97 k perl-PathTools x86_64 3.91-512.el10 appstream 89 k perl-Pod-Escapes noarch 1:1.07-511.el10 appstream 22 k perl-Pod-Perldoc noarch 3.28.01-512.el10 appstream 88 k perl-Pod-Simple noarch 1:3.45-511.el10 appstream 223 k perl-Pod-Usage noarch 4:2.03-511.el10 appstream 43 k perl-Scalar-List-Utils x86_64 5:1.63-511.el10 appstream 78 k perl-SelectSaver noarch 1.02-512.2.el10_0 appstream 12 k perl-Socket x86_64 4:2.038-511.el10 appstream 59 k perl-Storable x86_64 1:3.32-511.el10 appstream 102 k perl-Symbol noarch 1.09-512.2.el10_0 appstream 15 k perl-Term-ANSIColor noarch 5.01-512.el10 appstream 51 k perl-Term-Cap noarch 1.18-511.el10 appstream 25 k perl-TermReadKey x86_64 2.38-24.el10 appstream 40 k perl-Text-ParseWords noarch 3.31-511.el10 appstream 19 k perl-Text-Tabs+Wrap noarch 2024.001-511.el10 appstream 24 k perl-Time-Local noarch 2:1.350-511.el10 appstream 38 k perl-URI noarch 5.27-3.el10 appstream 138 k perl-base noarch 2.27-512.2.el10_0 appstream 17 k perl-constant noarch 1.33-512.el10 appstream 25 k perl-if noarch 0.61.000-512.2.el10_0 appstream 15 k perl-interpreter x86_64 4:5.40.2-512.2.el10_0 appstream 73 k perl-lib x86_64 0.65-512.2.el10_0 appstream 16 k perl-libnet noarch 3.15-512.el10 appstream 131 k perl-libs x86_64 4:5.40.2-512.2.el10_0 appstream 2.4 M perl-locale noarch 1.12-512.2.el10_0 appstream 14 k perl-mro x86_64 1.29-512.2.el10_0 appstream 31 k perl-overload noarch 1.37-512.2.el10_0 appstream 46 k perl-overloading noarch 0.02-512.2.el10_0 appstream 14 k perl-parent noarch 1:0.241-512.el10 appstream 17 k perl-podlators noarch 1:5.01-511.el10 appstream 128 k perl-vars noarch 1.05-512.2.el10_0 appstream 14 k pmix x86_64 4.2.8-8.el10 appstream 746 k procps-ng x86_64 4.0.4-8.el10 baseos 374 k prrte x86_64 3.0.2-9.el10 appstream 86 k prrte-libs x86_64 3.0.2-9.el10 appstream 546 k pthreadpool x86_64 0.0^git20230829.4fe0e1e-7.el10_1 epel 48 k publicsuffix-list noarch 20240107-5.el10 appstream 90 k python3 x86_64 3.12.12-1.el10_1 baseos 28 k python3-libs x86_64 3.12.12-1.el10_1 baseos 9.4 M python3-pip-wheel noarch 23.3.2-7.el10 baseos 1.5 M redhat-mono-vf-fonts noarch 4.1.0-1.el10 baseos 346 k redhat-text-vf-fonts noarch 4.1.0-1.el10 baseos 357 k rocblas x86_64 6.4.2-7.el10_1 epel 158 M rocm-clang x86_64 20-13.rocm7.1.1.el10 copr_base 16 M rocm-clang-devel x86_64 20-13.rocm7.1.1.el10 copr_base 2.5 M rocm-clang-libs x86_64 20-13.rocm7.1.1.el10 copr_base 23 M rocm-clang-runtime-devel x86_64 20-13.rocm7.1.1.el10 copr_base 530 k rocm-comgr x86_64 20-13.rocm7.1.1.el10 copr_base 33 M rocm-device-libs x86_64 20-13.rocm7.1.1.el10 copr_base 495 k rocm-hip x86_64 6.4.2-1.el10_1 epel 9.4 M rocm-libc++ x86_64 20-13.rocm7.1.1.el10 copr_base 379 k rocm-libc++-devel x86_64 20-13.rocm7.1.1.el10 copr_base 1.2 M rocm-lld x86_64 20-13.rocm7.1.1.el10 copr_base 1.6 M rocm-llvm x86_64 20-13.rocm7.1.1.el10 copr_base 14 M rocm-llvm-devel x86_64 20-13.rocm7.1.1.el10 copr_base 4.0 M rocm-llvm-filesystem x86_64 20-13.rocm7.1.1.el10 copr_base 26 k rocm-llvm-libs x86_64 20-13.rocm7.1.1.el10 copr_base 21 M rocm-llvm-static x86_64 20-13.rocm7.1.1.el10 copr_base 31 M rocm-runtime x86_64 6.4.2-1.el10_1 epel 654 k rocsolver x86_64 6.4.2-2.el10_1 epel 118 M systemd x86_64 257-13.el10 baseos 5.7 M systemd-pam x86_64 257-13.el10 baseos 306 k systemd-rpm-macros noarch 257-13.el10 baseos 27 k tcl x86_64 1:8.6.13-4.el10 baseos 1.1 M torque-libs x86_64 6.1.3-16.el10 appstream 190 k tzdata noarch 2025c-1.el10 baseos 904 k ucx x86_64 1.18.1-1.el10 appstream 864 k vim-filesystem noarch 2:9.1.083-6.el10_1 baseos 16 k zlib-ng-compat-devel x86_64 2.2.3-2.el10 appstream 39 k Transaction Summary ============================================================================================= Install 195 Packages Total download size: 588 M Installed size: 1.6 G Downloading Packages: (1/195): hipcc-20-13.rocm7.1.1.el10.x86_64.rpm 209 kB/s | 135 kB 00:00 (2/195): rocm-clang-devel-20-13.rocm7.1.1.el10. 3.4 MB/s | 2.5 MB 00:00 (3/195): rocm-clang-20-13.rocm7.1.1.el10.x86_64 20 MB/s | 16 MB 00:00 (4/195): rocm-clang-runtime-devel-20-13.rocm7.1 2.7 MB/s | 530 kB 00:00 (5/195): rocm-comgr-devel-20-13.rocm7.1.1.el10. 2.0 MB/s | 33 kB 00:00 (6/195): rocm-device-libs-20-13.rocm7.1.1.el10. 5.0 MB/s | 495 kB 00:00 (7/195): rocm-comgr-20-13.rocm7.1.1.el10.x86_64 110 MB/s | 33 MB 00:00 (8/195): rocm-libc++-20-13.rocm7.1.1.el10.x86_6 3.7 MB/s | 379 kB 00:00 (9/195): rocm-libc++-devel-20-13.rocm7.1.1.el10 2.4 MB/s | 1.2 MB 00:00 (10/195): rocm-clang-libs-20-13.rocm7.1.1.el10. 22 MB/s | 23 MB 00:01 (11/195): rocm-llvm-20-13.rocm7.1.1.el10.x86_64 47 MB/s | 14 MB 00:00 (12/195): rocm-lld-20-13.rocm7.1.1.el10.x86_64. 2.1 MB/s | 1.6 MB 00:00 (13/195): rocm-llvm-devel-20-13.rocm7.1.1.el10. 19 MB/s | 4.0 MB 00:00 (14/195): rocm-llvm-filesystem-20-13.rocm7.1.1. 573 kB/s | 26 kB 00:00 (15/195): dbus-1.14.10-5.el10.x86_64.rpm 76 kB/s | 8.5 kB 00:00 (16/195): dbus-common-1.14.10-5.el10.noarch.rpm 288 kB/s | 19 kB 00:00 (17/195): rocm-llvm-libs-20-13.rocm7.1.1.el10.x 89 MB/s | 21 MB 00:00 (18/195): rocm-llvm-static-20-13.rocm7.1.1.el10 100 MB/s | 31 MB 00:00 (19/195): default-fonts-core-sans-4.1-3.el10.no 205 kB/s | 34 kB 00:00 (20/195): google-noto-fonts-common-20240401-5.e 600 kB/s | 19 kB 00:00 (21/195): environment-modules-5.3.1-8.el10.x86_ 3.4 MB/s | 711 kB 00:00 (22/195): fonts-filesystem-2.0.5-18.el10.noarch 59 kB/s | 9.9 kB 00:00 (23/195): google-noto-sans-mono-vf-fonts-202404 3.4 MB/s | 282 kB 00:00 (24/195): google-noto-sans-vf-fonts-20240401-5. 5.5 MB/s | 596 kB 00:00 (25/195): groff-base-1.23.0-10.el10.x86_64.rpm 6.6 MB/s | 1.1 MB 00:00 (26/195): google-noto-serif-vf-fonts-20240401-5 3.4 MB/s | 648 kB 00:00 (27/195): libcbor-0.11.0-3.el10.x86_64.rpm 903 kB/s | 36 kB 00:00 (28/195): less-661-3.el10.x86_64.rpm 3.3 MB/s | 195 kB 00:00 (29/195): hwloc-libs-2.11.1-3.el10.x86_64.rpm 12 MB/s | 2.1 MB 00:00 (30/195): libfido2-1.14.0-7.el10.x86_64.rpm 1.8 MB/s | 101 kB 00:00 (31/195): libedit-3.1-52.20230828cvs.el10.x86_6 1.3 MB/s | 108 kB 00:00 (32/195): libnl3-3.11.0-1.el10.x86_64.rpm 4.9 MB/s | 365 kB 00:00 (33/195): libpciaccess-0.16-16.el10.x86_64.rpm 910 kB/s | 30 kB 00:00 (34/195): make-4.4.1-9.el10.x86_64.rpm 13 MB/s | 591 kB 00:00 (35/195): mpdecimal-2.5.1-12.el10.x86_64.rpm 1.2 MB/s | 92 kB 00:00 (36/195): ncurses-6.4-14.20240127.el10.x86_64.r 6.3 MB/s | 427 kB 00:00 (37/195): logrotate-3.22.0-4.el10.x86_64.rpm 341 kB/s | 81 kB 00:00 (38/195): ocl-icd-2.3.2-8.el10.x86_64.rpm 694 kB/s | 69 kB 00:00 (39/195): libpipeline-1.5.7-7.el10.x86_64.rpm 148 kB/s | 55 kB 00:00 (40/195): python3-pip-wheel-23.3.2-7.el10.noarc 9.8 MB/s | 1.5 MB 00:00 (41/195): libkadm5-1.21.3-8.el10_0.x86_64.rpm 1.2 MB/s | 78 kB 00:00 (42/195): dbus-broker-36-4.el10.x86_64.rpm 3.5 MB/s | 174 kB 00:00 (43/195): tcl-8.6.13-4.el10.x86_64.rpm 7.4 MB/s | 1.1 MB 00:00 (44/195): hwdata-0.379-10.6.el10.noarch.rpm 36 MB/s | 1.7 MB 00:00 (45/195): libgfortran-14.3.1-2.1.el10.x86_64.rp 12 MB/s | 828 kB 00:00 (46/195): gnutls-3.8.10-2.el10.x86_64.rpm 18 MB/s | 1.5 MB 00:00 (47/195): libquadmath-14.3.1-2.1.el10.x86_64.rp 4.7 MB/s | 216 kB 00:00 (48/195): libibverbs-57.0-2.el10.x86_64.rpm 3.8 MB/s | 457 kB 00:00 (49/195): librdmacm-57.0-2.el10.x86_64.rpm 782 kB/s | 72 kB 00:00 (50/195): libseccomp-2.5.6-1.el10.x86_64.rpm 1.2 MB/s | 71 kB 00:00 (51/195): man-db-2.12.0-10.el10.x86_64.rpm 31 MB/s | 1.3 MB 00:00 (52/195): procps-ng-4.0.4-8.el10.x86_64.rpm 13 MB/s | 374 kB 00:00 (53/195): numactl-libs-2.0.19-2.el10.x86_64.rpm 650 kB/s | 31 kB 00:00 (54/195): redhat-mono-vf-fonts-4.1.0-1.el10.noa 5.6 MB/s | 346 kB 00:00 (55/195): redhat-text-vf-fonts-4.1.0-1.el10.noa 5.5 MB/s | 357 kB 00:00 (56/195): systemd-pam-257-13.el10.x86_64.rpm 6.2 MB/s | 306 kB 00:00 (57/195): systemd-257-13.el10.x86_64.rpm 50 MB/s | 5.7 MB 00:00 (58/195): vim-filesystem-9.1.083-6.el10_1.noarc 504 kB/s | 16 kB 00:00 (59/195): systemd-rpm-macros-257-13.el10.noarch 257 kB/s | 27 kB 00:00 (60/195): openssh-9.9p1-12.el10_1.x86_64.rpm 11 MB/s | 351 kB 00:00 (61/195): expat-2.7.1-1.el10_1.3.x86_64.rpm 2.1 MB/s | 119 kB 00:00 (62/195): tzdata-2025c-1.el10.noarch.rpm 19 MB/s | 904 kB 00:00 (63/195): python3-3.12.12-1.el10_1.x86_64.rpm 820 kB/s | 28 kB 00:00 (64/195): openssh-clients-9.9p1-12.el10_1.x86_6 8.3 MB/s | 761 kB 00:00 (65/195): brotli-1.1.0-6.el10.x86_64.rpm 243 kB/s | 22 kB 00:00 (66/195): python3-libs-3.12.12-1.el10_1.x86_64. 53 MB/s | 9.4 MB 00:00 (67/195): pcre2-utf32-10.44-1.el10.3.x86_64.rpm 2.1 MB/s | 216 kB 00:00 (68/195): perl-Data-Dumper-2.189-512.el10.x86_6 719 kB/s | 60 kB 00:00 (69/195): perl-Exporter-5.78-511.el10.noarch.rp 266 kB/s | 34 kB 00:00 (70/195): perl-Error-0.17029-18.el10.noarch.rpm 238 kB/s | 46 kB 00:00 (71/195): libverto-devel-0.3.2-10.el10.x86_64.r 42 kB/s | 16 kB 00:00 (72/195): perl-Mozilla-CA-20231213-5.el10.noarc 340 kB/s | 16 kB 00:00 (73/195): perl-HTTP-Tiny-0.088-512.el10.noarch. 832 kB/s | 60 kB 00:00 (74/195): perl-Pod-Simple-3.45-511.el10.noarch. 2.8 MB/s | 223 kB 00:00 (75/195): perl-Term-ANSIColor-5.01-512.el10.noa 784 kB/s | 51 kB 00:00 (76/195): perl-Term-Cap-1.18-511.el10.noarch.rp 180 kB/s | 25 kB 00:00 (77/195): perl-constant-1.33-512.el10.noarch.rp 175 kB/s | 25 kB 00:00 (78/195): perl-Scalar-List-Utils-1.63-511.el10. 333 kB/s | 78 kB 00:00 (79/195): wget-1.24.5-5.el10.x86_64.rpm 20 MB/s | 807 kB 00:00 (80/195): brotli-devel-1.1.0-6.el10.x86_64.rpm 585 kB/s | 39 kB 00:00 (81/195): libnghttp2-devel-1.64.0-2.el10.x86_64 632 kB/s | 58 kB 00:00 (82/195): libpsl-devel-0.21.5-6.el10.x86_64.rpm 711 kB/s | 39 kB 00:00 (83/195): libidn2-devel-2.3.7-3.el10.x86_64.rpm 373 kB/s | 75 kB 00:00 (84/195): perl-Carp-1.54-511.el10.noarch.rpm 1.0 MB/s | 31 kB 00:00 (85/195): pcre2-utf16-10.44-1.el10.3.x86_64.rpm 1.7 MB/s | 228 kB 00:00 (86/195): openmpi-5.0.2-5.el10.x86_64.rpm 12 MB/s | 2.1 MB 00:00 (87/195): perl-Digest-1.20-511.el10.noarch.rpm 506 kB/s | 28 kB 00:00 (88/195): perl-File-Temp-0.231.100-512.el10.noa 1.0 MB/s | 63 kB 00:00 (89/195): perl-IO-Socket-IP-0.42-512.el10.noarc 728 kB/s | 45 kB 00:00 (90/195): perl-MIME-Base64-3.16-511.el10.x86_64 823 kB/s | 34 kB 00:00 (91/195): perl-Pod-Usage-2.03-511.el10.noarch.r 849 kB/s | 43 kB 00:00 (92/195): perl-Getopt-Long-2.58-3.el10.noarch.r 280 kB/s | 68 kB 00:00 (93/195): perl-Socket-2.038-511.el10.x86_64.rpm 422 kB/s | 59 kB 00:00 (94/195): perl-TermReadKey-2.38-24.el10.x86_64. 627 kB/s | 40 kB 00:00 (95/195): perl-Pod-Escapes-1.07-511.el10.noarch 91 kB/s | 22 kB 00:00 (96/195): keyutils-libs-devel-1.6.3-5.el10.x86_ 984 kB/s | 65 kB 00:00 (97/195): perl-libnet-3.15-512.el10.noarch.rpm 1.4 MB/s | 131 kB 00:00 (98/195): perl-Time-Local-1.350-511.el10.noarch 172 kB/s | 38 kB 00:00 (99/195): perl-Digest-MD5-2.59-6.el10.x86_64.rp 352 kB/s | 40 kB 00:00 (100/195): langpacks-en-4.1-3.el10.noarch.rpm 79 kB/s | 12 kB 00:00 (101/195): perl-PathTools-3.91-512.el10.x86_64. 1.3 MB/s | 89 kB 00:00 (102/195): perl-Encode-3.21-511.el10.x86_64.rpm 10 MB/s | 1.1 MB 00:00 (103/195): perl-Storable-3.32-511.el10.x86_64.r 1.3 MB/s | 102 kB 00:00 (104/195): perl-URI-5.27-3.el10.noarch.rpm 2.1 MB/s | 138 kB 00:00 (105/195): perl-Text-Tabs+Wrap-2024.001-511.el1 143 kB/s | 24 kB 00:00 (106/195): perl-parent-0.241-512.el10.noarch.rp 95 kB/s | 17 kB 00:00 (107/195): pmix-4.2.8-8.el10.x86_64.rpm 9.2 MB/s | 746 kB 00:00 (108/195): perl-podlators-5.01-511.el10.noarch. 559 kB/s | 128 kB 00:00 (109/195): jsoncpp-1.9.5-9.el10.x86_64.rpm 2.1 MB/s | 104 kB 00:00 (110/195): torque-libs-6.1.3-16.el10.x86_64.rpm 1.8 MB/s | 190 kB 00:00 (111/195): langpacks-fonts-en-4.1-3.el10.noarch 175 kB/s | 12 kB 00:00 (112/195): libdrm-2.4.123-1.el10.x86_64.rpm 1.6 MB/s | 167 kB 00:00 (113/195): munge-libs-0.5.15-10.el10.x86_64.rpm 229 kB/s | 23 kB 00:00 (114/195): pcre2-devel-10.44-1.el10.3.x86_64.rp 8.4 MB/s | 536 kB 00:00 (115/195): perl-IO-Socket-SSL-2.085-3.el10.noar 7.5 MB/s | 231 kB 00:00 (116/195): perl-File-Path-2.18-511.el10.noarch. 769 kB/s | 37 kB 00:00 (117/195): perl-Text-ParseWords-3.31-511.el10.n 314 kB/s | 19 kB 00:00 (118/195): langpacks-core-en-4.1-3.el10.noarch. 76 kB/s | 12 kB 00:00 (119/195): prrte-libs-3.0.2-9.el10.x86_64.rpm 935 kB/s | 546 kB 00:00 (120/195): libmpc-1.3.1-7.el10.x86_64.rpm 1.4 MB/s | 74 kB 00:00 (121/195): libxcrypt-devel-4.4.36-10.el10.x86_6 877 kB/s | 33 kB 00:00 (122/195): perl-Pod-Perldoc-3.28.01-512.el10.no 1.7 MB/s | 88 kB 00:00 (123/195): munge-0.5.15-10.el10.x86_64.rpm 1.9 MB/s | 139 kB 00:00 (124/195): publicsuffix-list-20240107-5.el10.no 2.1 MB/s | 90 kB 00:00 (125/195): cmake-data-3.30.5-3.el10_0.noarch.rp 23 MB/s | 2.5 MB 00:00 (126/195): prrte-3.0.2-9.el10.x86_64.rpm 176 kB/s | 86 kB 00:00 (127/195): cmake-filesystem-3.30.5-3.el10_0.x86 693 kB/s | 24 kB 00:00 (128/195): cmake-rpm-macros-3.30.5-3.el10_0.noa 365 kB/s | 16 kB 00:00 (129/195): cmake-3.30.5-3.el10_0.x86_64.rpm 57 MB/s | 12 MB 00:00 (130/195): git-core-doc-2.47.3-1.el10_0.noarch. 75 MB/s | 3.1 MB 00:00 (131/195): git-core-2.47.3-1.el10_0.x86_64.rpm 61 MB/s | 4.9 MB 00:00 (132/195): git-2.47.3-1.el10_0.x86_64.rpm 460 kB/s | 51 kB 00:00 (133/195): perl-B-1.89-512.2.el10_0.x86_64.rpm 4.0 MB/s | 178 kB 00:00 (134/195): perl-AutoLoader-5.74-512.2.el10_0.no 473 kB/s | 22 kB 00:00 (135/195): krb5-devel-1.21.3-8.el10_0.x86_64.rp 1.6 MB/s | 145 kB 00:00 (136/195): perl-Class-Struct-0.68-512.2.el10_0. 728 kB/s | 23 kB 00:00 (137/195): perl-DynaLoader-1.56-512.2.el10_0.x8 657 kB/s | 27 kB 00:00 (138/195): perl-Fcntl-1.18-512.2.el10_0.x86_64. 1.0 MB/s | 31 kB 00:00 (139/195): perl-File-Basename-2.86-512.2.el10_0 520 kB/s | 18 kB 00:00 (140/195): perl-Errno-1.38-512.2.el10_0.x86_64. 241 kB/s | 16 kB 00:00 (141/195): perl-File-Find-1.44-512.2.el10_0.noa 848 kB/s | 26 kB 00:00 (142/195): perl-File-stat-1.14-512.2.el10_0.noa 620 kB/s | 18 kB 00:00 (143/195): perl-FileHandle-2.05-512.2.el10_0.no 405 kB/s | 16 kB 00:00 (144/195): perl-Getopt-Std-1.14-512.2.el10_0.no 461 kB/s | 16 kB 00:00 (145/195): perl-IO-1.55-512.2.el10_0.x86_64.rpm 1.4 MB/s | 81 kB 00:00 (146/195): perl-Git-2.47.3-1.el10_0.noarch.rpm 495 kB/s | 38 kB 00:00 (147/195): perl-IPC-Open3-1.22-512.2.el10_0.noa 247 kB/s | 23 kB 00:00 (148/195): perl-POSIX-2.20-512.2.el10_0.x86_64. 1.1 MB/s | 97 kB 00:00 (149/195): perl-Symbol-1.09-512.2.el10_0.noarch 241 kB/s | 15 kB 00:00 (150/195): perl-base-2.27-512.2.el10_0.noarch.r 541 kB/s | 17 kB 00:00 (151/195): perl-interpreter-5.40.2-512.2.el10_0 833 kB/s | 73 kB 00:00 (152/195): perl-SelectSaver-1.02-512.2.el10_0.n 57 kB/s | 12 kB 00:00 (153/195): perl-if-0.61.000-512.2.el10_0.noarch 121 kB/s | 15 kB 00:00 (154/195): perl-libs-5.40.2-512.2.el10_0.x86_64 59 MB/s | 2.4 MB 00:00 (155/195): perl-lib-0.65-512.2.el10_0.x86_64.rp 223 kB/s | 16 kB 00:00 (156/195): perl-locale-1.12-512.2.el10_0.noarch 174 kB/s | 14 kB 00:00 (157/195): perl-mro-1.29-512.2.el10_0.x86_64.rp 420 kB/s | 31 kB 00:00 (158/195): perl-overload-1.37-512.2.el10_0.noar 618 kB/s | 46 kB 00:00 (159/195): perl-overloading-0.02-512.2.el10_0.n 225 kB/s | 14 kB 00:00 (160/195): perl-vars-1.05-512.2.el10_0.noarch.r 155 kB/s | 14 kB 00:00 (161/195): annobin-plugin-gcc-12.99-1.el10.x86_ 5.3 MB/s | 996 kB 00:00 (162/195): cpp-14.3.1-2.1.el10.x86_64.rpm 70 MB/s | 13 MB 00:00 (163/195): emacs-filesystem-29.4-12.el10.noarch 80 kB/s | 10 kB 00:00 (164/195): libcom_err-devel-1.47.1-4.el10.x86_6 101 kB/s | 17 kB 00:00 (165/195): libstdc++-devel-14.3.1-2.1.el10.x86_ 37 MB/s | 2.8 MB 00:00 (166/195): libuv-1.51.0-1.el10_0.x86_64.rpm 6.7 MB/s | 262 kB 00:00 (167/195): perl-Net-SSLeay-1.94-8.el10.x86_64.r 3.7 MB/s | 380 kB 00:00 (168/195): gcc-14.3.1-2.1.el10.x86_64.rpm 81 MB/s | 38 MB 00:00 (169/195): libfabric-2.1.0-1.el10.x86_64.rpm 1.4 MB/s | 662 kB 00:00 (170/195): zlib-ng-compat-devel-2.2.3-2.el10.x8 280 kB/s | 39 kB 00:00 (171/195): annobin-docs-12.99-1.el10.noarch.rpm 1.1 MB/s | 88 kB 00:00 (172/195): gcc-plugin-annobin-14.3.1-2.1.el10.x 920 kB/s | 68 kB 00:00 (173/195): libcurl-devel-8.12.1-2.el10.x86_64.r 12 MB/s | 948 kB 00:00 (174/195): libsepol-devel-3.9-1.el10.x86_64.rpm 1.1 MB/s | 48 kB 00:00 (175/195): libselinux-devel-3.9-1.el10.x86_64.r 2.2 MB/s | 161 kB 00:00 (176/195): glibc-devel-2.39-58.el10_1.2.x86_64. 9.9 MB/s | 602 kB 00:00 (177/195): xxd-9.1.083-6.el10_1.x86_64.rpm 566 kB/s | 31 kB 00:00 (178/195): gcc-c++-14.3.1-2.1.el10.x86_64.rpm 38 MB/s | 15 MB 00:00 (179/195): ucx-1.18.1-1.el10.x86_64.rpm 3.9 MB/s | 864 kB 00:00 (180/195): kernel-headers-6.12.0-124.28.1.el10_ 39 MB/s | 3.2 MB 00:00 (181/195): openssl-devel-3.5.1-5.el10_1.x86_64. 27 MB/s | 4.2 MB 00:00 (182/195): libssh-devel-0.11.1-5.el10_1.x86_64. 171 kB/s | 42 kB 00:00 (183/195): hipblas-common-devel-6.4.0-1.el10_1. 473 kB/s | 13 kB 00:00 (184/195): hipblas-6.4.1-2.el10_1.x86_64.rpm 1.6 MB/s | 163 kB 00:00 (185/195): pthreadpool-0.0^git20230829.4fe0e1e- 1.2 MB/s | 48 kB 00:00 (186/195): hipblas-devel-6.4.1-2.el10_1.x86_64. 1.9 MB/s | 106 kB 00:00 (187/195): rocblas-devel-6.4.2-7.el10_1.x86_64. 3.6 MB/s | 108 kB 00:00 (188/195): pthreadpool-devel-0.0^git20230829.4f 240 kB/s | 15 kB 00:00 (189/195): rocm-hip-devel-6.4.2-1.el10_1.x86_64 4.0 MB/s | 233 kB 00:00 (190/195): rocm-rpm-macros-6.4.2-1.el10_1.noarc 528 kB/s | 16 kB 00:00 (191/195): rocm-runtime-6.4.2-1.el10_1.x86_64.r 13 MB/s | 654 kB 00:00 (192/195): rocm-hip-6.4.2-1.el10_1.x86_64.rpm 49 MB/s | 9.4 MB 00:00 (193/195): rocm-runtime-devel-6.4.2-1.el10_1.x8 1.1 MB/s | 93 kB 00:00 (194/195): rocblas-6.4.2-7.el10_1.x86_64.rpm 121 MB/s | 158 MB 00:01 (195/195): rocsolver-6.4.2-2.el10_1.x86_64.rpm 97 MB/s | 118 MB 00:01 -------------------------------------------------------------------------------- Total 61 MB/s | 588 MB 00:09 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : cmake-filesystem-3.30.5-3.el10_0.x86_64 1/195 Installing : fonts-filesystem-1:2.0.5-18.el10.noarch 2/195 Installing : expat-2.7.1-1.el10_1.3.x86_64 3/195 Installing : libmpc-1.3.1-7.el10.x86_64 4/195 Installing : munge-libs-0.5.15-10.el10.x86_64 5/195 Installing : libnl3-3.11.0-1.el10.x86_64 6/195 Installing : less-661-3.el10.x86_64 7/195 Installing : google-noto-fonts-common-20240401-5.el10.noarch 8/195 Installing : libibverbs-57.0-2.el10.x86_64 9/195 Installing : zlib-ng-compat-devel-2.2.3-2.el10.x86_64 10/195 Installing : vim-filesystem-2:9.1.083-6.el10_1.noarch 11/195 Installing : numactl-libs-2.0.19-2.el10.x86_64 12/195 Installing : make-1:4.4.1-9.el10.x86_64 13/195 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 14/195 Installing : groff-base-1.23.0-10.el10.x86_64 14/195 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 14/195 Installing : rocm-llvm-filesystem-20-13.rocm7.1.1.el10.x86_64 15/195 Installing : rocm-libc++-20-13.rocm7.1.1.el10.x86_64 16/195 Running scriptlet: rocm-libc++-20-13.rocm7.1.1.el10.x86_64 16/195 Installing : rocm-llvm-libs-20-13.rocm7.1.1.el10.x86_64 17/195 Running scriptlet: rocm-llvm-libs-20-13.rocm7.1.1.el10.x86_64 17/195 Installing : rocm-clang-libs-20-13.rocm7.1.1.el10.x86_64 18/195 Running scriptlet: rocm-clang-libs-20-13.rocm7.1.1.el10.x86_64 18/195 Installing : rocm-comgr-20-13.rocm7.1.1.el10.x86_64 19/195 Running scriptlet: rocm-comgr-20-13.rocm7.1.1.el10.x86_64 19/195 Installing : rocm-lld-20-13.rocm7.1.1.el10.x86_64 20/195 Installing : rocm-libc++-devel-20-13.rocm7.1.1.el10.x86_64 21/195 Installing : librdmacm-57.0-2.el10.x86_64 22/195 Installing : libfabric-2.1.0-1.el10.x86_64 23/195 Installing : google-noto-sans-mono-vf-fonts-20240401-5.el10.n 24/195 Installing : google-noto-sans-vf-fonts-20240401-5.el10.noarch 25/195 Installing : google-noto-serif-vf-fonts-20240401-5.el10.noarc 26/195 Installing : cpp-14.3.1-2.1.el10.x86_64 27/195 Installing : redhat-mono-vf-fonts-4.1.0-1.el10.noarch 28/195 Installing : redhat-text-vf-fonts-4.1.0-1.el10.noarch 29/195 Installing : default-fonts-core-sans-4.1-3.el10.noarch 30/195 Installing : langpacks-fonts-en-4.1-3.el10.noarch 31/195 Installing : langpacks-core-en-4.1-3.el10.noarch 32/195 Installing : libssh-devel-0.11.1-5.el10_1.x86_64 33/195 Installing : hipblas-common-devel-6.4.0-1.el10_1.noarch 34/195 Installing : pthreadpool-0.0^git20230829.4fe0e1e-7.el10_1.x86 35/195 Installing : kernel-headers-6.12.0-124.28.1.el10_1.x86_64 36/195 Installing : glibc-devel-2.39-58.el10_1.2.x86_64 37/195 Installing : libxcrypt-devel-4.4.36-10.el10.x86_64 38/195 Installing : gcc-14.3.1-2.1.el10.x86_64 39/195 Running scriptlet: gcc-14.3.1-2.1.el10.x86_64 39/195 Installing : openssl-devel-1:3.5.1-5.el10_1.x86_64 40/195 Installing : ucx-1.18.1-1.el10.x86_64 41/195 Installing : libsepol-devel-3.9-1.el10.x86_64 42/195 Installing : annobin-docs-12.99-1.el10.noarch 43/195 Installing : libuv-1:1.51.0-1.el10_0.x86_64 44/195 Installing : libstdc++-devel-14.3.1-2.1.el10.x86_64 45/195 Installing : libcom_err-devel-1.47.1-4.el10.x86_64 46/195 Installing : emacs-filesystem-1:29.4-12.el10.noarch 47/195 Installing : publicsuffix-list-20240107-5.el10.noarch 48/195 Installing : libpsl-devel-0.21.5-6.el10.x86_64 49/195 Installing : jsoncpp-1.9.5-9.el10.x86_64 50/195 Installing : keyutils-libs-devel-1.6.3-5.el10.x86_64 51/195 Installing : pcre2-utf16-10.44-1.el10.3.x86_64 52/195 Installing : libnghttp2-devel-1.64.0-2.el10.x86_64 53/195 Installing : libidn2-devel-2.3.7-3.el10.x86_64 54/195 Installing : pcre2-utf32-10.44-1.el10.3.x86_64 55/195 Installing : pcre2-devel-10.44-1.el10.3.x86_64 56/195 Installing : libselinux-devel-3.9-1.el10.x86_64 57/195 Installing : libverto-devel-0.3.2-10.el10.x86_64 58/195 Installing : brotli-1.1.0-6.el10.x86_64 59/195 Installing : brotli-devel-1.1.0-6.el10.x86_64 60/195 Installing : tzdata-2025c-1.el10.noarch 61/195 Installing : openssh-9.9p1-12.el10_1.x86_64 62/195 Installing : procps-ng-4.0.4-8.el10.x86_64 63/195 Installing : libseccomp-2.5.6-1.el10.x86_64 64/195 Installing : libquadmath-14.3.1-2.1.el10.x86_64 65/195 Installing : libgfortran-14.3.1-2.1.el10.x86_64 66/195 Installing : hwdata-0.379-10.6.el10.noarch 67/195 Installing : libpciaccess-0.16-16.el10.x86_64 68/195 Installing : libdrm-2.4.123-1.el10.x86_64 69/195 Installing : rocm-runtime-6.4.2-1.el10_1.x86_64 70/195 Installing : rocm-runtime-devel-6.4.2-1.el10_1.x86_64 71/195 Installing : gnutls-3.8.10-2.el10.x86_64 72/195 Installing : libkadm5-1.21.3-8.el10_0.x86_64 73/195 Installing : krb5-devel-1.21.3-8.el10_0.x86_64 74/195 Installing : tcl-1:8.6.13-4.el10.x86_64 75/195 Installing : python3-pip-wheel-23.3.2-7.el10.noarch 76/195 Installing : ocl-icd-2.3.2-8.el10.x86_64 77/195 Installing : hwloc-libs-2.11.1-3.el10.x86_64 78/195 Installing : pmix-4.2.8-8.el10.x86_64 79/195 Installing : ncurses-6.4-14.20240127.el10.x86_64 80/195 Installing : perl-Digest-1.20-511.el10.noarch 81/195 Installing : perl-Digest-MD5-2.59-6.el10.x86_64 82/195 Installing : perl-B-1.89-512.2.el10_0.x86_64 83/195 Installing : perl-FileHandle-2.05-512.2.el10_0.noarch 84/195 Installing : perl-Data-Dumper-2.189-512.el10.x86_64 85/195 Installing : perl-libnet-3.15-512.el10.noarch 86/195 Installing : perl-AutoLoader-5.74-512.2.el10_0.noarch 87/195 Installing : perl-IO-Socket-IP-0.42-512.el10.noarch 88/195 Installing : perl-URI-5.27-3.el10.noarch 89/195 Installing : perl-Text-Tabs+Wrap-2024.001-511.el10.noarch 90/195 Installing : perl-Time-Local-2:1.350-511.el10.noarch 91/195 Installing : perl-Mozilla-CA-20231213-5.el10.noarch 92/195 Installing : perl-if-0.61.000-512.2.el10_0.noarch 93/195 Installing : perl-locale-1.12-512.2.el10_0.noarch 94/195 Installing : perl-Pod-Escapes-1:1.07-511.el10.noarch 95/195 Installing : perl-File-Path-2.18-511.el10.noarch 96/195 Installing : perl-IO-Socket-SSL-2.085-3.el10.noarch 97/195 Installing : perl-Net-SSLeay-1.94-8.el10.x86_64 98/195 Installing : perl-Term-ANSIColor-5.01-512.el10.noarch 99/195 Installing : perl-Class-Struct-0.68-512.2.el10_0.noarch 100/195 Installing : perl-POSIX-2.20-512.2.el10_0.x86_64 101/195 Installing : perl-IPC-Open3-1.22-512.2.el10_0.noarch 102/195 Installing : perl-Term-Cap-1.18-511.el10.noarch 103/195 Installing : perl-Pod-Simple-1:3.45-511.el10.noarch 104/195 Installing : perl-File-Temp-1:0.231.100-512.el10.noarch 105/195 Installing : perl-HTTP-Tiny-0.088-512.el10.noarch 106/195 Installing : perl-Socket-4:2.038-511.el10.x86_64 107/195 Installing : perl-SelectSaver-1.02-512.2.el10_0.noarch 108/195 Installing : perl-Symbol-1.09-512.2.el10_0.noarch 109/195 Installing : perl-File-stat-1.14-512.2.el10_0.noarch 110/195 Installing : perl-podlators-1:5.01-511.el10.noarch 111/195 Installing : perl-Pod-Perldoc-3.28.01-512.el10.noarch 112/195 Installing : perl-Text-ParseWords-3.31-511.el10.noarch 113/195 Installing : perl-Fcntl-1.18-512.2.el10_0.x86_64 114/195 Installing : perl-base-2.27-512.2.el10_0.noarch 115/195 Installing : perl-mro-1.29-512.2.el10_0.x86_64 116/195 Installing : perl-IO-1.55-512.2.el10_0.x86_64 117/195 Installing : perl-overloading-0.02-512.2.el10_0.noarch 118/195 Installing : perl-Pod-Usage-4:2.03-511.el10.noarch 119/195 Installing : perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 120/195 Installing : perl-constant-1.33-512.el10.noarch 121/195 Installing : perl-MIME-Base64-3.16-511.el10.x86_64 122/195 Installing : perl-parent-1:0.241-512.el10.noarch 123/195 Installing : perl-Errno-1.38-512.2.el10_0.x86_64 124/195 Installing : perl-File-Basename-2.86-512.2.el10_0.noarch 125/195 Installing : perl-Getopt-Std-1.14-512.2.el10_0.noarch 126/195 Installing : perl-Storable-1:3.32-511.el10.x86_64 127/195 Installing : perl-overload-1.37-512.2.el10_0.noarch 128/195 Installing : perl-vars-1.05-512.2.el10_0.noarch 129/195 Installing : perl-Getopt-Long-1:2.58-3.el10.noarch 130/195 Installing : perl-Exporter-5.78-511.el10.noarch 131/195 Installing : perl-Carp-1.54-511.el10.noarch 132/195 Installing : perl-PathTools-3.91-512.el10.x86_64 133/195 Installing : perl-DynaLoader-1.56-512.2.el10_0.x86_64 134/195 Installing : perl-Encode-4:3.21-511.el10.x86_64 135/195 Installing : perl-libs-4:5.40.2-512.2.el10_0.x86_64 136/195 Installing : perl-interpreter-4:5.40.2-512.2.el10_0.x86_64 137/195 Installing : perl-Error-1:0.17029-18.el10.noarch 138/195 Installing : perl-TermReadKey-2.38-24.el10.x86_64 139/195 Installing : perl-File-Find-1.44-512.2.el10_0.noarch 140/195 Installing : perl-lib-0.65-512.2.el10_0.x86_64 141/195 Installing : mpdecimal-2.5.1-12.el10.x86_64 142/195 Installing : python3-libs-3.12.12-1.el10_1.x86_64 143/195 Installing : python3-3.12.12-1.el10_1.x86_64 144/195 Installing : cmake-rpm-macros-3.30.5-3.el10_0.noarch 145/195 Installing : cmake-data-3.30.5-3.el10_0.noarch 146/195 Installing : cmake-3.30.5-3.el10_0.x86_64 147/195 Installing : rocm-llvm-20-13.rocm7.1.1.el10.x86_64 148/195 Installing : rocm-llvm-devel-20-13.rocm7.1.1.el10.x86_64 149/195 Running scriptlet: rocm-llvm-devel-20-13.rocm7.1.1.el10.x86_64 149/195 Installing : rocm-llvm-static-20-13.rocm7.1.1.el10.x86_64 150/195 Installing : libpipeline-1.5.7-7.el10.x86_64 151/195 Running scriptlet: man-db-2.12.0-10.el10.x86_64 152/195 Installing : man-db-2.12.0-10.el10.x86_64 152/195 Running scriptlet: man-db-2.12.0-10.el10.x86_64 152/195 Installing : environment-modules-5.3.1-8.el10.x86_64 153/195 Running scriptlet: environment-modules-5.3.1-8.el10.x86_64 153/195 Installing : libedit-3.1-52.20230828cvs.el10.x86_64 154/195 Installing : libcbor-0.11.0-3.el10.x86_64 155/195 Installing : libfido2-1.14.0-7.el10.x86_64 156/195 Installing : openssh-clients-9.9p1-12.el10_1.x86_64 157/195 Running scriptlet: openssh-clients-9.9p1-12.el10_1.x86_64 157/195 Installing : git-core-2.47.3-1.el10_0.x86_64 158/195 Installing : git-core-doc-2.47.3-1.el10_0.noarch 159/195 Installing : perl-Git-2.47.3-1.el10_0.noarch 160/195 Installing : git-2.47.3-1.el10_0.x86_64 161/195 Running scriptlet: dbus-common-1:1.14.10-5.el10.noarch 162/195 Creating group 'dbus' with GID 81. Creating user 'dbus' (System Message Bus) with UID 81 and GID 81. Installing : dbus-common-1:1.14.10-5.el10.noarch 162/195 Running scriptlet: dbus-common-1:1.14.10-5.el10.noarch 162/195 Running scriptlet: dbus-broker-36-4.el10.x86_64 163/195 Installing : dbus-broker-36-4.el10.x86_64 163/195 Running scriptlet: dbus-broker-36-4.el10.x86_64 163/195 Installing : dbus-1:1.14.10-5.el10.x86_64 164/195 Installing : systemd-pam-257-13.el10.x86_64 165/195 Running scriptlet: systemd-257-13.el10.x86_64 166/195 Creating group 'systemd-journal' with GID 190. Installing : systemd-257-13.el10.x86_64 166/195 Running scriptlet: systemd-257-13.el10.x86_64 166/195 Creating group 'input' with GID 104. Creating group 'kvm' with GID 36. Creating group 'render' with GID 105. Creating group 'sgx' with GID 106. Running scriptlet: logrotate-3.22.0-4.el10.x86_64 167/195 Installing : logrotate-3.22.0-4.el10.x86_64 167/195 Running scriptlet: logrotate-3.22.0-4.el10.x86_64 167/195 Created symlink '/etc/systemd/system/timers.target.wants/logrotate.timer' → '/usr/lib/systemd/system/logrotate.timer'. Running scriptlet: munge-0.5.15-10.el10.x86_64 168/195 Creating group 'munge' with GID 998. Creating user 'munge' (Runs Uid 'N' Gid Emporium) with UID 998 and GID 998. Installing : munge-0.5.15-10.el10.x86_64 168/195 Running scriptlet: munge-0.5.15-10.el10.x86_64 168/195 Installing : torque-libs-6.1.3-16.el10.x86_64 169/195 Installing : prrte-libs-3.0.2-9.el10.x86_64 170/195 Installing : prrte-3.0.2-9.el10.x86_64 171/195 Installing : rocm-clang-runtime-devel-20-13.rocm7.1.1.el10.x8 172/195 Installing : rocm-clang-20-13.rocm7.1.1.el10.x86_64 173/195 Installing : rocm-clang-devel-20-13.rocm7.1.1.el10.x86_64 174/195 Installing : rocm-device-libs-20-13.rocm7.1.1.el10.x86_64 175/195 Installing : hipcc-20-13.rocm7.1.1.el10.x86_64 176/195 Installing : rocm-hip-6.4.2-1.el10_1.x86_64 177/195 Running scriptlet: rocm-hip-6.4.2-1.el10_1.x86_64 177/195 Installing : rocblas-6.4.2-7.el10_1.x86_64 178/195 Running scriptlet: rocblas-6.4.2-7.el10_1.x86_64 178/195 Installing : rocsolver-6.4.2-2.el10_1.x86_64 179/195 Running scriptlet: rocsolver-6.4.2-2.el10_1.x86_64 179/195 Installing : hipblas-6.4.1-2.el10_1.x86_64 180/195 Running scriptlet: hipblas-6.4.1-2.el10_1.x86_64 180/195 Installing : rocm-comgr-devel-20-13.rocm7.1.1.el10.x86_64 181/195 Installing : rocm-hip-devel-6.4.2-1.el10_1.x86_64 182/195 Installing : rocblas-devel-6.4.2-7.el10_1.x86_64 183/195 Installing : hipblas-devel-6.4.1-2.el10_1.x86_64 184/195 Installing : openmpi-2:5.0.2-5.el10.x86_64 185/195 Installing : rocm-rpm-macros-6.4.2-1.el10_1.noarch 186/195 Installing : libcurl-devel-8.12.1-2.el10.x86_64 187/195 Installing : wget-1.24.5-5.el10.x86_64 188/195 Installing : gcc-c++-14.3.1-2.1.el10.x86_64 189/195 Installing : annobin-plugin-gcc-12.99-1.el10.x86_64 190/195 Running scriptlet: annobin-plugin-gcc-12.99-1.el10.x86_64 190/195 Installing : gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 191/195 Running scriptlet: gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 191/195 Installing : pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10 192/195 Installing : langpacks-en-4.1-3.el10.noarch 193/195 Installing : xxd-2:9.1.083-6.el10_1.x86_64 194/195 Installing : systemd-rpm-macros-257-13.el10.noarch 195/195 Running scriptlet: systemd-rpm-macros-257-13.el10.noarch 195/195 Installed products updated. Installed: annobin-docs-12.99-1.el10.noarch annobin-plugin-gcc-12.99-1.el10.x86_64 brotli-1.1.0-6.el10.x86_64 brotli-devel-1.1.0-6.el10.x86_64 cmake-3.30.5-3.el10_0.x86_64 cmake-data-3.30.5-3.el10_0.noarch cmake-filesystem-3.30.5-3.el10_0.x86_64 cmake-rpm-macros-3.30.5-3.el10_0.noarch cpp-14.3.1-2.1.el10.x86_64 dbus-1:1.14.10-5.el10.x86_64 dbus-broker-36-4.el10.x86_64 dbus-common-1:1.14.10-5.el10.noarch default-fonts-core-sans-4.1-3.el10.noarch emacs-filesystem-1:29.4-12.el10.noarch environment-modules-5.3.1-8.el10.x86_64 expat-2.7.1-1.el10_1.3.x86_64 fonts-filesystem-1:2.0.5-18.el10.noarch gcc-14.3.1-2.1.el10.x86_64 gcc-c++-14.3.1-2.1.el10.x86_64 gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 git-2.47.3-1.el10_0.x86_64 git-core-2.47.3-1.el10_0.x86_64 git-core-doc-2.47.3-1.el10_0.noarch glibc-devel-2.39-58.el10_1.2.x86_64 gnutls-3.8.10-2.el10.x86_64 google-noto-fonts-common-20240401-5.el10.noarch google-noto-sans-mono-vf-fonts-20240401-5.el10.noarch google-noto-sans-vf-fonts-20240401-5.el10.noarch google-noto-serif-vf-fonts-20240401-5.el10.noarch groff-base-1.23.0-10.el10.x86_64 hipblas-6.4.1-2.el10_1.x86_64 hipblas-common-devel-6.4.0-1.el10_1.noarch hipblas-devel-6.4.1-2.el10_1.x86_64 hipcc-20-13.rocm7.1.1.el10.x86_64 hwdata-0.379-10.6.el10.noarch hwloc-libs-2.11.1-3.el10.x86_64 jsoncpp-1.9.5-9.el10.x86_64 kernel-headers-6.12.0-124.28.1.el10_1.x86_64 keyutils-libs-devel-1.6.3-5.el10.x86_64 krb5-devel-1.21.3-8.el10_0.x86_64 langpacks-core-en-4.1-3.el10.noarch langpacks-en-4.1-3.el10.noarch langpacks-fonts-en-4.1-3.el10.noarch less-661-3.el10.x86_64 libcbor-0.11.0-3.el10.x86_64 libcom_err-devel-1.47.1-4.el10.x86_64 libcurl-devel-8.12.1-2.el10.x86_64 libdrm-2.4.123-1.el10.x86_64 libedit-3.1-52.20230828cvs.el10.x86_64 libfabric-2.1.0-1.el10.x86_64 libfido2-1.14.0-7.el10.x86_64 libgfortran-14.3.1-2.1.el10.x86_64 libibverbs-57.0-2.el10.x86_64 libidn2-devel-2.3.7-3.el10.x86_64 libkadm5-1.21.3-8.el10_0.x86_64 libmpc-1.3.1-7.el10.x86_64 libnghttp2-devel-1.64.0-2.el10.x86_64 libnl3-3.11.0-1.el10.x86_64 libpciaccess-0.16-16.el10.x86_64 libpipeline-1.5.7-7.el10.x86_64 libpsl-devel-0.21.5-6.el10.x86_64 libquadmath-14.3.1-2.1.el10.x86_64 librdmacm-57.0-2.el10.x86_64 libseccomp-2.5.6-1.el10.x86_64 libselinux-devel-3.9-1.el10.x86_64 libsepol-devel-3.9-1.el10.x86_64 libssh-devel-0.11.1-5.el10_1.x86_64 libstdc++-devel-14.3.1-2.1.el10.x86_64 libuv-1:1.51.0-1.el10_0.x86_64 libverto-devel-0.3.2-10.el10.x86_64 libxcrypt-devel-4.4.36-10.el10.x86_64 logrotate-3.22.0-4.el10.x86_64 make-1:4.4.1-9.el10.x86_64 man-db-2.12.0-10.el10.x86_64 mpdecimal-2.5.1-12.el10.x86_64 munge-0.5.15-10.el10.x86_64 munge-libs-0.5.15-10.el10.x86_64 ncurses-6.4-14.20240127.el10.x86_64 numactl-libs-2.0.19-2.el10.x86_64 ocl-icd-2.3.2-8.el10.x86_64 openmpi-2:5.0.2-5.el10.x86_64 openssh-9.9p1-12.el10_1.x86_64 openssh-clients-9.9p1-12.el10_1.x86_64 openssl-devel-1:3.5.1-5.el10_1.x86_64 pcre2-devel-10.44-1.el10.3.x86_64 pcre2-utf16-10.44-1.el10.3.x86_64 pcre2-utf32-10.44-1.el10.3.x86_64 perl-AutoLoader-5.74-512.2.el10_0.noarch perl-B-1.89-512.2.el10_0.x86_64 perl-Carp-1.54-511.el10.noarch perl-Class-Struct-0.68-512.2.el10_0.noarch perl-Data-Dumper-2.189-512.el10.x86_64 perl-Digest-1.20-511.el10.noarch perl-Digest-MD5-2.59-6.el10.x86_64 perl-DynaLoader-1.56-512.2.el10_0.x86_64 perl-Encode-4:3.21-511.el10.x86_64 perl-Errno-1.38-512.2.el10_0.x86_64 perl-Error-1:0.17029-18.el10.noarch perl-Exporter-5.78-511.el10.noarch perl-Fcntl-1.18-512.2.el10_0.x86_64 perl-File-Basename-2.86-512.2.el10_0.noarch perl-File-Find-1.44-512.2.el10_0.noarch perl-File-Path-2.18-511.el10.noarch perl-File-Temp-1:0.231.100-512.el10.noarch perl-File-stat-1.14-512.2.el10_0.noarch perl-FileHandle-2.05-512.2.el10_0.noarch perl-Getopt-Long-1:2.58-3.el10.noarch perl-Getopt-Std-1.14-512.2.el10_0.noarch perl-Git-2.47.3-1.el10_0.noarch perl-HTTP-Tiny-0.088-512.el10.noarch perl-IO-1.55-512.2.el10_0.x86_64 perl-IO-Socket-IP-0.42-512.el10.noarch perl-IO-Socket-SSL-2.085-3.el10.noarch perl-IPC-Open3-1.22-512.2.el10_0.noarch perl-MIME-Base64-3.16-511.el10.x86_64 perl-Mozilla-CA-20231213-5.el10.noarch perl-Net-SSLeay-1.94-8.el10.x86_64 perl-POSIX-2.20-512.2.el10_0.x86_64 perl-PathTools-3.91-512.el10.x86_64 perl-Pod-Escapes-1:1.07-511.el10.noarch perl-Pod-Perldoc-3.28.01-512.el10.noarch perl-Pod-Simple-1:3.45-511.el10.noarch perl-Pod-Usage-4:2.03-511.el10.noarch perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 perl-SelectSaver-1.02-512.2.el10_0.noarch perl-Socket-4:2.038-511.el10.x86_64 perl-Storable-1:3.32-511.el10.x86_64 perl-Symbol-1.09-512.2.el10_0.noarch perl-Term-ANSIColor-5.01-512.el10.noarch perl-Term-Cap-1.18-511.el10.noarch perl-TermReadKey-2.38-24.el10.x86_64 perl-Text-ParseWords-3.31-511.el10.noarch perl-Text-Tabs+Wrap-2024.001-511.el10.noarch perl-Time-Local-2:1.350-511.el10.noarch perl-URI-5.27-3.el10.noarch perl-base-2.27-512.2.el10_0.noarch perl-constant-1.33-512.el10.noarch perl-if-0.61.000-512.2.el10_0.noarch perl-interpreter-4:5.40.2-512.2.el10_0.x86_64 perl-lib-0.65-512.2.el10_0.x86_64 perl-libnet-3.15-512.el10.noarch perl-libs-4:5.40.2-512.2.el10_0.x86_64 perl-locale-1.12-512.2.el10_0.noarch perl-mro-1.29-512.2.el10_0.x86_64 perl-overload-1.37-512.2.el10_0.noarch perl-overloading-0.02-512.2.el10_0.noarch perl-parent-1:0.241-512.el10.noarch perl-podlators-1:5.01-511.el10.noarch perl-vars-1.05-512.2.el10_0.noarch pmix-4.2.8-8.el10.x86_64 procps-ng-4.0.4-8.el10.x86_64 prrte-3.0.2-9.el10.x86_64 prrte-libs-3.0.2-9.el10.x86_64 pthreadpool-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 publicsuffix-list-20240107-5.el10.noarch python3-3.12.12-1.el10_1.x86_64 python3-libs-3.12.12-1.el10_1.x86_64 python3-pip-wheel-23.3.2-7.el10.noarch redhat-mono-vf-fonts-4.1.0-1.el10.noarch redhat-text-vf-fonts-4.1.0-1.el10.noarch rocblas-6.4.2-7.el10_1.x86_64 rocblas-devel-6.4.2-7.el10_1.x86_64 rocm-clang-20-13.rocm7.1.1.el10.x86_64 rocm-clang-devel-20-13.rocm7.1.1.el10.x86_64 rocm-clang-libs-20-13.rocm7.1.1.el10.x86_64 rocm-clang-runtime-devel-20-13.rocm7.1.1.el10.x86_64 rocm-comgr-20-13.rocm7.1.1.el10.x86_64 rocm-comgr-devel-20-13.rocm7.1.1.el10.x86_64 rocm-device-libs-20-13.rocm7.1.1.el10.x86_64 rocm-hip-6.4.2-1.el10_1.x86_64 rocm-hip-devel-6.4.2-1.el10_1.x86_64 rocm-libc++-20-13.rocm7.1.1.el10.x86_64 rocm-libc++-devel-20-13.rocm7.1.1.el10.x86_64 rocm-lld-20-13.rocm7.1.1.el10.x86_64 rocm-llvm-20-13.rocm7.1.1.el10.x86_64 rocm-llvm-devel-20-13.rocm7.1.1.el10.x86_64 rocm-llvm-filesystem-20-13.rocm7.1.1.el10.x86_64 rocm-llvm-libs-20-13.rocm7.1.1.el10.x86_64 rocm-llvm-static-20-13.rocm7.1.1.el10.x86_64 rocm-rpm-macros-6.4.2-1.el10_1.noarch rocm-runtime-6.4.2-1.el10_1.x86_64 rocm-runtime-devel-6.4.2-1.el10_1.x86_64 rocsolver-6.4.2-2.el10_1.x86_64 systemd-257-13.el10.x86_64 systemd-pam-257-13.el10.x86_64 systemd-rpm-macros-257-13.el10.noarch tcl-1:8.6.13-4.el10.x86_64 torque-libs-6.1.3-16.el10.x86_64 tzdata-2025c-1.el10.noarch ucx-1.18.1-1.el10.x86_64 vim-filesystem-2:9.1.083-6.el10_1.noarch wget-1.24.5-5.el10.x86_64 xxd-2:9.1.083-6.el10_1.x86_64 zlib-ng-compat-devel-2.2.3-2.el10.x86_64 Complete! Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1768780800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 73 kB/s | 1.5 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 24 kB/s | 4.1 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 21 kB/s | 4.1 kB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 19 kB/s | 4.0 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 219 kB/s | 34 kB 00:00 Package cmake-3.30.5-3.el10_0.x86_64 is already installed. Package curl-8.12.1-2.el10.x86_64 is already installed. Package gcc-c++-14.3.1-2.1.el10.x86_64 is already installed. Package git-2.47.3-1.el10_0.x86_64 is already installed. Package hipblas-devel-6.4.1-2.el10_1.x86_64 is already installed. Package langpacks-en-4.1-3.el10.noarch is already installed. Package libcurl-devel-8.12.1-2.el10.x86_64 is already installed. Package openmpi-2:5.0.2-5.el10.x86_64 is already installed. Package pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 is already installed. Package rocblas-devel-6.4.2-7.el10_1.x86_64 is already installed. Package rocm-comgr-devel-20-13.rocm7.1.1.el10.x86_64 is already installed. Package rocm-hip-devel-6.4.2-1.el10_1.x86_64 is already installed. Package rocm-rpm-macros-6.4.2-1.el10_1.noarch is already installed. Package rocm-runtime-devel-6.4.2-1.el10_1.x86_64 is already installed. Package wget-1.24.5-5.el10.x86_64 is already installed. Package xxd-2:9.1.083-6.el10_1.x86_64 is already installed. Dependencies resolved. Nothing to do. Complete! Finish: build setup for llama-cpp-b6153-1.el10.src.rpm Start: rpmbuild llama-cpp-b6153-1.el10.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1768780800 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.z46FhB + umask 022 + cd /builddir/build/BUILD + cd /builddir/build/BUILD + rm -rf llama.cpp-b6153 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/llama.cpp-b6153.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd llama.cpp-b6153 + rm -rf /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + /usr/bin/mkdir -p /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' ggml/src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' tools/mtmd/CMakeLists.txt + sed -i '/target_link_libraries(ggml-hip PRIVATE ggml-base.*/aset_target_properties(ggml-hip PROPERTIES SOVERSION b6153)' ggml/src/ggml-hip/CMakeLists.txt + sed -i '/target_compile_features(${GGML_CPU_NAME} PRIVATE c_std_11.*/aset_target_properties(${GGML_CPU_NAME} PROPERTIES SOVERSION b6153)' ggml/src/ggml-cpu/CMakeLists.txt + sed -i '/#include ' src/llama-mmap.h + rm -rf exmples/llma.android + find . -name .gitignore -exec rm -rf '{}' ';' + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.pRbaji + umask 022 + cd /builddir/build/BUILD + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON -DCMAKE_INSTALL_LIBDIR=lib64 -DCMAKE_SKIP_RPATH=ON -DGGML_AVX=OFF -DGGML_AVX2=OFF -DGGML_AVX512=OFF -DGGML_AVX512_VBMI=OFF -DGGML_AVX512_VNNI=OFF -DGGML_FMA=OFF -DGGML_F16C=OFF -DGGML_HIP=ON '-DAMDGPU_TARGETS=gfx900;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack+;gfx90a:xnack-;gfx942;gfx950;gfx1010;gfx1012;gfx1030;gfx1031;gfx1035;gfx1100;gfx1101;gfx1102;gfx1103;gfx1150;gfx1151;gfx1152;gfx1153;gfx1200;gfx1201' -DLLAMA_BUILD_EXAMPLES=OFF -DLLAMA_BUILD_TESTS=OFF -- The C compiler identification is Clang 20.0.0 -- The CXX compiler identification is Clang 20.0.0 -- Detecting C compiler ABI info -- Detecting C compiler ABI info - done -- Check for working C compiler: /usr/bin/hipcc - skipped -- Detecting C compile features -- Detecting C compile features - done -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Found Git: /usr/bin/git (found version "2.47.3") fatal: not a git repository (or any of the parent directories): .git fatal: not a git repository (or any of the parent directories): .git sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- Setting GGML_NATIVE_DEFAULT to OFF -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF -- CMAKE_SYSTEM_PROCESSOR: x86_64 -- GGML_SYSTEM_ARCH: x86 -- Including CPU backend -- Could NOT find OpenMP_C (missing: OpenMP_C_FLAGS OpenMP_C_LIB_NAMES) -- Could NOT find OpenMP_CXX (missing: OpenMP_CXX_FLAGS OpenMP_CXX_LIB_NAMES) -- Could NOT find OpenMP (missing: OpenMP_C_FOUND OpenMP_CXX_FOUND) CMake Warning at ggml/src/ggml-cpu/CMakeLists.txt:80 (message): OpenMP not found Call Stack (most recent call first): ggml/src/CMakeLists.txt:372 (ggml_add_cpu_backend_variant_impl) -- x86 detected -- Adding CPU backend variant ggml-cpu: CMake Warning at ggml/src/ggml-hip/CMakeLists.txt:27 (message): Setting hipcc as the C++ compiler is legacy behavior. Prefer setting the HIP compiler directly. See README for details. CMake Warning (dev) at /usr/lib64/cmake/hip/hip-config-amd.cmake:70 (message): AMDGPU_TARGETS is deprecated. Please use GPU_TARGETS instead. Call Stack (most recent call first): /usr/lib64/cmake/hip/hip-config.cmake:159 (include) ggml/src/ggml-hip/CMakeLists.txt:39 (find_package) This warning is for project developers. Use -Wno-dev to suppress it. -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP and hipBLAS found -- Including HIP backend -- ggml version: 0.0.0 -- ggml commit: unknown CMake Warning at common/CMakeLists.txt:32 (message): Git repository not found; to enable automatic generation of build info, make sure Git is installed and the project is a Git repository. -- Found CURL: /usr/lib64/libcurl.so (found version "8.12.1") -- Configuring done (5.4s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP INCLUDE_INSTALL_DIR LIB_INSTALL_DIR LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j4 --verbose Change Dir: '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j4 /usr/bin/cmake -S/builddir/build/BUILD/llama.cpp-b6153 -B/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/CMakeFiles /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/depend /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-base.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/build_info.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-llava-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/build /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 0%] Building CXX object common/CMakeFiles/build_info.dir/build-info.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/build_info.dir/build-info.cpp.o -MF CMakeFiles/build_info.dir/build-info.cpp.o.d -o CMakeFiles/build_info.dir/build-info.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/build-info.cpp [ 1%] Building CXX object tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o [ 1%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp [ 2%] Building CXX object tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o -MF CMakeFiles/ggml-base.dir/ggml.c.o.d -o CMakeFiles/ggml-base.dir/ggml.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml.c cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o -MF CMakeFiles/ggml-base.dir/ggml.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target build_info [ 3%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o -MF CMakeFiles/ggml-base.dir/ggml-alloc.c.o.d -o CMakeFiles/ggml-base.dir/ggml-alloc.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-alloc.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Linking CXX executable ../../bin/llama-llava-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-llava-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-llava-cli [ 3%] Linking CXX executable ../../bin/llama-gemma3-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gemma3-cli.dir/link.txt --verbose=1 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-gemma3-cli clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target llama-llava-cli [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-backend.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-backend.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target llama-gemma3-cli [ 4%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-opt.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-opt.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 5%] Building CXX object tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 5%] Linking CXX executable ../../bin/llama-minicpmv-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-minicpmv-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-minicpmv-cli sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 5%] Built target llama-minicpmv-cli [ 5%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-threading.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-threading.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 5%] Building CXX object tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 6%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o -MF CMakeFiles/ggml-base.dir/ggml-quants.c.o.d -o CMakeFiles/ggml-base.dir/ggml-quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 6%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o -MF CMakeFiles/ggml-base.dir/gguf.cpp.o.d -o CMakeFiles/ggml-base.dir/gguf.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/gguf.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 7%] Linking CXX executable ../../bin/llama-qwen2vl-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-qwen2vl-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-qwen2vl-cli sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 7%] Built target llama-qwen2vl-cli [ 8%] Linking CXX shared library ../../bin/libggml-base.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-base.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-base.so.b6153 -o ../../bin/libggml-base.so.b6153 "CMakeFiles/ggml-base.dir/ggml.c.o" "CMakeFiles/ggml-base.dir/ggml.cpp.o" "CMakeFiles/ggml-base.dir/ggml-alloc.c.o" "CMakeFiles/ggml-base.dir/ggml-backend.cpp.o" "CMakeFiles/ggml-base.dir/ggml-opt.cpp.o" "CMakeFiles/ggml-base.dir/ggml-threading.cpp.o" "CMakeFiles/ggml-base.dir/ggml-quants.c.o" "CMakeFiles/ggml-base.dir/gguf.cpp.o" -lm sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 8%] Built target ggml-base /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/depend /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-cpu.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 8%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o [ 8%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o [ 9%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/repack.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.c [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1010. [ 11%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu [ 12%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/hbm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. [ 12%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/traits.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1031. [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/amx/amx.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1031. [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/amx/mmq.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/binary-ops.cpp /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/unary-ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/vec.cpp /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 4 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1151. 4 warnings generated when compiling for gfx1152. [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1153. 4 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ c) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1200. 4 warnings generated when compiling for gfx1201. [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/llamafile/sgemm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1201. 4 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx900. 4 warnings generated when compiling for gfx906. [ 16%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 17%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/repack.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx906. 4 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx908. 4 warnings generated when compiling for gfx90a. [ 17%] Linking CXX shared library ../../bin/libggml-cpu.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-cpu.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-cpu.so.b6153 -o ../../bin/libggml-cpu.so.b6153 "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o" ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx942. 4 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | In file included from const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)s/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.curc1 + i11*nb11); | ^ :1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for host. [ 17%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 18%] Built target ggml-cpu [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated1 when compiling for gfx1150. warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1030. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx1150. 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1151. 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 2 warnings generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx900. 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx906. 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx950. 1 warning generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for host. [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 1 warning generated when compiling for host. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. 4 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1012. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1153. 4 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1201. 4 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 4 warnings generated when compiling for gfx1031. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 4 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 4 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu 4 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx906. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 4 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 1 warning generated when compiling for gfx1151. 2 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx942. 4 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 2 warnings generated when compiling for gfx950. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 4 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx900. 4 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 4 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx90a. 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx942. 4 warnings generated when compiling for gfx942. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 1 warning generated when compiling for gfx950. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_avaIn file included from ilable(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 1 warning generated when compiling for gfx1200. 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 4 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 11 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. 7 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx942. 7 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu 7 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 22 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1035. 7 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx1100. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinli7 warnings generated when compiling for gfx1102. ne__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1103. 11 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 7 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1151. 22 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1153. 7 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1201. 7 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 20 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 24 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx950. 7 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 22 warnings generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu 7 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 11 warnings generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 7 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1151. 11 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 7 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 7 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx1152. 2 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 7 warnings generated when compiling for host. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx1010. 2 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 11 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 2 warnings generated when compiling for gfx1151. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; In file included from | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1103. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1201. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:1411:9: note: declared here 141 | int archLen = strlen(devName); | ^ warning generated when compiling for gfx900. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1150. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: 2 warnings generated when compiling for gfx908. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx900. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 1 warning generated when compiling for gfx908. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx950. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 2 warnings generated when compiling for host. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 1 warning generated when compiling for gfx1012. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 29 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 1 warning generated when compiling for gfx1030. 27 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 1 warning generated when compiling for gfx1031. 27 warnings generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu1:: In file included from 1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh: :1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx950. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh: :270/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh::42270:: 42:warning: unused parameter 'cc' [-Wunused-parameter]warning: unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ 1 warning generated when compiling for gfx1103. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { bool fp16_mma_available(const int cc) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1151. 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 1 warning generated when compiling for gfx908. 3 warnings generated when compiling for gfx1200. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 1 warning generated when compiling for gfx942. 3 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1035. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx908. 1 warning generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu 13 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 3 warnings generated when compiling for gfx942. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for host. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh13 warnings generated when compiling for gfx1100. :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1150. 13 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1153. 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1012. 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 15 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1030. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. 13 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu 1 warning generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1153. 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1031. 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1153. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 3 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1201. 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx900. 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1151. 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx942. 3 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) 1 warning generated when compiling for gfx1151. { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1152. 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1200. 1 warning generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1153. 3 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> In file included from & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warning generated when compiling for gfx90a. warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { 1 warning generated when compiling for gfx90a. | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx950. 14 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * c14o warnings generated when compiling for gfx1100. nst __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc)In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx908. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float>In file included from &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu :D3,: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuhc:o3n: s/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuht: 302t:i90l:e & A, cons t302 | t i l e < 1 6 , 8 , htaillfe2<>1 6&, B8), {i n t| > ^ & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | 326 | t i l e < 1 6 , t1i6l,e 8&, Di,n tc>o n&s tD ,t icloen 8&, Ai,n tc>o n&s tA ,t icloen8 ,& iBn)t >{ & | B ^) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: :warning: 544:function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]92 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | 544 | t i lteit >& &D ,D ,c ocnosnts tt itliel> && AA,, ccoonnsstt ttiillee<<382,, 84,, hianltf>2 >& &B )B ){ { | ^| ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. 1 warning generated when compiling for host. [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8,12 warnings generated when compiling for gfx908. 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> &[ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<112 warnings generated when compiling for gfx90a. 6, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. 14 warnings generated when compiling for gfx900. [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_waiIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ t_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const In file included from tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float>In file included from & A, co/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3nst tile<8, 8,: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788 float> & B) { | ^ :43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | 14 warnings generated when compiling for gfx1035. tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<1In file included from 6, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh const float * const __restrict__ sinks_f, | ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ 544 | tile<32/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh,:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const til14 warnings generated when compiling for gfx950. e<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16,In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device 8, int> & A, const t__ __forceinline__ void cp_async_wait_all() {il | ^ e<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const t ile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ tile<16/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh, 8:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, in, t> & B) { | ^ float> & D, con/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhst:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh<1:6436,: 968:, warning: hafunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]l f2> & B) { | ^ 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile & B) { | ^ 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | ti/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhle:<4801:698,: 1warning: 6function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn], int> & D, cons t480 | t i l e < 1 6 , 8 , itnitl>e <&1 6A,, 1c6, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ onst tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_waiIn file included from t_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinlineIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ __ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | statiIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ c bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all()In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ :516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | In file included from const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const flIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cuoat * const __restrict__ sinks_f, | ^ :3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436In file included from | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ loat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, coIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ nst tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device_14 warnings generated when compiling for gfx1150. _ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for host. [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for host. [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for host. [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() {In file included from | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for host. [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:14 warnings generated when compiling for gfx1151. 270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ 1257:35: warning: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ unused parameter 'KV_max' [-Wunused-parameter] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 1257 | const int * __restrict__ KV_max, | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restIn file included from rict__ KV_max, | ^/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu :3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuhp_async_wait_all() { | ^ :1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloaIn file included from t162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 14 warnings generated when compiling for gfx900. 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx900. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __devIn file included from ice__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu 14 warnings generated when compiling for host. [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, conIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ st/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ tile<16/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh, 4, int> & A, const tile<8, 4, int> & B) { | ^ :/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ 356:96: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhwarning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :356:96: warning: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 356 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 463 | :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ tile & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ , float> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ :436:96: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 516 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuht:480:98:i warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ le<16,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ :516:92: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhwarning: :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool f10 warnings generated when compiling for gfx908. p16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16In file included from , 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 302 | t/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhi:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ le<16, 8, int> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: 516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ D, const /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhtile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, In file included from 8/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ , int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 13 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 17 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter]In file included from 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int12 warnings generated when compiling for gfx1200. 32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1103. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 28 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx942. 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1031. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 28 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx950. 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1010. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1152. 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1153. 8 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1200. 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh const int3:2_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | _t nb32, const int64_t nb33) { | ^ for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1035. 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx942. 28 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx906. 8 warnings generated when compiling for gfx1103. 8 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1152. 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1010. 8 warnings generated when compiling for gfx1200. 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1201. 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx900. 1 warning generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1150. 8 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1151. 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1152. 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1153. 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx942. 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1103. 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1103. 8 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bo1ol fp16_mma_available(const int cc) { | ^ warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx942. 8 warnings generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 62%] Linking CXX shared library ../../../bin/libggml-hip.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-hip.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-hip.so.b6153 -o ../../../bin/libggml-hip.so.b6153 "CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o" ../../../bin/libggml-base.so.b6153 /usr/lib64/libhipblas.so.2.4 --hip-link --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 /usr/lib64/librocblas.so.4.4 /usr/lib64/libamdhip64.so.6.4.43484 clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_symlink_library ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 62%] Built target ggml-hip /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 62%] Building CXX object ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_BUILD -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -MF CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o.d -o CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-backend-reg.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 63%] Linking CXX shared library ../../bin/libggml.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml.so.b6153 -o ../../bin/libggml.so.b6153 "CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o" -ldl ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 63%] Built target ggml /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src/CMakeFiles/llama.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 63%] Building CXX object src/CMakeFiles/llama.dir/llama-arch.cpp.o [ 63%] Building CXX object src/CMakeFiles/llama.dir/llama.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-adapter.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-batch.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama.cpp.o -MF CMakeFiles/llama.dir/llama.cpp.o.d -o CMakeFiles/llama.dir/llama.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-adapter.cpp.o -MF CMakeFiles/llama.dir/llama-adapter.cpp.o.d -o CMakeFiles/llama.dir/llama-adapter.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-adapter.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-arch.cpp.o -MF CMakeFiles/llama.dir/llama-arch.cpp.o.d -o CMakeFiles/llama.dir/llama-arch.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-arch.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-batch.cpp.o -MF CMakeFiles/llama.dir/llama-batch.cpp.o.d -o CMakeFiles/llama.dir/llama-batch.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-batch.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-chat.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-chat.cpp.o -MF CMakeFiles/llama.dir/llama-chat.cpp.o.d -o CMakeFiles/llama.dir/llama-chat.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-context.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-context.cpp.o -MF CMakeFiles/llama.dir/llama-context.cpp.o.d -o CMakeFiles/llama.dir/llama-context.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-context.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-cparams.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-cparams.cpp.o -MF CMakeFiles/llama.dir/llama-cparams.cpp.o.d -o CMakeFiles/llama.dir/llama-cparams.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-cparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-grammar.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-grammar.cpp.o -MF CMakeFiles/llama.dir/llama-grammar.cpp.o.d -o CMakeFiles/llama.dir/llama-grammar.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-graph.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-graph.cpp.o -MF CMakeFiles/llama.dir/llama-graph.cpp.o.d -o CMakeFiles/llama.dir/llama-graph.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-graph.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-hparams.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-hparams.cpp.o -MF CMakeFiles/llama.dir/llama-hparams.cpp.o.d -o CMakeFiles/llama.dir/llama-hparams.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-hparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-impl.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-impl.cpp.o -MF CMakeFiles/llama.dir/llama-impl.cpp.o.d -o CMakeFiles/llama.dir/llama-impl.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-impl.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-io.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-io.cpp.o -MF CMakeFiles/llama.dir/llama-io.cpp.o.d -o CMakeFiles/llama.dir/llama-io.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-io.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-kv-cache-unified.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-kv-cache-unified-iswa.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory.cpp.o -MF CMakeFiles/llama.dir/llama-memory.cpp.o.d -o CMakeFiles/llama.dir/llama-memory.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -MF CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory-hybrid.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -MF CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory-recurrent.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-mmap.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-mmap.cpp.o -MF CMakeFiles/llama.dir/llama-mmap.cpp.o.d -o CMakeFiles/llama.dir/llama-mmap.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-mmap.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-loader.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-loader.cpp.o -MF CMakeFiles/llama.dir/llama-model-loader.cpp.o.d -o CMakeFiles/llama.dir/llama-model-loader.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model-loader.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-saver.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-saver.cpp.o -MF CMakeFiles/llama.dir/llama-model-saver.cpp.o.d -o CMakeFiles/llama.dir/llama-model-saver.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model-saver.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-model.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model.cpp.o -MF CMakeFiles/llama.dir/llama-model.cpp.o.d -o CMakeFiles/llama.dir/llama-model.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-quant.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-quant.cpp.o -MF CMakeFiles/llama.dir/llama-quant.cpp.o.d -o CMakeFiles/llama.dir/llama-quant.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-quant.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-sampling.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-sampling.cpp.o -MF CMakeFiles/llama.dir/llama-sampling.cpp.o.d -o CMakeFiles/llama.dir/llama-sampling.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-vocab.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-vocab.cpp.o -MF CMakeFiles/llama.dir/llama-vocab.cpp.o.d -o CMakeFiles/llama.dir/llama-vocab.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-vocab.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode-data.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode-data.cpp.o -MF CMakeFiles/llama.dir/unicode-data.cpp.o.d -o CMakeFiles/llama.dir/unicode-data.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/unicode-data.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode.cpp.o -MF CMakeFiles/llama.dir/unicode.cpp.o.d -o CMakeFiles/llama.dir/unicode.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/unicode.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 76%] Linking CXX shared library ../bin/libllama.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libllama.so.b6153 -o ../bin/libllama.so.b6153 CMakeFiles/llama.dir/llama.cpp.o "CMakeFiles/llama.dir/llama-adapter.cpp.o" "CMakeFiles/llama.dir/llama-arch.cpp.o" "CMakeFiles/llama.dir/llama-batch.cpp.o" "CMakeFiles/llama.dir/llama-chat.cpp.o" "CMakeFiles/llama.dir/llama-context.cpp.o" "CMakeFiles/llama.dir/llama-cparams.cpp.o" "CMakeFiles/llama.dir/llama-grammar.cpp.o" "CMakeFiles/llama.dir/llama-graph.cpp.o" "CMakeFiles/llama.dir/llama-hparams.cpp.o" "CMakeFiles/llama.dir/llama-impl.cpp.o" "CMakeFiles/llama.dir/llama-io.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o" "CMakeFiles/llama.dir/llama-memory.cpp.o" "CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o" "CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o" "CMakeFiles/llama.dir/llama-mmap.cpp.o" "CMakeFiles/llama.dir/llama-model-loader.cpp.o" "CMakeFiles/llama.dir/llama-model-saver.cpp.o" "CMakeFiles/llama.dir/llama-model.cpp.o" "CMakeFiles/llama.dir/llama-quant.cpp.o" "CMakeFiles/llama.dir/llama-sampling.cpp.o" "CMakeFiles/llama.dir/llama-vocab.cpp.o" "CMakeFiles/llama.dir/unicode-data.cpp.o" CMakeFiles/llama.dir/unicode.cpp.o ../bin/libggml.so.b6153 ../bin/libggml-cpu.so.b6153 ../bin/libggml-hip.so.b6153 ../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_symlink_library ../bin/libllama.so.b6153 ../bin/libllama.so.b6153 ../bin/libllama.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 76%] Built target llama /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/common.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/mtmd.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 77%] Building CXX object common/CMakeFiles/common.dir/chat-parser.cpp.o [ 77%] Building CXX object common/CMakeFiles/common.dir/arg.cpp.o [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/arg.cpp.o -MF CMakeFiles/common.dir/arg.cpp.o.d -o CMakeFiles/common.dir/arg.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/arg.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat-parser.cpp.o -MF CMakeFiles/common.dir/chat-parser.cpp.o.d -o CMakeFiles/common.dir/chat-parser.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/chat-parser.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-audio.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-audio.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o -MF CMakeFiles/mtmd.dir/mtmd.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o -MF CMakeFiles/mtmd.dir/clip.cpp.o.d -o CMakeFiles/mtmd.dir/clip.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/clip.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object common/CMakeFiles/common.dir/chat.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat.cpp.o -MF CMakeFiles/common.dir/chat.cpp.o.d -o CMakeFiles/common.dir/chat.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-helper.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-helper.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 80%] Building CXX object common/CMakeFiles/common.dir/common.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/common.cpp.o -MF CMakeFiles/common.dir/common.cpp.o.d -o CMakeFiles/common.dir/common.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/common.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 80%] Building CXX object common/CMakeFiles/common.dir/console.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/console.cpp.o -MF CMakeFiles/common.dir/console.cpp.o.d -o CMakeFiles/common.dir/console.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/console.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 81%] Building CXX object common/CMakeFiles/common.dir/json-partial.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-partial.cpp.o -MF CMakeFiles/common.dir/json-partial.cpp.o.d -o CMakeFiles/common.dir/json-partial.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/json-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 82%] Linking CXX shared library ../../bin/libmtmd.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/mtmd.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libmtmd.so.b6153 -o ../../bin/libmtmd.so.b6153 CMakeFiles/mtmd.dir/mtmd.cpp.o "CMakeFiles/mtmd.dir/mtmd-audio.cpp.o" CMakeFiles/mtmd.dir/clip.cpp.o "CMakeFiles/mtmd.dir/mtmd-helper.cpp.o" ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 82%] Building CXX object common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -MF CMakeFiles/common.dir/json-schema-to-grammar.cpp.o.d -o CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/json-schema-to-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 83%] Building CXX object common/CMakeFiles/common.dir/llguidance.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/llguidance.cpp.o -MF CMakeFiles/common.dir/llguidance.cpp.o.d -o CMakeFiles/common.dir/llguidance.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/llguidance.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 83%] Building CXX object common/CMakeFiles/common.dir/log.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/log.cpp.o -MF CMakeFiles/common.dir/log.cpp.o.d -o CMakeFiles/common.dir/log.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/log.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_symlink_library ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 83%] Built target mtmd [ 84%] Building CXX object common/CMakeFiles/common.dir/ngram-cache.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/ngram-cache.cpp.o -MF CMakeFiles/common.dir/ngram-cache.cpp.o.d -o CMakeFiles/common.dir/ngram-cache.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/ngram-cache.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 84%] Building CXX object common/CMakeFiles/common.dir/regex-partial.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/regex-partial.cpp.o -MF CMakeFiles/common.dir/regex-partial.cpp.o.d -o CMakeFiles/common.dir/regex-partial.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/regex-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/sampling.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/sampling.cpp.o -MF CMakeFiles/common.dir/sampling.cpp.o.d -o CMakeFiles/common.dir/sampling.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/speculative.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/speculative.cpp.o -MF CMakeFiles/common.dir/speculative.cpp.o.d -o CMakeFiles/common.dir/speculative.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/speculative.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 86%] Linking CXX static library libcommon.a cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -P CMakeFiles/common.dir/cmake_clean_target.cmake cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -E cmake_link_script CMakeFiles/common.dir/link.txt --verbose=1 /usr/bin/ar qc libcommon.a CMakeFiles/common.dir/arg.cpp.o "CMakeFiles/common.dir/chat-parser.cpp.o" CMakeFiles/common.dir/chat.cpp.o CMakeFiles/common.dir/common.cpp.o CMakeFiles/common.dir/console.cpp.o "CMakeFiles/common.dir/json-partial.cpp.o" "CMakeFiles/common.dir/json-schema-to-grammar.cpp.o" CMakeFiles/common.dir/llguidance.cpp.o CMakeFiles/common.dir/log.cpp.o "CMakeFiles/common.dir/ngram-cache.cpp.o" "CMakeFiles/common.dir/regex-partial.cpp.o" CMakeFiles/common.dir/sampling.cpp.o CMakeFiles/common.dir/speculative.cpp.o "CMakeFiles/build_info.dir/build-info.cpp.o" /usr/bin/ranlib libcommon.a gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 86%] Built target common /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/depend /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/depend /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/depend /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/batched-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench/CMakeFiles/llama-batched-bench.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/gguf-split /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split/CMakeFiles/llama-gguf-split.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/imatrix /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix/CMakeFiles/llama-imatrix.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/llama-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench/CMakeFiles/llama-bench.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 87%] Building CXX object tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o [ 87%] Building CXX object tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -MF CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o.d -o CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/gguf-split/gguf-split.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -MF CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o.d -o CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/batched-bench/batched-bench.cpp [ 88%] Building CXX object tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -MF CMakeFiles/llama-imatrix.dir/imatrix.cpp.o.d -o CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/imatrix/imatrix.cpp [ 88%] Building CXX object tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o -MF CMakeFiles/llama-bench.dir/llama-bench.cpp.o.d -o CMakeFiles/llama-bench.dir/llama-bench.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/llama-bench/llama-bench.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 88%] Linking CXX executable ../../bin/llama-gguf-split cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gguf-split.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o" -o ../../bin/llama-gguf-split ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 89%] Linking CXX executable ../../bin/llama-batched-bench cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-batched-bench.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o" -o ../../bin/llama-batched-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 89%] Built target llama-gguf-split /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/main /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main/CMakeFiles/llama-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 89%] Building CXX object tools/main/CMakeFiles/llama-cli.dir/main.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/main/CMakeFiles/llama-cli.dir/main.cpp.o -MF CMakeFiles/llama-cli.dir/main.cpp.o.d -o CMakeFiles/llama-cli.dir/main.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/main/main.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 89%] Linking CXX executable ../../bin/llama-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-cli.dir/main.cpp.o" -o ../../bin/llama-cli ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 89%] Linking CXX executable ../../bin/llama-imatrix cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-imatrix.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-imatrix.dir/imatrix.cpp.o" -o ../../bin/llama-imatrix ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 90%] Linking CXX executable ../../bin/llama-bench cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-bench.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-bench.dir/llama-bench.cpp.o" -o ../../bin/llama-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 90%] Built target llama-bench /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/perplexity /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity/CMakeFiles/llama-perplexity.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -MF CMakeFiles/llama-perplexity.dir/perplexity.cpp.o.d -o CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/perplexity/perplexity.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 91%] Linking CXX executable ../../bin/llama-perplexity cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-perplexity.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-perplexity.dir/perplexity.cpp.o" -o ../../bin/llama-perplexity ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Built target llama-batched-bench /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/quantize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize/CMakeFiles/llama-quantize.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/tools/quantize/../../common -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o -MF CMakeFiles/llama-quantize.dir/quantize.cpp.o.d -o CMakeFiles/llama-quantize.dir/quantize.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/quantize/quantize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 92%] Linking CXX executable ../../bin/llama-quantize cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-quantize.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-quantize.dir/quantize.cpp.o" -o ../../bin/llama-quantize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 92%] Built target llama-quantize /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 92%] Generating loading.html.hpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama.cpp-b6153/tools/server/public/loading.html -DOUTPUT=/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/loading.html.hpp -P /builddir/build/BUILD/llama.cpp-b6153/scripts/xxd.cmake [ 93%] Generating index.html.gz.hpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama.cpp-b6153/tools/server/public/index.html.gz -DOUTPUT=/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/index.html.gz.hpp -P /builddir/build/BUILD/llama.cpp-b6153/scripts/xxd.cmake gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 93%] Built target llama-cli /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/run /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run/CMakeFiles/llama-run.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 93%] Building CXX object tools/run/CMakeFiles/llama-run.dir/run.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/run.cpp.o -MF CMakeFiles/llama-run.dir/run.cpp.o.d -o CMakeFiles/llama-run.dir/run.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/run/run.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/server /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/CMakeFiles/llama-server.dir/DependInfo.cmake "--color=" [ 93%] Built target llama-imatrix gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/build /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/tokenize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize/CMakeFiles/llama-tokenize.dir/DependInfo.cmake "--color=" [ 94%] Building CXX object tools/server/CMakeFiles/llama-server.dir/server.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/tools/server -I/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server -I/builddir/build/BUILD/llama.cpp-b6153/tools/server/../llava -I/builddir/build/BUILD/llama.cpp-b6153 -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/server/CMakeFiles/llama-server.dir/server.cpp.o -MF CMakeFiles/llama-server.dir/server.cpp.o.d -o CMakeFiles/llama-server.dir/server.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/server/server.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 95%] Building CXX object tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -MF CMakeFiles/llama-tokenize.dir/tokenize.cpp.o.d -o CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/tokenize/tokenize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 95%] Linking CXX executable ../../bin/llama-tokenize cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tokenize.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-tokenize.dir/tokenize.cpp.o" -o ../../bin/llama-tokenize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 95%] Built target llama-tokenize [ 96%] Building CXX object tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -MF CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o.d -o CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/run/linenoise.cpp/linenoise.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/tts /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts/CMakeFiles/llama-tts.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 97%] Building CXX object tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o -MF CMakeFiles/llama-tts.dir/tts.cpp.o.d -o CMakeFiles/llama-tts.dir/tts.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/tts/tts.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 97%] Linking CXX executable ../../bin/llama-run cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-run.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-run.dir/run.cpp.o" "CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o" -o ../../bin/llama-run ../../common/libcommon.a ../../bin/libllama.so.b6153 /usr/lib64/libcurl.so ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 97%] Built target llama-perplexity /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 98%] Building CXX object tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -MF CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o.d -o CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-cli.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 98%] Linking CXX executable ../../bin/llama-tts cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tts.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-tts.dir/tts.cpp.o" -o ../../bin/llama-tts ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 98%] Linking CXX executable ../../bin/llama-mtmd-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-mtmd-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o" -o ../../bin/llama-mtmd-cli ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 98%] Built target llama-run /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/cvector-generator /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 99%] Building CXX object tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -MF CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o.d -o CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/cvector-generator/cvector-generator.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 99%] Linking CXX executable ../../bin/llama-cvector-generator cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cvector-generator.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o" -o ../../bin/llama-cvector-generator ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 99%] Built target llama-tts /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/export-lora /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora/CMakeFiles/llama-export-lora.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Building CXX object tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -MF CMakeFiles/llama-export-lora.dir/export-lora.cpp.o.d -o CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/export-lora/export-lora.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-mtmd-cli [100%] Linking CXX executable ../../bin/llama-export-lora cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-export-lora.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-export-lora.dir/export-lora.cpp.o" -o ../../bin/llama-export-lora ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [100%] Linking CXX executable ../../bin/llama-server cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-server.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-server.dir/server.cpp.o" -o ../../bin/llama-server ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-cvector-generator gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-export-lora gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-server gmake[1]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.dPksMO + umask 022 + cd /builddir/build/BUILD + '[' /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 '!=' / ']' + rm -rf /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 ++ dirname /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + mkdir -p /builddir/build/BUILDROOT + mkdir /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + DESTDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "Release" -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-cpu.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-cpu.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-hip.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-hip.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cpu.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-alloc.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-backend.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-blas.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cann.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cpp.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cuda.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-opt.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-metal.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-rpc.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-sycl.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-vulkan.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-webgpu.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/gguf.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-base.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-base.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/ggml/ggml-config.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/ggml/ggml-version.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-batched-bench -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-gguf-split -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-imatrix -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-bench -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-cli -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-perplexity -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-quantize -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-server -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-run -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-tokenize -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-tts -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libmtmd.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libmtmd.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/mtmd.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/mtmd-helper.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-mtmd-cli -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-cvector-generator -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-export-lora -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libllama.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libllama.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/llama.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/llama-cpp.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/llama/llama-config.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/llama/llama-version.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/convert_hf_to_gguf.py -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/pkgconfig/llama.pc + rm -rf '/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml_shared.*' + rm /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/convert_hf_to_gguf.py + /usr/bin/find-debuginfo -j4 --strict-build-id -m -i --build-id-seed b6153-1.el10 --unique-debug-suffix -b6153-1.el10.x86_64 --unique-debug-src-base llama-cpp-b6153-1.el10.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/llama.cpp-b6153 find-debuginfo: starting Extracting debug info from 20 files DWARF-compressing 20 files dwz: ./usr/bin/llama-batched-bench-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-bench-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cli-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cvector-generator-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-export-lora-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-gguf-split-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-imatrix-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-mtmd-cli-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-perplexity-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-quantize-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-run-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-server-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tokenize-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tts-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-base.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-cpu.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-hip.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libllama.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libmtmd.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: Too few files for multifile optimization sepdebugcrcfix: Updated 0 CRC32s, 20 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/llama-cpp-b6153-1.el10.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink Processing files: llama-cpp-b6153-1.el10.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.uv9IWo + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + LICENSEDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + export LC_ALL= + LC_ALL= + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + cp -pr /builddir/build/BUILD/llama.cpp-b6153/LICENSE /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + RPM_EC=0 ++ jobs -p + exit 0 Provides: libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) llama-cpp = b6153-1.el10 llama-cpp(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.29)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libcurl.so.4()(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libhipblas.so.2()(64bit) libllama.so.b6153()(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) libm.so.6(GLIBC_2.27)(64bit) libm.so.6(GLIBC_2.29)(64bit) libmtmd.so.b6153()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.11)(64bit) libstdc++.so.6(CXXABI_1.3.13)(64bit) libstdc++.so.6(CXXABI_1.3.2)(64bit) libstdc++.so.6(CXXABI_1.3.3)(64bit) libstdc++.so.6(CXXABI_1.3.5)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.14)(64bit) libstdc++.so.6(GLIBCXX_3.4.15)(64bit) libstdc++.so.6(GLIBCXX_3.4.17)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.20)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.25)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Recommends: numactl Processing files: llama-cpp-devel-b6153-1.el10.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.GhwUdt + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + DOCDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + export LC_ALL= + LC_ALL= + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + cp -pr /builddir/build/BUILD/llama.cpp-b6153/README.md /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(ggml) cmake(llama) llama-cpp-devel = b6153-1.el10 llama-cpp-devel(x86-64) = b6153-1.el10 pkgconfig(llama) = 0.0.0 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: /usr/bin/pkg-config cmake-filesystem(x86-64) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) Processing files: llama-cpp-debugsource-b6153-1.el10.x86_64 Provides: llama-cpp-debugsource = b6153-1.el10 llama-cpp-debugsource(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: llama-cpp-debuginfo-b6153-1.el10.x86_64 Provides: debuginfo(build-id) = 181471c5afac4a176f2db729e3c81d43fe0b6041 debuginfo(build-id) = 1db3ca7a8c04338b2c911a7fc9d9689ed4027b3c debuginfo(build-id) = 2db68bfc52a71361d3366373c506221c617ffb23 debuginfo(build-id) = 2e5d42edbc00e3564ba6209ca9473d0eca95eb4a debuginfo(build-id) = 43158f102cc51200462fcdca49500406ad96272e debuginfo(build-id) = 4b412322cbe71aca4ba365dd1448fbc6655f8a27 debuginfo(build-id) = 6b9bbce3cc8bd51cd1a2b8b79f1d7498efade21b debuginfo(build-id) = 706fb90cdc6912d3059ff19eb84c7e0e617ab27a debuginfo(build-id) = 8292cd4445f3d6f19751b61c2ea82c2ff8d79423 debuginfo(build-id) = 8558a1b97b71b69afa5e5100fec8662541f121ab debuginfo(build-id) = 8f25328a3ce9cf961736230861f4b42eb1b8166a debuginfo(build-id) = b4f172ba73683c424f1707f944611e63686b358e debuginfo(build-id) = ceb5d2c2dc0dff9ddd09b3caf9989414078ef786 debuginfo(build-id) = d639d128469d5dba460952594cd0ab7b4c199e61 debuginfo(build-id) = dbb3d388c96ef44d818ab143f3d6efc1174f7623 debuginfo(build-id) = e1f0e9ff65f327d2dec9ce1145dba5f1d8371670 debuginfo(build-id) = e2b06962a0ce7bd3838dcb5b8b0e2cf8acd4af8f debuginfo(build-id) = eab8c5e49a0de1a8dd6a29fdb81eecd4eb45e381 debuginfo(build-id) = eb3652d20400cdba0fd1595a62ddd40d29e3a40b debuginfo(build-id) = edac8094301ab8432a9e33a1e1c40433ecb084d9 libggml-base.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml-cpu.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml-hip.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libllama.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libmtmd.so.b6153-b6153-1.el10.x86_64.debug()(64bit) llama-cpp-debuginfo = b6153-1.el10 llama-cpp-debuginfo(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: llama-cpp-debugsource(x86-64) = b6153-1.el10 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 Wrote: /builddir/build/RPMS/llama-cpp-devel-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debugsource-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debuginfo-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-b6153-1.el10.x86_64.rpm Executing(%clean): /bin/sh -e /var/tmp/rpm-tmp.SgTulI + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + /usr/bin/rm -rf /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + RPM_EC=0 ++ jobs -p + exit 0 Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.Ak8s5Y + umask 022 + cd /builddir/build/BUILD + rm -rf /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + rm -rf llama.cpp-b6153 llama.cpp-b6153.gemspec + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild llama-cpp-b6153-1.el10.src.rpm Finish: build phase for llama-cpp-b6153-1.el10.src.rpm INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.rpm.log /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.librepo.log /var/lib/mock/rhel+epel-10-x86_64-1768866066.175986/root/var/log/dnf.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.el10.src.rpm) Config(child) 91 minutes 37 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "src" }, { "name": "llama-cpp-debugsource", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp-debuginfo", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp-devel", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" } ] } RPMResults finished