Package: librocblas0 Version: 5.5.1+dfsg-7 Severity: minor Dear Maintainer,
When profiling the rocblas calls from llama.cpp, I was using the following comamnd: ROCBLAS_LAYER=2 ROCBLAS_LOG_BENCH_PATH=$HOME/bench.log \ ./llama-cli -ngl 99 --color -c 2048 --temp 0.7 \ --repeat_penalty 1.1 -n -1 -m dolphin-2.2.1-mistral-7b.Q5_K_M.gguf \ -no-cnv --prompt "Once upon a time" and got output like this in bench.log: ./rocblas-bench -f gemm_batched_ex --transposeA T --transposeB N -m 32 -n 2 -k 128 --alpha 1 --a_type f16_r --lda 1024 --b_type f16_r --ldb 4096 --beta 0 --c_type f16_r --ldc --d_type f16_r --ldd 32 --batch_count 32 --compute_type f16_r --algo 0 --solution_index 0 --flags 1 However, these arguments are incomplete, as shown when passing them to librocblas0-bench: $ cd /usr/libexec/rocm/librocblas0-bench $ ./rocblas-bench -f gemm_batched_ex --transposeA T --transposeB N -m 32 -n 2 -k 128 --alpha 1 --a_type f16_r --lda 1024 --b_type f16_r --ldb 4096 --beta 0 --c_type f16_r --ldc --d_type f16_r --ldd 32 --batch_count 32 --compute_type f16_r --algo 0 --solution_index 0 --flags 1 Invalid value for --ldc This can be worked around by dropping the --ldc flag, but this is definitely a bug. The command line emitted by the bench logging should be suitable for pasing directly to rocblas-bench. Sincerely, Cory Bloor -- System Information: Debian Release: trixie/sid APT prefers testing APT policy: (500, 'testing') Architecture: amd64 (x86_64) Kernel: Linux 6.12.9-amd64 (SMP w/32 CPU threads; PREEMPT) Locale: LANG=en_CA.UTF-8, LC_CTYPE=en_CA.UTF-8 (charmap=UTF-8), LANGUAGE=en_CA:en Shell: /bin/sh linked to /usr/bin/dash Init: systemd (via /run/systemd/system) LSM: AppArmor: enabled Versions of packages librocblas0 depends on: ii libamdhip64-5 5.7.1-5 ii libc6 2.40-5 ii libgcc-s1 14.2.0-12 ii libstdc++6 14.2.0-12 librocblas0 recommends no packages. librocblas0 suggests no packages. -- no debconf information