| Name | Module | Max Thread Time / Walltime orig_0 (%) | Coverage orig_0 (%) | Coverage Excluding Loops orig_0 (%) | Max Inclusive Time Over Threads orig_0 (s) | Max Exclusive Time Over Threads orig_0 (s) | Inclusive Time w.r.t. Wall Time orig_0 (s) | Exclusive Time w.r.t. Wall Time orig_0 (s) | Nb Threads orig_0 | Deviation (coverage) orig_0 | Deviation (walltime) orig_0 | Categories orig_0 | Compilation Options |
| ►kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm+ | libggml-cpu.so | 40.02 | 76.44 | 0.00 | 1.73 | 0.00 | 2.14 | 0.00 | 95 | 7.59 | 0.16 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2353 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2352 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2351 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2356 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.12 | 76.43 | 0.00 | 1.76 | 0.00 | 2.14 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 2355 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.58 | 76.43 | 0.39 | 1.76 | 0.03 | 2.14 | 0.01 | 77 | 0.26 | 0.01 | | |
| ○Loop 2354 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 40.02 | 76.04 | 76.04 | 1.73 | 1.73 | 2.13 | 2.13 | 95 | 7.59 | 0.16 | | |
| ○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 37.94 | 13.87 | 13.87 | 1.64 | 1.64 | 0.39 | 0.39 | 96 | 10.30 | 0.20 | OMP (%): 100.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+ | libggml-cpu.so | 1.50 | 1.63 | 0.00 | 0.07 | 0.00 | 0.05 | 0.00 | 96 | 0.52 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1400 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.23 | 1.63 | 0.04 | 0.10 | 0.01 | 0.05 | 0.00 | 17 | 0.06 | 0.00 | | |
| ○Loop 1403 - ops.cpp:6446-6456 - libggml-cpu.so [...] | | 0.35 | 0.08 | 0.08 | 0.01 | 0.01 | 0.00 | 0.00 | 29 | 0.10 | 0.00 | | |
| ○Loop 1405 - ops.cpp:6429-6442 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1401 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1404 - ops.cpp:6413-6426 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1399 - ops.cpp:6210-6409 - libggml-cpu.so [...]+ | | 0.35 | 1.51 | 0.08 | 0.07 | 0.01 | 0.04 | 0.00 | 29 | 0.11 | 0.00 | | |
| ○Loop 1407 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1409 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1408 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 1.38 | 1.43 | 1.43 | 0.06 | 0.06 | 0.04 | 0.04 | 96 | 0.45 | 0.01 | | |
| ○Loop 1406 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1402 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_vec_swiglu_f32+ | libggml-cpu.so | 1.38 | 1.62 | 0.00 | 0.06 | 0.00 | 0.05 | 0.00 | 86 | 0.49 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○Loop 886 - vec.cpp:385-387 - libggml-cpu.so [...] | | 1.38 | 1.62 | 1.62 | 0.06 | 0.06 | 0.05 | 0.05 | 86 | 0.49 | 0.01 | | |
| ►Loop 884 - vec.cpp:402-405 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 883 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 885 - vec.cpp:403-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_flash_attn_ext+ | libggml-cpu.so | 1.38 | 1.30 | 0.00 | 0.06 | 0.00 | 0.04 | 0.00 | 94 | 0.53 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1713 - vec.h:282-725 - libggml-cpu.so [...]+ | | 0.58 | 1.30 | 0.29 | 0.12 | 0.03 | 0.04 | 0.01 | 69 | 0.24 | 0.00 | | |
| ○Loop 1718 - vec.h:411-458 - libggml-cpu.so | | 0.35 | 0.22 | 0.22 | 0.01 | 0.01 | 0.01 | 0.01 | 57 | 0.18 | 0.00 | | |
| ○Loop 1720 - vec.h:710-717 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1715 - vec.h:290-338 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1714 - vec.h:343-348 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1719 - vec.h:710-717 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1710 - vec.h:282-662 - libggml-cpu.so [...]+ | | 0.92 | 0.78 | 0.58 | 0.08 | 0.04 | 0.02 | 0.02 | 87 | 0.32 | 0.01 | | |
| ►Loop 1721 - ops.cpp:8778-8920 - libggml-cpu.so [...]+ | | 0.46 | 0.20 | 0.17 | 0.04 | 0.02 | 0.01 | 0.00 | 51 | 0.15 | 0.00 | | |
| ►Loop 1709 - vec.h:474-662 - libggml-cpu.so [...]+ | | 0.12 | 0.03 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 6 | 0.00 | 0.00 | | |
| ○Loop 1723 - vec.h:646-653 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1725 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1724 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.12 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | |
| ○Loop 1722 - vec.h:646-653 - libggml-cpu.so | | 0.12 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | | |
| ○Loop 1711 - vec.h:343-348 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1716 - vec.h:646-653 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1712 - vec.h:290-338 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1717 - vec.h:461-466 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 2.08 | 0.73 | 0.73 | 0.09 | 0.09 | 0.02 | 0.02 | 93 | 0.69 | 0.01 | OMP (%): 100.00 | |
| ►kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon+ | libggml-cpu.so | 0.81 | 0.56 | 0.00 | 0.04 | 0.00 | 0.02 | 0.00 | 64 | 0.39 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2318 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...]+ | | 0.00 | 0.56 | 0.00 | 0.04 | 0.00 | 0.02 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2317 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-258 - libggml-cpu.so [...] | | 0.81 | 0.56 | 0.56 | 0.04 | 0.04 | 0.02 | 0.02 | 64 | 0.39 | 0.01 | | |
| ►Loop 2315 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-335 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2316 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_rms_norm+ | libggml-cpu.so | 0.69 | 0.46 | 0.00 | 0.03 | 0.00 | 0.01 | 0.00 | 83 | 0.27 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1248 - ops.cpp:4319-4365 - libggml-cpu.so [...]+ | | 0.00 | 0.46 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1250 - vec.h:638-661 - libggml-cpu.so [...]+ | | 0.00 | 0.46 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1249 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.69 | 0.41 | 0.41 | 0.03 | 0.03 | 0.01 | 0.01 | 78 | 0.26 | 0.01 | | |
| ○Loop 1251 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1252 - vec.h:646-653 - libggml-cpu.so | | 0.23 | 0.05 | 0.05 | 0.01 | 0.01 | 0.00 | 0.00 | 18 | 0.08 | 0.00 | | |
| ○__sincosf_finite | libamath.so | 0.69 | 0.43 | 0.43 | 0.03 | 0.03 | 0.01 | 0.01 | 77 | 0.31 | 0.01 | Math (%): 100.00 | |
| ○__GI___sched_yield | libc.so.6 | 1.27 | 0.38 | 0.38 | 0.05 | 0.05 | 0.01 | 0.01 | 78 | 0.43 | 0.01 | OMP (%): 100.00 | |
| ○__expf_finite | libamath.so | 0.46 | 0.31 | 0.31 | 0.02 | 0.02 | 0.01 | 0.01 | 70 | 0.19 | 0.00 | Math (%): 100.00 | |
| ►ggml_vec_dot_f16+ | libggml-cpu.so | 0.46 | 0.27 | 0.05 | 0.02 | 0.01 | 0.01 | 0.00 | 67 | 0.21 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○Loop 878 - vec.cpp:266-269 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 879 - vec.cpp:231-262 - libggml-cpu.so | | 0.35 | 0.22 | 0.22 | 0.02 | 0.02 | 0.01 | 0.01 | 60 | 0.16 | 0.00 | | |
| ►kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0+ | libggml-cpu.so | 12.69 | 0.27 | 0.00 | 0.55 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2336 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 2.54 | 0.27 | 0.05 | 0.55 | 0.11 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2338 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 3.23 | 0.07 | 0.07 | 0.14 | 0.14 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2337 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...] | | 6.92 | 0.15 | 0.15 | 0.30 | 0.30 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 2333 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2334 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2335 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2329 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2331 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2332 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2330 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-142 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2327 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2328 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2326 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:123-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2325 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-148 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2324 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:145-148 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2323 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_add_non_quantized+ | libggml-cpu.so | 0.46 | 0.25 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 64 | 0.20 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 395 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 394 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 393 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 392 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 391 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 410 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 409 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 417 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 416 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 402 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 400 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 403 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 401 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 380 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 381 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 383 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 384 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 382 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 408 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 407 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 406 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 405 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 404 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 411 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.24 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 412 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.24 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 415 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 414 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 413 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.46 | 0.24 | 0.24 | 0.02 | 0.02 | 0.01 | 0.01 | 63 | 0.21 | 0.00 | | |
| ►Loop 375 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 374 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 376 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 390 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 387 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 386 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 388 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 389 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 385 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 419 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 418 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 378 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 379 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 377 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 368 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 367 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 370 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 369 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 372 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 373 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 371 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 396 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 398 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 397 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 399 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_mul+ | libggml-cpu.so | 0.35 | 0.22 | 0.01 | 0.02 | 0.01 | 0.01 | 0.00 | 62 | 0.15 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 524 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 523 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 485 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 486 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 484 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 497 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 494 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 495 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 493 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 496 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 492 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 515 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 511 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 514 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 512 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 513 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 518 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.12 | 0.21 | 0.00 | 0.02 | 0.01 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 519 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.21 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 520 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.35 | 0.21 | 0.21 | 0.02 | 0.02 | 0.01 | 0.01 | 60 | 0.14 | 0.00 | | |
| ○Loop 522 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 521 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 509 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 510 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 508 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 507 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 482 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 481 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 483 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 475 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 474 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 477 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 476 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 479 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 480 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 478 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 526 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 525 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 502 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 501 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 500 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 499 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 498 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 487 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 488 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 489 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 491 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 490 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 503 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 505 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 504 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 506 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 517 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 516 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_vec_dot_q6_K_q8_K+ | libggml-cpu.so | 0.12 | 0.19 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 77 | 0.01 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 2234 - quants.c:2683-2812 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2237 - quants.c:2492-2660 - libggml-cpu.so [...]+ | | 0.12 | 0.19 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 5 | 0.00 | 0.00 | | |
| ○Loop 2236 - quants.c:2506-2590 - libggml-cpu.so [...] | | 0.12 | 0.17 | 0.17 | 0.01 | 0.01 | 0.00 | 0.00 | 71 | 0.01 | 0.00 | | |
| ○Loop 2235 - quants.c:2683-2758 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__GI___pthread_mutex_lock | libc.so.6 | 0.81 | 0.18 | 0.18 | 0.04 | 0.04 | 0.01 | 0.01 | 49 | 0.24 | 0.01 | Pthread (%): 100.00 | |
| ►ggml_cpu_fp32_to_fp16+ | libggml-cpu.so | 0.35 | 0.16 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 45 | 0.15 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 0 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.35 | 0.16 | 0.16 | 0.02 | 0.02 | 0.00 | 0.00 | 45 | 0.15 | 0.00 | | |
| ○Loop 1 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○unknown_function | [vdso] | 0.46 | 0.11 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 31 | 0.20 | 0.00 | OMP (%): 100.00 | |
| ○unknown_function | libggml-cpu.so | 0.23 | 0.09 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 31 | 0.08 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | |
| ►ggml_graph_compute_thread+ | libggml-cpu.so | 0.23 | 0.07 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 26 | 0.08 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 90 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 84 - ggml-cpu.c:1585-1587 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 73 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]+ | | 0.23 | 0.06 | 0.06 | 0.01 | 0.01 | 0.00 | 0.00 | 23 | 0.07 | 0.00 | | |
| ○Loop 72 - ggml-cpu.c:533-2897 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 75 - ggml-cpu.c:1436-1642 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 79 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 78 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 77 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 76 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 83 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 82 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 81 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 80 - ggml-cpu.c:1461-1462 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 74 - ggml-cpu.c:1592-1601 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 88 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 87 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 86 - ggml-cpu.c:1572-1579 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 85 - ggml-cpu.c:1573-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__memcpy | libastring.so | 0.23 | 0.05 | 0.05 | 0.01 | 0.01 | 0.00 | 0.00 | 17 | 0.10 | 0.00 | String (%): 100.00 | |
| ○__aarch64_ldadd8_acq_rel | libomp.so | 0.12 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 0.00 | 15 | 0.02 | 0.00 | OMP (%): 100.00 | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.23 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 11 | 0.08 | 0.00 | OMP (%): 100.00 | |
| ○@plt_start@ | libomp.so | 0.35 | 0.03 | 0.03 | 0.02 | 0.02 | 0.00 | 0.00 | 8 | 0.19 | 0.00 | OMP (%): 100.00 | |
| ○ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*) | libggml-cpu.so | 0.23 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.08 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.12 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.00 | 0.00 | OMP (%): 100.00 | |
| ○__GI___lll_lock_wait | libc.so.6 | 0.12 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.00 | 0.00 | System (%): 100.00 | |
| ○__kmp_yield | libomp.so | 0.12 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.02 | 0.00 | OMP (%): 100.00 | |
| ○__memset | libastring.so | 0.46 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 2 | 0.33 | 0.01 | String (%): 100.00 | |
| ►ggml_cpu_extra_compute_forward+ | libggml-cpu.so | 0.12 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-131-3962/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○Loop 365 - traits.cpp:13-17 - libggml-cpu.so [...] | | 0.12 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |