options

Functions and Loops

Columns Filter

Max Thread Time / Walltime armclang_1 (%) Coverage armclang_1 (%) Coverage Excluding Loops armclang_1 (%) Max Inclusive Time Over Threads armclang_1 (s) Max Exclusive Time Over Threads armclang_1 (s) Inclusive Time w.r.t. Wall Time armclang_1 (s) Exclusive Time w.r.t. Wall Time armclang_1 (s) Nb Threads armclang_1 Deviation (coverage) armclang_1 Deviation (walltime) armclang_1 Categories armclang_1 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories Compilation Options
NameModuleMax Thread Time / Walltime armclang_1 (%)Coverage armclang_1 (%)Coverage Excluding Loops armclang_1 (%)Max Inclusive Time Over Threads armclang_1 (s)Max Exclusive Time Over Threads armclang_1 (s)Inclusive Time w.r.t. Wall Time armclang_1 (s)Exclusive Time w.r.t. Wall Time armclang_1 (s)Nb Threads armclang_1Deviation (coverage) armclang_1Deviation (walltime) armclang_1Categories armclang_1Compilation Options
kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm+libggml-cpu.so43.8586.020.002.960.003.580.00642.220.03/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 2398 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.0786.020.002.980.013.580.0020.000.00
Loop 2397 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.4486.010.412.970.033.580.02610.180.01
Loop 2396 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so43.5585.6085.602.942.943.573.57642.220.03
Loop 2395 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2394 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2393 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so3.705.865.860.250.250.240.24640.950.03OMP (%): 100.00
ggml_compute_forward_flash_attn_ext+libggml-cpu.so1.111.460.000.080.010.060.00640.310.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 1739 - ops.cpp:8778-8920 - libggml-cpu.so [...]+0.001.460.000.110.000.060.0000.000.00
Loop 1740 - vec.h:282-725 - libggml-cpu.so [...]+0.001.460.000.110.000.060.0000.000.00
Loop 1741 - vec.h:646-653 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1738 - vec.h:282-725 - libggml-cpu.so [...]+0.071.460.020.110.010.060.0080.000.00
Loop 1754 - vec.h:646-653 - libggml-cpu.so0.070.020.020.010.010.000.0090.000.00
Loop 1742 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.070.000.000.000.000.000.0010.000.00
Loop 1743 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1744 - ops.cpp:8793-8881 - libggml-cpu.so [...]+1.111.411.250.100.080.060.05640.290.01
Loop 1749 - vec.h:646-653 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1751 - vec.h:411-458 - libggml-cpu.so0.300.160.160.020.020.010.01410.130.00
Loop 1753 - vec.h:710-717 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1747 - vec.h:343-348 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1752 - vec.h:710-717 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1748 - vec.h:290-338 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1750 - vec.h:461-466 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1746 - vec.h:290-338 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1745 - vec.h:343-348 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+libggml-cpu.so1.701.370.010.120.000.060.00640.450.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 1433 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.221.360.050.140.020.060.00170.080.00
Loop 1437 - ops.cpp:6413-6426 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1434 - ops.cpp:6462-6475 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1435 - ops.cpp:6479-6484 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1438 - ops.cpp:6429-6442 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1432 - ops.cpp:6210-6462 - libggml-cpu.so [...]+0.071.220.040.110.010.050.00170.000.00
Loop 1440 - ops.cpp:6220-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1439 - ops.cpp:6210-6303 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1441 - ops.cpp:6220-6245 - libggml-cpu.so [...]1.481.181.180.100.100.050.05640.410.01
Loop 1442 - ops.cpp:6210-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1436 - ops.cpp:6446-6456 - libggml-cpu.so [...]0.220.090.090.020.020.000.00350.060.00
ggml_vec_swiglu_f32+libggml-cpu.so0.740.840.000.050.000.040.00640.230.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 908 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 909 - vec.cpp:403-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 910 - vec.cpp:385-387 - libggml-cpu.so [...]0.740.840.840.050.050.040.04640.230.01
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.740.620.620.050.050.030.03640.300.01OMP (%): 100.00
__sincosf_finitelibamath.so0.440.440.440.030.030.020.02590.210.01Math (%): 100.00
kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon+libggml-cpu.so0.520.360.000.040.010.020.00610.210.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 2358 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-335 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2359 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2361 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...]+0.000.360.000.040.000.010.0000.000.00
Loop 2360 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-258 - libggml-cpu.so [...]0.520.360.360.040.040.010.01610.210.01
__expf_finitelibamath.so0.440.350.350.030.030.010.01590.200.01Math (%): 100.00
ggml_compute_forward_mul+libggml-cpu.so0.520.310.000.040.010.010.00600.180.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 524 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 519 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 521 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 520 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 522 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 523 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 548 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 550 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 549 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 547 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 551 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 554 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 556 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 555 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 553 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 552 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 515 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 516 - binary-ops.cpp:18-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 514 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 543 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.150.310.000.040.010.010.0010.000.00
Loop 545 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 544 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.370.300.300.030.030.010.01600.160.01
Loop 546 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 542 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 535 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 533 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 537 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 534 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 536 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 504 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 503 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 506 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 505 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 517 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 518 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 508 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 509 - binary-ops.cpp:18-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 507 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 526 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 525 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 529 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 527 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 528 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 540 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 538 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 541 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 539 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 512 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 513 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 511 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 510 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 531 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 532 - binary-ops.cpp:18-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 530 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 558 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 557 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_rms_norm+libggml-cpu.so0.440.300.000.030.000.010.00570.170.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 1280 - ops.cpp:4319-4365 - libggml-cpu.so [...]+0.000.300.000.040.000.010.0000.000.00
Loop 1282 - ops.cpp:4319-4365 - libggml-cpu.so [...]+0.000.300.000.040.000.010.0000.000.00
Loop 1283 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1281 - ops.cpp:4325-4326 - libggml-cpu.so0.370.270.270.030.030.010.01560.160.01
Loop 1284 - vec.h:646-653 - libggml-cpu.so0.150.030.030.010.010.000.00110.040.00
kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0+libggml-cpu.so9.100.290.000.620.000.010.0010.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 2375 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2377 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2376 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2369 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2365 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2370 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2368 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:123-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2367 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-148 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2366 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:145-148 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2371 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2373 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2374 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2372 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-142 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2378 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+1.260.290.040.610.090.010.0010.000.00
Loop 2380 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so3.400.110.110.230.230.000.0010.000.00
Loop 2379 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...]4.440.140.140.300.300.010.0110.000.00
ggml_vec_dot_q6_K_q8_K+libggml-cpu.so0.150.280.000.010.000.010.00640.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 2280 - quants.c:2683-2812 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2281 - quants.c:2683-2758 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2283 - quants.c:2492-2660 - libggml-cpu.so [...]+0.070.270.030.020.010.010.00120.010.00
Loop 2282 - quants.c:2506-2590 - libggml-cpu.so [...]0.150.250.250.010.010.010.01630.070.00
ggml_compute_forward_add_non_quantized+libggml-cpu.so0.300.270.000.020.000.010.00550.140.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 415 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 417 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 414 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 418 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 416 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 404 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 403 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 405 - binary-ops.cpp:10-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 397 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 396 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 398 - binary-ops.cpp:10-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 406 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 407 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 432 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.260.000.020.000.010.0000.000.00
Loop 434 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 431 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 435 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 433 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.300.260.260.020.020.010.01540.140.00
Loop 446 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 445 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 437 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 439 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 438 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 436 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 440 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 393 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 395 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 394 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 392 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 413 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 408 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 410 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 412 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 411 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 409 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 401 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 402 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 400 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 399 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 441 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 443 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 444 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 442 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 420 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 419 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 421 - binary-ops.cpp:10-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 424 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 422 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 426 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 423 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 425 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 429 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 430 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 427 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 428 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__sched_yieldlibc.so.60.370.250.250.030.030.010.01530.140.00OMP (%): 100.00
ggml_vec_dot_f16+libggml-cpu.so0.300.210.090.020.020.010.00490.140.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 902 - vec.cpp:266-269 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 903 - vec.cpp:231-262 - libggml-cpu.so0.300.120.120.020.020.010.01380.100.00
ggml_cpu_fp32_to_fp16+libggml-cpu.so0.220.130.000.020.000.010.00370.100.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 0 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...]0.220.130.130.020.020.010.01370.100.00
$xlibc.so.60.220.070.070.020.020.000.00210.100.00System (%): 100.00
unknown_functionlibggml-cpu.so0.150.050.000.010.000.000.00200.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00
__memcpylibastring.so0.150.050.050.010.010.000.00200.050.00String (%): 100.00
unknown_function[vdso]0.150.040.000.010.000.000.00160.050.00OMP (%): 100.00
__memsetlibastring.so1.040.040.040.070.070.000.0030.880.04String (%): 100.00
__pthread_mutex_locklibc.so.60.070.030.030.010.010.000.00130.000.00Pthread (%): 100.00
__aarch64_ldadd8_acq_rellibomp.so0.070.030.030.010.010.000.00110.010.00OMP (%): 100.00
ggml_graph_compute_thread+libggml-cpu.so0.150.030.000.010.000.000.00100.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE...
Loop 87 - ggml-cpu.c:1592-1601 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 88 - ggml-cpu.c:1436-1642 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 96 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 95 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 94 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 93 - ggml-cpu.c:1461-1462 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 92 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 91 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 90 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 89 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 97 - ggml-cpu.c:1585-1587 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 86 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]+0.150.030.030.010.010.000.00100.050.00
Loop 85 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 99 - ggml-cpu.c:1572-1579 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 98 - ggml-cpu.c:1573-1579 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 102 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 101 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 100 - ggml-cpu.c:1552-1560 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 103 - ggml-cpu.c:2087-2088 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
std::_Hashtable<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, std::pair<std::pai...+libllama.so0.740.020.020.050.040.000.0010.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libllama.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_SHARED -D GGML_SHARED -D GGML_USE_BLAS -D GGML_USE_CPU -D LLAMA_BUILD -D...
Loop 3073 - hashtable.h:2077-2077 - libllama.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 3074 - hashtable.h:2074-2077 - libllama.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 3072 - hashtable.h:2077-2077 - libllama.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 3075 - hashtable.h:2074-2077 - libllama.so [...]0.220.010.010.020.020.000.0010.000.00
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*)libomp.so0.070.020.020.010.010.000.0070.000.00OMP (%): 100.00
__kmp_barrierlibomp.so0.150.020.020.010.010.000.0060.060.00OMP (%): 100.00
ggml_cpu_extra_compute_forward+libggml-cpu.so0.070.010.000.010.000.000.0060.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-131-5415/llama.cpp/build/llama.cpp/../armclang_1/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -...
Loop 390 - traits.cpp:13-17 - libggml-cpu.so [...]0.070.010.010.010.010.000.0060.000.00
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)libomp.so0.150.010.010.010.010.000.0050.070.00OMP (%): 100.00
×