options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_32_threads (%) Coverage run_64_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_32_threads (%) Coverage Excluding Loops run_64_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_64_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_32_threads Deviation (coverage) run_64_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_32_threads Deviation (walltime) run_64_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_32_threads Categories run_64_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_32_threads GFLOPS run_64_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_32_threads (%)Coverage run_64_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_32_threads (%)Coverage Excluding Loops run_64_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_64_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_32_threadsDeviation (coverage) run_64_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_32_threadsDeviation (walltime) run_64_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_32_threadsCategories run_64_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_32_threadsGFLOPS run_64_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined]+kmeans-acfl-O3-all99.9898.9197.0395.6794.1987.8049.430.000.000.000.000.000.000.00121.5361.5531.5115.957.973.942.050.000.000.000.000.000.000.00121.5360.8930.6115.357.673.701.300.000.000.000.000.000.000.0012481632640.001.522.012.882.884.8621.060.001.160.820.620.310.300.56Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.050.090.220.621.704.2015.26Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...1010.20.990.720.990.990.990.871.0301.460
Loop 6 - main.cpp:114-122 - kmeans-acfl-O3-all+99.9898.9197.0395.6794.1987.8049.431.030.900.920.930.860.880.45121.5361.5531.5215.978.003.982.051.250.570.300.190.110.060.03121.5360.8930.6115.357.673.701.301.250.550.290.150.070.040.0112481632530.000.040.080.210.280.340.540.000.020.020.030.020.010.010.040.070.220.631.744.0916.18101.1301.0801.0501.1201.0501.640
Loop 7 - main.cpp:116-122 - kmeans-acfl-O3-all98.9698.0196.1294.7493.3386.9148.9898.9698.0196.1294.7493.3386.9148.98120.2860.9831.2115.777.893.922.02120.2860.9831.2115.777.893.922.02120.2860.3330.3215.207.603.661.29120.2860.3330.3215.207.603.661.2912481632640.001.492.022.732.764.7920.740.001.130.820.590.300.300.560.050.090.220.621.704.2015.261010.310.990.790.991.030.990.971.0301.460
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3]+kmeans-acfl-O3-all0.010.020.080.230.622.480.750.000.000.000.000.000.000.000.010.010.030.070.080.200.080.000.000.000.000.000.000.000.010.010.020.040.050.100.020.000.000.000.000.000.000.0012481531640.000.000.020.150.341.470.600.000.000.010.020.030.050.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.000.000.000.170.550.927.28Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-...100.7500.160.060.050.210.020.6102.470.010.74
Loop 19 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 22 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 14 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 24 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 18 - main.cpp:138-138 - kmeans-acfl-O3-all+0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 17 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 16 - main.cpp:138-138 - kmeans-acfl-O3-all+0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 15 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 23 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 21 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 20 - main.cpp:138-138 - kmeans-acfl-O3-all0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 25 - main.cpp:139-143 - kmeans-acfl-O3-all0.010.020.080.230.622.480.750.010.020.080.230.622.480.750.010.010.030.070.080.200.080.010.010.030.070.080.200.080.010.010.020.040.050.100.020.010.010.020.040.050.100.0212481531640.000.000.020.150.341.470.600.000.000.010.020.030.050.010.000.000.000.170.550.927.28100.7500.160.060.050.210.020.6102.470.010.74
__kmp_yieldlibomp.so0.000.000.000.000.010.020.070.000.000.000.000.010.020.070.000.000.000.000.000.000.010.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.00001035110.000.000.000.000.000.000.190.000.000.000.000.000.000.00NANAOMP (%): 100.00NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.00
unknown_function[vdso]0.000.010.010.040.030.050.380.000.000.000.000.000.000.000.000.010.010.010.010.010.020.000.000.000.000.000.000.000.000.010.000.010.000.000.010.000.000.000.000.000.000.000125512430.000.000.010.030.030.050.390.000.000.000.000.000.000.00NAOMP (%): 100.00OMP (%): 66.67
Pthread (%): 33.33
Pthread (%): 77.78
OMP (%): 22.22
Pthread (%): 71.43
OMP (%): 28.57
Pthread (%): 85.71
OMP (%): 14.29
Pthread (%): 90.54
OMP (%): 9.46
0.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so0.000.170.310.380.440.753.730.000.170.310.380.440.753.730.000.210.220.130.120.090.110.000.210.220.130.120.090.110.000.110.100.060.040.030.100.000.110.100.060.040.030.1001381531630.000.000.270.310.380.442.160.000.000.080.050.030.020.02NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.0010101010101010
__sched_yieldlibc.so.60.000.070.110.150.180.341.890.000.070.110.150.180.341.890.000.080.060.040.040.030.060.000.080.060.040.040.030.060.000.040.040.020.010.010.050.000.040.040.020.010.010.0501381326630.000.000.040.100.130.181.080.000.000.010.020.010.010.01NAOMP (%): 100.00Pthread (%): 57.14
OMP (%): 42.86
System (%): 0.00
Pthread (%): 78.38
OMP (%): 21.62
Pthread (%): 91.11
OMP (%): 8.89
Pthread (%): 94.19
OMP (%): 5.81
Pthread (%): 95.10
OMP (%): 4.90
System (%): 0.00
0.000.000.000.000.000.000.0010101010101010
__kmpc_for_static_finilibomp.so0.000.000.000.000.000.000.010.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANAOMP (%): 100.000.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check()libomp.so0.000.000.000.000.000.010.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000200.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANAOMP (%): 100.00NA0.000.000.000.000.000.000.00
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)libomp.so0.000.000.000.000.000.000.010.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.0000000010.000.000.000.000.000.000.000.000.000.000.000.000.000.00NANANANANANAOMP (%): 100.000.000.000.000.000.000.000.00
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so0.000.822.443.544.518.5043.560.000.822.443.544.518.5043.560.001.011.141.090.750.760.970.001.011.141.090.750.760.970.000.510.770.570.370.361.140.000.510.770.570.370.361.1401481632640.000.001.632.392.534.0418.710.000.000.510.370.200.160.14NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.0010101010101010
×