Functions and Loops

Each metric lists one value per run, in this order:
run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads

Name: k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined]
Module: kmeans-acfl-Ofast
Coverage (%): 100.00 | 98.94 | 97.07 | 95.80 | 94.27 | 87.88 | 34.19
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 121.62 | 61.50 | 31.49 | 15.80 | 8.05 | 3.98 | 1.37
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 121.62 | 60.88 | 30.62 | 15.29 | 7.64 | 3.71 | 0.92
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 1 | 2 | 4 | 8 | 16 | 32 | 64
Deviation (coverage): 0.00 | 1.45 | 1.91 | 2.76 | 3.21 | 5.67 | 12.56
Deviation (walltime): 0.00 | 1.17 | 0.81 | 0.58 | 0.36 | 0.34 | 0.20
Categories: Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00
GFLOPS: 0.07 | 0.10 | 0.21 | 0.67 | 1.74 | 4.25 | 36.53
Compilation Options: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -Ofast -greco...
Efficiency: 1 | 1 | 0.99 | 0.99 | 0.99 | 1.02 | 2.06
Potential Speed-Up (%): 0 | 0.11 | 0.66 | 0.52 | 0.48 | 0 | 0

Name: Loop 6 - main.cpp:114-122 - kmeans-acfl-Ofast
Coverage (%): 100.00 | 98.94 | 97.07 | 95.80 | 94.27 | 87.88 | 34.19
Coverage Excluding Loops (%): 0.94 | 0.94 | 0.93 | 0.89 | 0.80 | 0.76 | 0.42
Max Inclusive Time Over Threads (s): 121.62 | 61.50 | 31.49 | 15.88 | 8.12 | 4.01 | 1.38
Max Exclusive Time Over Threads (s): 1.14 | 0.60 | 0.38 | 0.17 | 0.10 | 0.05 | 0.03
Inclusive Time w.r.t. Wall Time (s): 121.62 | 60.88 | 30.62 | 15.29 | 7.64 | 3.71 | 0.92
Exclusive Time w.r.t. Wall Time (s): 1.14 | 0.58 | 0.29 | 0.14 | 0.06 | 0.03 | 0.01
Nb Threads: 1 | 2 | 4 | 8 | 16 | 32 | 63
Deviation (coverage): 0.00 | 0.06 | 0.20 | 0.16 | 0.27 | 0.30 | 0.37
Deviation (walltime): 0.00 | 0.04 | 0.07 | 0.02 | 0.02 | 0.01 | 0.01
GFLOPS: 0.06 | 0.10 | 0.18 | 0.72 | 1.72 | 4.67 | 28.40
Efficiency: 1 | 0.99 | 0.97 | 1.01 | 1.1 | 1.12 | 1.56
Potential Speed-Up (%): 0 | 0.01 | 0.02 | 0 | 0 | 0 | 0

Name: Loop 7 - main.cpp:116-122 - kmeans-acfl-Ofast
Coverage (%): 99.06 | 98.00 | 96.14 | 94.92 | 93.47 | 87.13 | 33.77
Coverage Excluding Loops (%): 99.06 | 98.00 | 96.14 | 94.92 | 93.47 | 87.13 | 33.77
Max Inclusive Time Over Threads (s): 120.48 | 60.90 | 31.11 | 15.70 | 8.01 | 3.95 | 1.35
Max Exclusive Time Over Threads (s): 120.48 | 60.90 | 31.11 | 15.70 | 8.01 | 3.95 | 1.35
Inclusive Time w.r.t. Wall Time (s): 120.48 | 60.30 | 30.32 | 15.14 | 7.58 | 3.68 | 0.91
Exclusive Time w.r.t. Wall Time (s): 120.48 | 60.30 | 30.32 | 15.14 | 7.58 | 3.68 | 0.91
Nb Threads: 1 | 2 | 4 | 8 | 16 | 32 | 64
Deviation (coverage): 0.00 | 1.38 | 1.75 | 2.79 | 3.12 | 5.68 | 12.35
Deviation (walltime): 0.00 | 1.12 | 0.76 | 0.58 | 0.36 | 0.34 | 0.20
GFLOPS: 0.07 | 0.10 | 0.21 | 0.67 | 1.74 | 4.25 | 36.63
Efficiency: 1 | 1 | 0.99 | 0.99 | 0.99 | 1.02 | 2.07
Potential Speed-Up (%): 0 | 0.1 | 0.64 | 0.52 | 0.56 | 0 | 0

Name: unknown_function
Module: [vdso]
Coverage (%): 0.00 | 0.02 | 0.02 | 0.04 | 0.06 | 0.07 | 0.40
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.03
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 1 | 3 | 5 | 10 | 14 | 48
Deviation (coverage): 0.00 | 0.00 | 0.01 | 0.04 | 0.05 | 0.08 | 0.31
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01
Categories: NA | OMP (%): 100.00 | Pthread (%): 60.00, OMP (%): 40.00 | Pthread (%): 60.00, OMP (%): 40.00 | Pthread (%): 92.86, OMP (%): 7.14 | Pthread (%): 84.21, OMP (%): 15.79 | Pthread (%): 96.36, OMP (%): 3.64
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0

Name: __kmp_yield
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.01 | 0.09
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.01 | 0.09
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 3 | 1 | 1 | 3 | 21
Deviation (coverage): 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.11
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Name: kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()
Module: libomp.so
Coverage (%): 0.00 | 0.15 | 0.31 | 0.40 | 0.42 | 0.76 | 4.47
Coverage Excluding Loops (%): 0.00 | 0.15 | 0.31 | 0.40 | 0.42 | 0.76 | 4.47
Max Inclusive Time Over Threads (s): 0.00 | 0.18 | 0.22 | 0.15 | 0.13 | 0.10 | 0.27
Max Exclusive Time Over Threads (s): 0.00 | 0.18 | 0.22 | 0.15 | 0.13 | 0.10 | 0.27
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.09 | 0.10 | 0.06 | 0.03 | 0.03 | 0.12
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.09 | 0.10 | 0.06 | 0.03 | 0.03 | 0.12
Nb Threads: 0 | 1 | 3 | 8 | 14 | 31 | 62
Deviation (coverage): 0.00 | 0.00 | 0.28 | 0.32 | 0.43 | 0.60 | 1.69
Deviation (walltime): 0.00 | 0.00 | 0.09 | 0.05 | 0.03 | 0.02 | 0.04
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0

Name: __kmp_finish_implicit_task
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 1 | 0
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | OMP (%): 100.00 | NA
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Name: __kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*)
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 1
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Name: kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check()
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 1
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Name: __sched_yield
Module: libc.so.6
Coverage (%): 0.00 | 0.03 | 0.11 | 0.12 | 0.23 | 0.36 | 2.20
Coverage Excluding Loops (%): 0.00 | 0.03 | 0.11 | 0.12 | 0.23 | 0.36 | 2.20
Max Inclusive Time Over Threads (s): 0.00 | 0.04 | 0.05 | 0.05 | 0.06 | 0.03 | 0.09
Max Exclusive Time Over Threads (s): 0.00 | 0.04 | 0.05 | 0.05 | 0.06 | 0.03 | 0.09
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.06
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.06
Nb Threads: 0 | 1 | 3 | 7 | 12 | 27 | 62
Deviation (coverage): 0.00 | 0.00 | 0.06 | 0.10 | 0.22 | 0.22 | 0.76
Deviation (walltime): 0.00 | 0.00 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02
Categories: NA | OMP (%): 100.00 | Pthread (%): 59.26, OMP (%): 40.74 | Pthread (%): 90.00, OMP (%): 10.00 | Pthread (%): 91.38, OMP (%): 8.62, System (%): 0.00 | Pthread (%): 93.41, System (%): 0.00, OMP (%): 6.59 | Pthread (%): 95.72, OMP (%): 4.28, System (%): 0.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0

Name: kmp_flag_64<false, true>::wait(kmp_info*, int, void*)
Module: libomp.so
Coverage (%): 0.00 | 0.82 | 2.39 | 3.40 | 4.45 | 8.31 | 50.86
Coverage Excluding Loops (%): 0.00 | 0.82 | 2.39 | 3.40 | 4.45 | 8.31 | 50.86
Max Inclusive Time Over Threads (s): 0.00 | 1.00 | 1.05 | 1.10 | 0.78 | 0.79 | 1.50
Max Exclusive Time Over Threads (s): 0.00 | 1.00 | 1.05 | 1.10 | 0.78 | 0.79 | 1.50
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.50 | 0.75 | 0.54 | 0.36 | 0.35 | 1.37
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.50 | 0.75 | 0.54 | 0.36 | 0.35 | 1.37
Nb Threads: 0 | 1 | 4 | 8 | 16 | 32 | 63
Deviation (coverage): 0.00 | 0.00 | 1.55 | 2.37 | 2.56 | 4.35 | 9.41
Deviation (walltime): 0.00 | 0.00 | 0.48 | 0.37 | 0.20 | 0.16 | 0.31
Categories: NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Efficiency: 1 | 1 | 1 | 1 | 1 | 1 | 1
Potential Speed-Up (%): 0 | 0 | 0 | 0 | 0 | 0 | 0

Name: __kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 0 | 1
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | NA | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Name: __kmp_now_nsec
Module: libomp.so
Coverage (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.05
Coverage Excluding Loops (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.05
Max Inclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01
Max Exclusive Time Over Threads (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01
Inclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Exclusive Time w.r.t. Wall Time (s): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Nb Threads: 0 | 0 | 0 | 0 | 0 | 1 | 12
Deviation (coverage): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.08
Deviation (walltime): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
Categories: NA | NA | NA | NA | NA | OMP (%): 100.00 | OMP (%): 100.00
GFLOPS: 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
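
The Efficiency values reported for the k_means row look consistent with the usual strong-scaling definitions, speedup = T(1) / T(n) and efficiency = speedup / n, applied to its Inclusive Time w.r.t. Wall Time values. That is an assumption about how the column is derived, not something the report states. The sketch below recomputes both quantities from the times copied out of the k_means row; the function name `strong_scaling` is illustrative only.

```python
# Thread counts of the seven runs and the k_means row's
# "Inclusive Time w.r.t. Wall Time (s)" values, copied from the table.
THREADS = [1, 2, 4, 8, 16, 32, 64]
WALL_TIME_S = [121.62, 60.88, 30.62, 15.29, 7.64, 3.71, 0.92]

# Efficiency values reported in the same row, kept for comparison.
REPORTED_EFFICIENCY = [1, 1, 0.99, 0.99, 0.99, 1.02, 2.06]


def strong_scaling(threads, times):
    """Return (threads, speedup, efficiency) tuples relative to the first run.

    Assumed definitions (not confirmed by the report):
    speedup = T(1) / T(n), efficiency = speedup / n.
    """
    base = times[0]
    rows = []
    for n, t in zip(threads, times):
        speedup = base / t
        rows.append((n, speedup, speedup / n))
    return rows


if __name__ == "__main__":
    for (n, speedup, eff), reported in zip(
        strong_scaling(THREADS, WALL_TIME_S), REPORTED_EFFICIENCY
    ):
        print(f"{n:>2} threads: speedup {speedup:7.2f}, "
              f"efficiency {eff:.3f} (reported {reported})")
```

Because the inputs are already rounded to two decimals, the recomputed efficiencies only agree with the reported column to within about 0.01. An efficiency above 1 at 32 and 64 threads means the region's wall time dropped faster than the thread count grew; the table alone does not say why.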