Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | |
|---|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 46.82 | 24.20 | 12.85 | 7.23 | 4.56 | 3.86 | 4.85 | 9.56 | 15.22 | |
| Max (Thread Active Time) (s) | 46.65 | 23.92 | 12.51 | 6.77 | 4.03 | 3.11 | 2.88 | 4.51 | 6.41 | |
| Average Active Time (s) | 46.65 | 23.90 | 10.02 | 3.47 | 3.97 | 3.02 | 2.50 | 3.51 | 5.10 | |
| Activity Ratio (%) | 99.6 | 98.6 | 78.5 | 49.7 | 87.6 | 80.1 | 53.5 | 38.1 | 34.6 | |
| Average number of active threads | 0.996 | 1.975 | 3.898 | 7.677 | 13.934 | 25.023 | 32.949 | 46.958 | 64.379 | |
| Affinity Stability (%) | 100.0 | 99.8 | 99.7 | 98.4 | 99.7 | 99.7 | 99.8 | 99.9 | 99.9 | |
| Time in analyzed loops (%) | 64.2 | 61.5 | 58.9 | 53.8 | 46.9 | 37.8 | 23.2 | 8.33 | 3.76 | |
| Time in analyzed innermost loops (%) | 1.17 | 1.12 | 0.97 | 0.98 | 0.86 | 0.70 | 0.48 | 0.17 | 0.08 | |
| Time in user code (%) | 64.2 | 61.5 | 58.9 | 53.8 | 47.0 | 37.9 | 23.2 | 8.38 | 3.79 | |
| Compilation Options Score (%) | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Array Access Efficiency (%) | 50.1 | 50.1 | 50.0 | 50.1 | 50.1 | 50.1 | 50.1 | 50.1 | 50.2 | |
| Potential Speedups | ||||||||||
| Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.01 | 1.03 | 1.07 | 1.16 | 1.31 | 1.70 | 2.17 | 1.94 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.01 | 1.29 | 2.10 | 1.19 | 1.37 | 1.97 | 2.61 | 2.55 | |
| Scalability - Gap | 1.00 | 1.03 | 1.37 | 2.47 | 1.56 | 2.64 | 6.63 | 26.15 | 62.44 | |
| No Scalar Integer | Potential Speedup | 1.26 | 1.24 | 1.23 | 1.21 | 1.17 | 1.14 | 1.08 | 1.03 | 1.01 |
| Nb Loops to get 80% | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
| FP Vectorised | Potential Speedup | 1.42 | 1.40 | 1.38 | 1.33 | 1.28 | 1.21 | 1.12 | 1.04 | 1.02 |
| Nb Loops to get 80% | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
| Fully Vectorised | Potential Speedup | 2.20 | 2.10 | 2.01 | 1.84 | 1.67 | 1.47 | 1.25 | 1.08 | 1.03 |
| Nb Loops to get 80% | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | |
| Only FP Arithmetic | Potential Speedup | 1.55 | 1.51 | 1.48 | 1.42 | 1.35 | 1.27 | 1.15 | 1.05 | 1.02 |
| Nb Loops to get 80% | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
| Source Object | Issue |
|---|---|
| ▼xy_model | |
| ▼mpi_utils.cpp | |
| ○ | |
| ▼simulation.cpp | |
| ○ |
| r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | |
|---|---|---|---|---|---|---|---|---|---|
| Experiment Name | |||||||||
| Application | ./xy_model | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2026-06-19 17:51:40 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | same as r0 | MPI; OpenMP; | same as r2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Machine | kairoscomp92 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | Intel(R) Xeon(R) 6952P | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Cache Size | 491520 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of Cores | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 2.10 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 5.14.0-570.62.1.el9_6.x86_64 #1 SMP PREEMPT_DYNAMIC Mon Nov 3 10:38:50 EST 2025 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | xy_model: --driver-mode=g++ --intel -std=c++20 -Wall -Wextra -O3 -ffast-math -march=native -funroll-loops -g -fno-omit-frame-pointer -I include -c simulation.cpp -o simulation.o -I /softs/x86/openmpi/openmpi-5.0.8/intel/include -fveclib=SVML | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of processes observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 |
| Number of threads observed | 1 | 2 | 5 | 16 | same as r3 | 32 | 64 | 128 | 192 |
| Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | always | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | acf4f6a73554ef689d8bd1203c034c7f918bf3ab::20260616-181519 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |