Help is available by moving the cursor above any symbol or by checking MAQAO website.
Total Time (s) | 238.17 | ||
Max (Thread Active Time) (s) | 237.57 | ||
Average Active Time (s) | 237.57 | ||
Activity Ratio (%) | 99.7 | ||
Average number of active threads | 1.000 | ||
Affinity Stability (%) | 100.0 | ||
GFLOPS | 9.893 | ||
Time in analyzed loops (%) | 99.6 | ||
Time in analyzed innermost loops (%) | 99.6 | ||
Time in user code (%) | 99.9 | ||
Compilation Options Score (%) | 75.0 | ||
Array Access Efficiency (%) | 70.3 | ||
Potential Speedups | |||
Perfect Flow Complexity | 1.00 | ||
Perfect OpenMP + MPI + Pthread | 1.00 | ||
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | ||
No Scalar Integer | Potential Speedup | 1.05 | |
Nb Loops to get 80% | 1 | ||
FP Vectorised | Potential Speedup | 2.02 | |
Nb Loops to get 80% | 3 | ||
Fully Vectorised | Potential Speedup | 3.95 | |
Nb Loops to get 80% | 3 | ||
FP Arithmetic Only | Potential Speedup | 1.06 | |
Nb Loops to get 80% | 1 |
Source Object | Issue |
---|---|
▼bench | |
○vrank-geq1.c | -funroll-loops is missing. |
○t2fv_4.c | -funroll-loops is missing. |
○dftw-direct.c | -funroll-loops is missing. |
○direct.c | -funroll-loops is missing. |
○execute.c | -funroll-loops is missing. |
○solve.c | -funroll-loops is missing. |
○t2fv_32.c | -funroll-loops is missing. |
○ct.c | -funroll-loops is missing. |
○n2fv_64.c | -funroll-loops is missing. |
Experiment Name | FFTW GCC-SSE-128 | ||||
Application | /home/fmusial/FFTW_Benchmarks/fftw-3.3.10-gcc-sse-128/tests/bench | ||||
Timestamp | 2025-04-14 16:31:43 | Universal Timestamp | 1744641103 | ||
Number of processes observed | 1 | Number of threads observed | 1 | ||
Experiment Type | Sequential | ||||
Machine | otterfall | ||||
Model Name | Intel(R) Xeon(R) Silver 4210R CPU @ 2.40GHz | ||||
Architecture | x86_64 | Micro Architecture | SKYLAKE | ||
Cache Size | 14080 KB | Number of Cores | 10 | ||
OS Version | Linux 6.12.1-arch1-1 #1 SMP PREEMPT_DYNAMIC Fri, 22 Nov 2024 16:04:27 +0000 | ||||
Architecture used during static analysis | x86_64 | Micro Architecture used during static analysis | SKYLAKE | ||
Frequency Driver | intel_pstate | Frequency Governor | performance | ||
Huge Pages | always | Hyperthreading | off | ||
Number of sockets | 1 | Number of cores per socket | 10 | ||
Compilation Options | bench: GNU C17 14.2.1 20250207 --param=l1-cache-size=32 --param=l1-cache-line-size=64 --param=l2-cache-size=14080 -mtune=cascadelake -msse2 -march=cascadelake -g -O3 -fno-omit-frame-pointer |
Dataset | |
Run Command | <executable> -v2 -opatient -owisdom -r 300000 -t 1 -s ocf8192 |
Number Processes | 1 |
Number Nodes | 1 |
Filter | Not Used |
Profile Start | Not Used |