Loops
MultiBsplineRef.hpp: 68 - 101.61 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 859 | 33.15 | 32.17 | 25.62 | 100 | 25 | 122.07 |
gcc_default | 956 | 31.90 | 31.59 | 24.62 | 100 | 50 | 124.27 |
icx_2 | 754 | 33.42 | 32.71 | 26.29 | 100 | 25 | 120.04 |
gcc_4 | 964 | 32.62 | 32.30 | 25.08 | 100 | 25 | 121.56 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 1 analyzed binary loop (exec - 859) |
gcc_default | Sum on 1 analyzed binary loop (exec - 956) |
icx_2 | Sum on 1 analyzed binary loop (exec - 754) |
gcc_4 | Sum on 1 analyzed binary loop (exec - 964) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Data Access Issues | Presence of constant non-unit stride data access | - | 1 | - | 1 |
Vectorization Roadblocks | Presence of constant non-unit stride data access | - | 1 | - | 1 |

MultiBsplineRef.hpp: 242 - 48.90 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 866 | 14.09 | 13.64 | 10.86 | 100 | 50 | 401.28 |
gcc_default | 1021 | 20.71 | 18.25 | 14.22 | 15.22 | 14.4 | 293.12 |
icx_2 | 761 | 13.00 | 12.41 | 9.97 | 100 | 100 | 441.02 |
gcc_4 | 1032 | 20.60 | 17.83 | 13.84 | 15.22 | 14.4 | 299.95 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 1 analyzed binary loop (exec - 866) |
gcc_default | Sum on 1 analyzed binary loop (exec - 1021) |
icx_2 | Sum on 1 analyzed binary loop (exec - 761) |
gcc_4 | Sum on 1 analyzed binary loop (exec - 1032) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Data Access Issues | More than 10% of the vector load instructions are unaligned | - | - | 1 | - |

SoaDistanceTableAAOMPTarget.h: 440 - 29.37 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 1840 | 9.79 | 9.56 | 7.62 | 54.55 | 15.91 | 0 |
gcc_default | 2539 | 9.28 | 9.03 | 7.04 | 27.27 | 15.91 | 0 |
icx_2 | 1672 | 10.16 | 9.71 | 7.81 | 54.55 | 15.91 | 0 |
gcc_4 | 2550 | 9.12 | 8.90 | 6.91 | 27.27 | 15.91 | 0 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 1 analyzed binary loop (exec - 1840) |
gcc_default | Sum on 1 analyzed binary loop (exec - 2539) |
icx_2 | Sum on 1 analyzed binary loop (exec - 1672) |
gcc_4 | Sum on 1 analyzed binary loop (exec - 2550) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Loop Computation Issues | Presence of a large number of scalar integer instructions | 1 | 1 | 1 | 1 |
Data Access Issues | Presence of constant non-unit stride data access | 1 | 1 | 1 | 1 |
Vectorization Roadblocks | Presence of constant non-unit stride data access | 1 | 1 | 1 | 1 |

SoaDistanceTableABOMPTarget.h: 228 - 27.56 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 2066 | 8.90 | 8.22 | 6.55 | 12.77 | 14.1 | 324.69 |
gcc_default | 3109 | 10.37 | 9.52 | 7.42 | 12.24 | 14.03 | 280.18 |
icx_2 | 1888 | 8.90 | 8.38 | 6.74 | 12.77 | 14.1 | 318.35 |
gcc_4 | 3122 | 9.31 | 8.83 | 6.85 | 12.24 | 14.03 | 302.26 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 1 analyzed binary loop (exec - 2066) |
gcc_default | Sum on 1 analyzed binary loop (exec - 3109) |
icx_2 | Sum on 1 analyzed binary loop (exec - 1888) |
gcc_4 | Sum on 1 analyzed binary loop (exec - 3122) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Loop Computation Issues | Presence of expensive FP instructions | 1 | 1 | 1 | 1 |

inner_product.hpp: 155 - 3.96 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 979 | 0.69 | 0.62 | 0.49 | 33.33 | 16.67 | 66.19 |
orig_default | 966 | 0.80 | 0.64 | 0.51 | 33.33 | 16.67 | 64.02 |
orig_default | 967 | 0.20 | 0.15 | 0.12 | 33.33 | 16.67 | 277.18 |
orig_default | 964 | 0.16 | 0.12 | 0.09 | 33.33 | 16.67 | 70.71 |
gcc_default | 1127 | 0.63 | 0.54 | 0.42 | 100 | 50 | 75.77 |
gcc_default | 1139 | 0.13 | 0.09 | 0.07 | 100 | 50 | 92.06 |
gcc_default | 1131 | 0.61 | 0.54 | 0.42 | 100 | 50 | 74.95 |
gcc_default | 1123 | 0.24 | 0.13 | 0.10 | 100 | 50 | 315.4 |
icx_2 | 852 | 0.78 | 0.66 | 0.53 | 33.33 | 16.67 | 61.95 |
icx_2 | 850 | 0.16 | 0.12 | 0.10 | 33.33 | 16.67 | 67.22 |
icx_2 | 861 | 0.70 | 0.62 | 0.50 | 33.33 | 16.67 | 65.88 |
icx_2 | 853 | 0.22 | 0.16 | 0.13 | 33.33 | 16.67 | 246.96 |
gcc_4 | 1134 | 0.11 | 0.07 | 0.05 | 100 | 25 | 586.76 |
gcc_4 | 1137 | 0.67 | 0.54 | 0.42 | 100 | 25 | 74.57 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 2 analyzed binary loops (exec - 979, exec - 966) |
gcc_default | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
icx_2 | Sum on 2 analyzed binary loops (exec - 852, exec - 861) |
gcc_4 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Data Access Issues | Presence of special instructions executing on a single port | 1 | n/a | 1 | n/a |
Inefficient Vectorization | Presence of special instructions executing on a single port | 1 | n/a | 1 | n/a |

einspline_spo_ref.hpp: 223 - 3.52 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 861 | 1.30 | 1.07 | 0.85 | 31.58 | 15.46 | 0 |
gcc_default | 1025 | 1.36 | 1.16 | 0.91 | 11.11 | 13.89 | 0 |
icx_2 | 756 | 1.24 | 1.07 | 0.86 | 20 | 13.13 | 0 |
gcc_4 | 1036 | 1.31 | 1.16 | 0.90 | 11.11 | 13.89 | 0 |

Run | Optimizer analysis |
---|---|
orig_default | Sum on 1 analyzed binary loop (exec - 861) |
gcc_default | Sum on 1 analyzed binary loop (exec - 1025) |
icx_2 | Sum on 1 analyzed binary loop (exec - 756) |
gcc_4 | Sum on 1 analyzed binary loop (exec - 1036) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Loop Computation Issues | Presence of a large number of scalar integer instructions | 1 | - | 1 | - |
Data Access Issues | Presence of constant non-unit stride data access | 1 | 1 | 1 | 1 |
Data Access Issues | Presence of special instructions executing on a single port | 1 | 1 | 1 | 1 |
Vectorization Roadblocks | Presence of constant non-unit stride data access | 1 | 1 | 1 | 1 |
Inefficient Vectorization | Presence of special instructions executing on a single port | 1 | 1 | 1 | 1 |

TwoBodyJastrowRef.h: 342 - 3.33 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 348 | 0.47 | 0.36 | 0.28 | 100 | 50 | 113.58 |
orig_default | 350 | 0.43 | 0.35 | 0.28 | 100 | 50 | 116.6 |
orig_default | 346 | 0.45 | 0.35 | 0.28 | 100 | 50 | 116.46 |
gcc_default | 619 | 1.19 | 1.04 | 0.81 | 100 | 50 | 117.34 |
icx_2 | 311 | 0.40 | 0.31 | 0.25 | 100 | 100 | 129.25 |
icx_2 | 313 | 0.40 | 0.32 | 0.25 | 100 | 100 | 129.19 |
icx_2 | 312 | 0.39 | 0.31 | 0.25 | 100 | 100 | 133.83 |
gcc_4 | 576 | 1.39 | 1.20 | 0.93 | 100 | 25 | 102.03 |

Run | Optimizer analysis |
---|---|
orig_default | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
gcc_default | Sum on 1 analyzed binary loop (exec - 619) |
icx_2 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
gcc_4 | Sum on 1 analyzed binary loop (exec - 576) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Data Access Issues | Presence of constant non-unit stride data access | n/a | 1 | n/a | 1 |
Vectorization Roadblocks | Presence of constant non-unit stride data access | n/a | 1 | n/a | 1 |

inner_product.hpp: 82 - 2.29 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 981 | 0.36 | 0.28 | 0.22 | 100 | 50 | 49.07 |
orig_default | 962 | 0.05 | 0.03 | 0.02 | 100 | 50 | 107.37 |
orig_default | 869 | 0.34 | 0.26 | 0.21 | 100 | 50 | 204.12 |
orig_default | 970 | 0.19 | 0.11 | 0.09 | 100 | 50 | 123.33 |
gcc_default | 1132 | 0.37 | 0.29 | 0.22 | 100 | 50 | 47.41 |
gcc_default | 1124 | 0.17 | 0.12 | 0.09 | 100 | 50 | 115.54 |
gcc_default | 1138 | 0.06 | 0.03 | 0.02 | 100 | 50 | 99.16 |
gcc_default | 1126 | 0.38 | 0.29 | 0.23 | 100 | 50 | 184.68 |
icx_2 | 763 | 0.32 | 0.23 | 0.18 | 100 | 100 | 238.87 |
icx_2 | 855 | 0.17 | 0.09 | 0.07 | 100 | 100 | 134.9 |
icx_2 | 862 | 0.34 | 0.28 | 0.22 | 100 | 100 | 49.09 |
icx_2 | 848 | 0.05 | 0.03 | 0.02 | 100 | 100 | 101.27 |
gcc_4 | 1136 | 0.55 | 0.39 | 0.30 | 100 | 25 | 140.51 |
gcc_4 | 1148 | 0.06 | 0.03 | 0.02 | 100 | 25 | 86.42 |
gcc_4 | 1133 | 0.23 | 0.16 | 0.13 | 100 | 25 | 82.81 |
gcc_4 | 1141 | 0.37 | 0.31 | 0.24 | 100 | 25 | 43.65 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

BsplineFunctor.h: 291 - 1.74 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 321 | 0.69 | 0.55 | 0.44 | 86.96 | 44.57 | 0.4 |
gcc_default | 291 | 0.42 | 0.31 | 0.24 | 0 | 9.38 | 0.01 |
gcc_default | 639 | 0.08 | 0.03 | 0.02 | 0 | 9.38 | 0.26 |
gcc_default | 614 | 0.38 | 0.27 | 0.21 | 0 | 9.38 | 0.03 |
icx_2 | 291 | 0.57 | 0.45 | 0.36 | 63.57 | 48.71 | 1.41 |
gcc_4 | 571 | 0.38 | 0.27 | 0.21 | 0 | 9.38 | 0.01 |
gcc_4 | 293 | 0.38 | 0.30 | 0.23 | 0 | 9.38 | 0.02 |
gcc_4 | 596 | 0.07 | 0.03 | 0.02 | 0 | 9.38 | 0.15 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

TwoBodyJastrowRef.h: 324 - 1.61 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 352 | 0.55 | 0.45 | 0.36 | 100 | 50 | 136.14 |
gcc_default | 620 | 0.80 | 0.63 | 0.49 | 0 | 12.5 | 97 |
icx_2 | 315 | 0.54 | 0.40 | 0.32 | 100 | 100 | 149.58 |
gcc_4 | 577 | 0.78 | 0.58 | 0.45 | 0 | 12.5 | 105.73 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

MultiBsplineRef.hpp: 276 - 1.51 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 863 | 0.36 | 0.24 | 0.19 | 100 | 50 | 305.88 |
gcc_default | 1020 | 1.05 | 0.72 | 0.56 | 0 | 12.5 | 102.51 |
icx_2 | 758 | 0.31 | 0.24 | 0.19 | 100 | 100 | 313.05 |
gcc_4 | 1031 | 1.01 | 0.73 | 0.56 | 0 | 12.5 | 101.24 |

Run | Optimizer analysis |
---|---|
orig_default | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
gcc_default | Sum on 1 analyzed binary loop (exec - 1020) |
icx_2 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
gcc_4 | Sum on 1 analyzed binary loop (exec - 1031) |

Category | Analysis | Count (orig_default) | Count (gcc_default) | Count (icx_2) | Count (gcc_4) |
---|---|---|---|---|---|
Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | n/a | 1 | n/a | 1 |

TwoBodyJastrowRef.h: 155 - 1.31 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 341 | 0.17 | 0.12 | 0.10 | 100 | 50 | 219.72 |
orig_default | 340 | 0.19 | 0.12 | 0.10 | 100 | 50 | 225.78 |
orig_default | 339 | 0.23 | 0.12 | 0.10 | 100 | 50 | 224.3 |
gcc_default | 285 | 0.53 | 0.39 | 0.30 | 100 | 50 | 207.29 |
icx_2 | 307 | 0.17 | 0.11 | 0.09 | 100 | 100 | 244.01 |
icx_2 | 306 | 0.16 | 0.10 | 0.08 | 100 | 100 | 274.13 |
icx_2 | 305 | 0.17 | 0.11 | 0.09 | 100 | 100 | 246.67 |
gcc_4 | 289 | 0.70 | 0.59 | 0.46 | 100 | 25 | 138.9 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

inner_product.hpp: 211 - 0.89 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 948 | 0.39 | 0.34 | 0.27 | 85.71 | 41.07 | 0 |
gcc_default | 1149 | 0.27 | 0.23 | 0.18 | 33.33 | 16.67 | 0 |
icx_2 | 836 | 0.36 | 0.31 | 0.25 | 85.71 | 76.79 | 0 |
gcc_4 | 1159 | 0.28 | 0.24 | 0.19 | 33.33 | 16.67 | 0 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

BsplineFunctor.h: 246 - 0.30 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 380 | 0.24 | 0.16 | 0.13 | 100 | 46.88 | 244.52 |
gcc_default | 383 | 0.11 | 0.06 | 0.05 | 61.9 | 31.55 | 713.57 |
icx_2 | 334 | 0.15 | 0.08 | 0.07 | 100 | 90.63 | 433.31 |
gcc_4 | 340 | 0.13 | 0.08 | 0.06 | 70.8 | 20.46 | 563.36 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

stl_numeric.h: 140 - 0.24 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 343 | 0.13 | 0.08 | 0.06 | 100 | 50 | 184.27 |
gcc_default | 289 | 0.13 | 0.08 | 0.06 | 100 | 50 | 182.45 |
icx_2 | 309 | 0.04 | 0.02 | 0.02 | 100 | 100 | 751.33 |
gcc_4 | 291 | 0.23 | 0.14 | 0.11 | 100 | 25 | 96.75 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.

TwoBodyJastrowRef.h: 381 - 0.20 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|
orig_default | 372 | 0.05 | 0.02 | 0.01 | 100 | 50 | 164.93 |
orig_default | 368 | 0.05 | 0.02 | 0.02 | 100 | 50 | 131.73 |
orig_default | 370 | 0.05 | 0.02 | 0.02 | 100 | 50 | 123.65 |
gcc_default | 633 | 0.11 | 0.06 | 0.04 | 100 | 50 | 138.48 |
icx_2 | 327 | 0.05 | 0.02 | 0.02 | 100 | 100 | 117.85 |
icx_2 | 325 | 0.05 | 0.02 | 0.02 | 100 | 100 | 140.85 |
icx_2 | 326 | 0.05 | 0.02 | 0.01 | 100 | 100 | 165.53 |
gcc_4 | 592 | 0.15 | 0.08 | 0.06 | 100 | 25 | 109.8 |

No Optimizer analysis found for any assembly loop in any of the four runs. More loops can be analyzed using option --optimizer-loop-count.