Loop id | Source Location | Source Function | Level | Exclusive Coverage 6x1 (%) | Exclusive Coverage 72x1 (%) | Exclusive Coverage 96x1 (%) | Exclusive Coverage 120x1 (%) | Exclusive Coverage 126x1 (%) | Exclusive Coverage 144x1 (%) | Exclusive Coverage 168x1 (%) | Exclusive Coverage 192x1 (%) | Exclusive Coverage 216x1 (%) | Exclusive Coverage 240x1 (%) | Exclusive Coverage 256x1 (%) | Inclusive Coverage 6x1 (%) | Inclusive Coverage 72x1 (%) | Inclusive Coverage 96x1 (%) | Inclusive Coverage 120x1 (%) | Inclusive Coverage 126x1 (%) | Inclusive Coverage 144x1 (%) | Inclusive Coverage 168x1 (%) | Inclusive Coverage 192x1 (%) | Inclusive Coverage 216x1 (%) | Inclusive Coverage 240x1 (%) | Inclusive Coverage 256x1 (%) | Max Exclusive Time Over Threads 6x1 (s) | Max Exclusive Time Over Threads 72x1 (s) | Max Exclusive Time Over Threads 96x1 (s) | Max Exclusive Time Over Threads 120x1 (s) | Max Exclusive Time Over Threads 126x1 (s) | Max Exclusive Time Over Threads 144x1 (s) | Max Exclusive Time Over Threads 168x1 (s) | Max Exclusive Time Over Threads 192x1 (s) | Max Exclusive Time Over Threads 216x1 (s) | Max Exclusive Time Over Threads 240x1 (s) | Max Exclusive Time Over Threads 256x1 (s) | Max Inclusive Time Over Threads 6x1 (s) | Max Inclusive Time Over Threads 72x1 (s) | Max Inclusive Time Over Threads 96x1 (s) | Max Inclusive Time Over Threads 120x1 (s) | Max Inclusive Time Over Threads 126x1 (s) | Max Inclusive Time Over Threads 144x1 (s) | Max Inclusive Time Over Threads 168x1 (s) | Max Inclusive Time Over Threads 192x1 (s) | Max Inclusive Time Over Threads 216x1 (s) | Max Inclusive Time Over Threads 240x1 (s) | Max Inclusive Time Over Threads 256x1 (s) | Exclusive Time w.r.t. Wall Time 6x1 (s) | Exclusive Time w.r.t. Wall Time 72x1 (s) | Exclusive Time w.r.t. Wall Time 96x1 (s) | Exclusive Time w.r.t. Wall Time 120x1 (s) | Exclusive Time w.r.t. Wall Time 126x1 (s) | Exclusive Time w.r.t. Wall Time 144x1 (s) | Exclusive Time w.r.t. Wall Time 168x1 (s) | Exclusive Time w.r.t. Wall Time 192x1 (s) | Exclusive Time w.r.t. Wall Time 216x1 (s) | Exclusive Time w.r.t. Wall Time 240x1 (s) | Exclusive Time w.r.t. Wall Time 256x1 (s) | Inclusive Time w.r.t. Wall Time 6x1 (s) | Inclusive Time w.r.t. Wall Time 72x1 (s) | Inclusive Time w.r.t. Wall Time 96x1 (s) | Inclusive Time w.r.t. Wall Time 120x1 (s) | Inclusive Time w.r.t. Wall Time 126x1 (s) | Inclusive Time w.r.t. Wall Time 144x1 (s) | Inclusive Time w.r.t. Wall Time 168x1 (s) | Inclusive Time w.r.t. Wall Time 192x1 (s) | Inclusive Time w.r.t. Wall Time 216x1 (s) | Inclusive Time w.r.t. Wall Time 240x1 (s) | Inclusive Time w.r.t. Wall Time 256x1 (s) | Nb Threads 6x1 | Nb Threads 72x1 | Nb Threads 96x1 | Nb Threads 120x1 | Nb Threads 126x1 | Nb Threads 144x1 | Nb Threads 168x1 | Nb Threads 192x1 | Nb Threads 216x1 | Nb Threads 240x1 | Nb Threads 256x1 | GFLOPS 6x1 | GFLOPS 72x1 | GFLOPS 96x1 | GFLOPS 120x1 | GFLOPS 126x1 | GFLOPS 144x1 | GFLOPS 168x1 | GFLOPS 192x1 | GFLOPS 216x1 | GFLOPS 240x1 | GFLOPS 256x1 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 6x1 | Speedup If Perfect Load Balancing 72x1 | Speedup If Perfect Load Balancing 96x1 | Speedup If Perfect Load Balancing 120x1 | Speedup If Perfect Load Balancing 126x1 | Speedup If Perfect Load Balancing 144x1 | Speedup If Perfect Load Balancing 168x1 | Speedup If Perfect Load Balancing 192x1 | Speedup If Perfect Load Balancing 216x1 | Speedup If Perfect Load Balancing 240x1 | Speedup If Perfect Load Balancing 256x1 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (6x1) Efficiency | (6x1) Potential Speed-Up (%) | (72x1) Efficiency | (72x1) Potential Speed-Up (%) | (96x1) Efficiency | (96x1) Potential Speed-Up (%) | (120x1) Efficiency | (120x1) Potential Speed-Up (%) | (126x1) Efficiency | (126x1) Potential Speed-Up (%) | (144x1) Efficiency | (144x1) Potential Speed-Up (%) | (168x1) Efficiency | (168x1) Potential Speed-Up (%) | (192x1) Efficiency | (192x1) Potential Speed-Up (%) | (216x1) Efficiency | (216x1) Potential Speed-Up (%) | (240x1) Efficiency | (240x1) Potential Speed-Up (%) | (256x1) Efficiency | (256x1) Potential Speed-Up (%) |
---|
31406 | exec - pair_eam_intel.cpp:291-605 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Outermost | 20.50 | 18.42 | 17.74 | 16.85 | 16.58 | 16.90 | 16.54 | 15.88 | 16.24 | 15.98 | 14.78 | 46.17 | 41.79 | 40.09 | 38.09 | 37.51 | 37.87 | 37.12 | 35.49 | 36.25 | 35.49 | 33.17 | 167.02 | 14.52 | 11.00 | 9.06 | 9.12 | 8.14 | 7.68 | 7.19 | 6.65 | 6.07 | 7.01 | 375.19 | 31.96 | 23.99 | 19.68 | 20.16 | 17.65 | 16.03 | 15.54 | 14.90 | 12.97 | 16.76 | 166.45 | 14.01 | 10.57 | 8.54 | 8.38 | 7.76 | 7.02 | 6.51 | 6.18 | 5.78 | 5.44 | 374.86 | 31.77 | 23.89 | 19.32 | 18.95 | 17.39 | 15.75 | 14.55 | 13.80 | 12.84 | 12.22 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 74.68 | 887.54 | 1177.10 | 1453.74 | 1484.90 | 1601.23 | 1773.06 | 1912.28 | 2012.15 | 2150.52 | 2285.13 | 86.43 | 64.13 | 1.15 | 1.01 | 1.06 | 1.01 | 1.04 | 1.04 | 1.07 | 1.1 | 1.06 | 1.1 | 1.11 | 1.08 | 1.06 | 1.31 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0.18 | 0.98 | 0.28 | 0.97 | 0.44 | 0.95 | 0.89 | 0.89 | 1.8 | 0.85 | 2.53 | 0.8 | 3.18 | 0.75 | 4.09 | 0.72 | 4.48 | 0.72 | 4.18 |
31408 | exec - pair_eam_intel.cpp:545-602 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 13.72 | 12.41 | 11.87 | 11.19 | 11.11 | 11.24 | 11.04 | 10.53 | 10.81 | 10.65 | 9.86 | 13.72 | 12.41 | 11.87 | 11.19 | 11.11 | 11.24 | 11.04 | 10.53 | 10.81 | 10.65 | 9.86 | 111.54 | 9.79 | 7.59 | 6.05 | 6.15 | 5.65 | 5.05 | 4.84 | 4.74 | 4.14 | 4.44 | 111.54 | 9.79 | 7.59 | 6.05 | 6.15 | 5.65 | 5.05 | 4.84 | 4.74 | 4.14 | 4.44 | 111.43 | 9.43 | 7.07 | 5.68 | 5.61 | 5.16 | 4.68 | 4.32 | 4.11 | 3.86 | 3.63 | 111.43 | 9.43 | 7.07 | 5.68 | 5.61 | 5.16 | 4.68 | 4.32 | 4.11 | 3.86 | 3.63 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 77.37 | 914.34 | 1219.23 | 1521.57 | 1538.33 | 1672.95 | 1840.99 | 1999.15 | 2097.78 | 2238.39 | 2377.58 | 100 | 82.64 | 1.02 | 1 | 1.02 | 1 | 1.04 | 1.08 | 1.07 | 1.1 | 1.1 | 1.09 | 1.13 | 1.16 | 1.09 | 1.24 | 1 | 5 | 0 | 0 | 4 | 1 | 0 | 0.98 | 0.2 | 0.98 | 0.18 | 0.98 | 0.2 | 0.95 | 0.61 | 0.9 | 1.13 | 0.85 | 1.66 | 0.81 | 2.04 | 0.75 | 2.68 | 0.72 | 2.96 | 0.72 | 2.76 |
31407 | exec - pair_eam_intel.cpp:513-521 | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 11.94 | 10.96 | 10.49 | 10.05 | 9.82 | 9.73 | 9.54 | 9.08 | 9.20 | 8.86 | 8.54 | 11.94 | 10.96 | 10.49 | 10.05 | 9.82 | 9.73 | 9.54 | 9.08 | 9.20 | 8.86 | 8.54 | 98.46 | 8.71 | 6.60 | 5.38 | 5.52 | 4.80 | 4.39 | 4.15 | 3.75 | 3.51 | 5.31 | 98.46 | 8.71 | 6.60 | 5.38 | 5.52 | 4.80 | 4.39 | 4.15 | 3.75 | 3.51 | 5.31 | 96.98 | 8.33 | 6.25 | 5.10 | 4.96 | 4.47 | 4.05 | 3.72 | 3.50 | 3.21 | 3.15 | 96.98 | 8.33 | 6.25 | 5.10 | 4.96 | 4.47 | 4.05 | 3.72 | 3.50 | 3.21 | 3.15 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 56.51 | 657.45 | 875.63 | 1074.98 | 1102.88 | 1225.94 | 1353.03 | 1467.94 | 1563.09 | 1707.68 | 1739.57 | 100 | 89.29 | 1 | 1 | 1 | 1.02 | 1.05 | 1.06 | 1.06 | 1.12 | 1.08 | 1.09 | 1.12 | 1.08 | 1.11 | 1.71 | 0 | 1 | 0 | 0 | 6 | 1 | 0 | 0.97 | 0.33 | 0.97 | 0.31 | 0.95 | 0.49 | 0.93 | 0.68 | 0.9 | 0.93 | 0.86 | 1.38 | 0.81 | 1.69 | 0.77 | 2.12 | 0.76 | 2.16 | 0.72 | 2.37 |
31416 | exec - pair_eam_intel.cpp:291-363 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Outermost | 10.68 | 9.57 | 9.23 | 8.71 | 8.51 | 8.71 | 8.55 | 8.22 | 8.41 | 8.21 | 7.62 | 26.98 | 24.13 | 23.35 | 22.38 | 21.99 | 22.37 | 22.07 | 21.24 | 21.75 | 21.31 | 20.66 | 87.08 | 7.66 | 5.83 | 4.68 | 4.78 | 4.31 | 3.94 | 3.75 | 3.57 | 3.27 | 3.03 | 219.38 | 18.49 | 14.01 | 11.72 | 11.69 | 10.67 | 10.09 | 9.22 | 8.67 | 7.91 | 8.44 | 86.71 | 7.28 | 5.50 | 4.42 | 4.30 | 4.00 | 3.63 | 3.37 | 3.20 | 2.97 | 2.81 | 219.06 | 18.35 | 13.91 | 11.35 | 11.11 | 10.27 | 9.36 | 8.70 | 8.28 | 7.71 | 7.61 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 67.54 | 804.23 | 1063.68 | 1327.61 | 1364.33 | 1466.47 | 1615.43 | 1743.55 | 1832.44 | 1974.28 | 2089.27 | 77.11 | 59.71 | 1.14 | 1 | 1.15 | 1.01 | 1.06 | 1.06 | 1.06 | 1.12 | 1.08 | 1.09 | 1.12 | 1.12 | 1.11 | 1.1 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0.07 | 0.99 | 0.14 | 0.98 | 0.16 | 0.96 | 0.34 | 0.9 | 0.84 | 0.85 | 1.25 | 0.8 | 1.6 | 0.75 | 2.08 | 0.73 | 2.22 | 0.72 | 2.1 |
31417 | exec - pair_eam_intel.cpp:312-320 | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 9.53 | 8.57 | 8.38 | 8.20 | 8.12 | 8.24 | 8.16 | 7.92 | 8.12 | 7.98 | 8.27 | 9.53 | 8.57 | 8.38 | 8.20 | 8.12 | 8.24 | 8.16 | 7.92 | 8.12 | 7.98 | 8.27 | 78.06 | 6.96 | 5.26 | 4.46 | 4.47 | 4.22 | 4.03 | 3.77 | 3.54 | 3.19 | 3.96 | 78.06 | 6.96 | 5.26 | 4.46 | 4.47 | 4.22 | 4.03 | 3.77 | 3.54 | 3.19 | 3.96 | 77.35 | 6.51 | 4.99 | 4.16 | 4.10 | 3.79 | 3.46 | 3.25 | 3.09 | 2.89 | 3.05 | 77.35 | 6.51 | 4.99 | 4.16 | 4.10 | 3.79 | 3.46 | 3.25 | 3.09 | 2.89 | 3.05 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 54.16 | 643.83 | 840.58 | 1007.90 | 1020.85 | 1108.24 | 1211.13 | 1289.10 | 1358.98 | 1454.09 | 1377.05 | 100 | 87.5 | 1 | 1 | 1 | 1.01 | 1.07 | 1.06 | 1.08 | 1.1 | 1.12 | 1.17 | 1.17 | 1.15 | 1.12 | 1.32 | 0 | 1 | 0 | 0 | 3 | 1 | 0 | 0.99 | 0.09 | 0.97 | 0.27 | 0.93 | 0.57 | 0.9 | 0.83 | 0.85 | 1.22 | 0.8 | 1.65 | 0.74 | 2.03 | 0.7 | 2.47 | 0.67 | 2.64 | 0.6 | 3.35 |
31418 | exec - pair_eam_intel.cpp:340-361 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 6.77 | 5.99 | 5.74 | 5.47 | 5.36 | 5.41 | 5.35 | 5.09 | 5.22 | 5.12 | 4.77 | 6.77 | 5.99 | 5.74 | 5.47 | 5.36 | 5.41 | 5.35 | 5.09 | 5.22 | 5.12 | 4.77 | 55.49 | 4.91 | 3.67 | 3.04 | 3.14 | 2.69 | 2.89 | 2.49 | 2.30 | 2.09 | 1.98 | 55.49 | 4.91 | 3.67 | 3.04 | 3.14 | 2.69 | 2.89 | 2.49 | 2.30 | 2.09 | 1.98 | 55.00 | 4.56 | 3.42 | 2.78 | 2.71 | 2.49 | 2.27 | 2.09 | 1.99 | 1.85 | 1.76 | 55.00 | 4.56 | 3.42 | 2.78 | 2.71 | 2.49 | 2.27 | 2.09 | 1.99 | 1.85 | 1.76 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 40.92 | 494.15 | 657.59 | 807.28 | 828.61 | 898.75 | 986.31 | 1073.40 | 1125.60 | 1205.57 | 1273.72 | 100 | 81.25 | 1.1 | 1 | 1.05 | 1.01 | 1.08 | 1.08 | 1.1 | 1.17 | 1.09 | 1.28 | 1.2 | 1.17 | 1.14 | 1.14 | 0 | 2 | 0 | 0 | 2 | 1 | 0 | 1.01 | 0 | 1.01 | 0 | 0.99 | 0.05 | 0.97 | 0.18 | 0.92 | 0.42 | 0.87 | 0.72 | 0.82 | 0.9 | 0.77 | 1.21 | 0.74 | 1.32 | 0.73 | 1.27 |
27776 | exec - npair_intel.cpp:330-761 [...] | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Outermost | 2.95 | 2.67 | 2.53 | 2.41 | 2.39 | 2.45 | 2.39 | 2.28 | 2.36 | 2.32 | 2.13 | 7.98 | 7.29 | 7.00 | 6.67 | 6.60 | 6.71 | 6.59 | 6.30 | 6.50 | 6.38 | 5.92 | 24.06 | 2.18 | 1.68 | 1.38 | 1.35 | 1.28 | 1.19 | 1.11 | 1.09 | 0.97 | 0.93 | 66.39 | 5.98 | 4.52 | 3.64 | 3.64 | 3.41 | 2.97 | 2.86 | 2.58 | 2.41 | 2.50 | 23.92 | 2.03 | 1.51 | 1.22 | 1.21 | 1.12 | 1.01 | 0.94 | 0.90 | 0.84 | 0.78 | 64.78 | 5.54 | 4.17 | 3.38 | 3.33 | 3.08 | 2.80 | 2.58 | 2.47 | 2.31 | 2.18 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 57.65 | 674.59 | 912.53 | 1123.47 | 1134.60 | 1219.20 | 1352.64 | 1464.70 | 1518.99 | 1632.08 | 1743.95 | 52.79 | 47.98 | 1.11 | 1 | 1.13 | 1.01 | 1.08 | 1.12 | 1.14 | 1.12 | 1.15 | 1.18 | 1.19 | 1.22 | 1.17 | 1.2 | NA | NA | NA | NA | NA | 1 | 0 | 0.98 | 0.05 | 0.99 | 0.02 | 0.98 | 0.05 | 0.94 | 0.14 | 0.89 | 0.28 | 0.84 | 0.37 | 0.8 | 0.46 | 0.74 | 0.62 | 0.71 | 0.67 | 0.72 | 0.61 |
27790 | exec - npair_intel.cpp:366-371 | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 2.01 | 1.88 | 1.81 | 1.77 | 1.75 | 1.74 | 1.73 | 1.65 | 1.71 | 1.66 | 1.58 | 2.01 | 1.88 | 1.81 | 1.77 | 1.75 | 1.74 | 1.73 | 1.65 | 1.71 | 1.66 | 1.58 | 16.68 | 1.57 | 1.28 | 1.03 | 1.13 | 0.94 | 0.86 | 0.84 | 0.77 | 0.74 | 0.70 | 16.68 | 1.57 | 1.28 | 1.03 | 1.13 | 0.94 | 0.86 | 0.84 | 0.77 | 0.74 | 0.70 | 16.36 | 1.43 | 1.08 | 0.90 | 0.88 | 0.80 | 0.73 | 0.67 | 0.65 | 0.60 | 0.58 | 16.36 | 1.43 | 1.08 | 0.90 | 0.88 | 0.80 | 0.73 | 0.67 | 0.65 | 0.60 | 0.58 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 78.57 | 1 | 1 | 1 | 1.02 | 1.11 | 1.19 | 1.16 | 1.28 | 1.18 | 1.18 | 1.25 | 1.2 | 1.24 | 1.23 | 0 | 5 | 0 | 0 | 1 | 1 | 0 | 0.95 | 0.09 | 0.95 | 0.1 | 0.91 | 0.16 | 0.88 | 0.21 | 0.86 | 0.25 | 0.8 | 0.35 | 0.76 | 0.4 | 0.7 | 0.51 | 0.68 | 0.53 | 0.66 | 0.54 |
27785 | exec - npair_intel.cpp:474-558 [...] | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 1.70 | 1.53 | 1.49 | 1.41 | 1.38 | 1.42 | 1.39 | 1.33 | 1.35 | 1.35 | 1.26 | 1.70 | 1.53 | 1.49 | 1.41 | 1.38 | 1.42 | 1.39 | 1.33 | 1.35 | 1.35 | 1.26 | 14.09 | 1.27 | 1.02 | 0.85 | 0.84 | 0.78 | 0.69 | 0.69 | 0.64 | 0.64 | 0.58 | 14.09 | 1.27 | 1.02 | 0.85 | 0.84 | 0.78 | 0.69 | 0.69 | 0.64 | 0.64 | 0.58 | 13.84 | 1.16 | 0.89 | 0.72 | 0.70 | 0.65 | 0.59 | 0.55 | 0.52 | 0.49 | 0.46 | 13.84 | 1.16 | 0.89 | 0.72 | 0.70 | 0.65 | 0.59 | 0.55 | 0.52 | 0.49 | 0.46 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 158.37 | 1885.97 | 2468.62 | 3059.75 | 3127.64 | 3363.47 | 3711.12 | 4014.04 | 4247.17 | 4488.51 | 4739.25 | 100 | 76.32 | 1.15 | 1 | 1.21 | 1.02 | 1.1 | 1.16 | 1.19 | 1.21 | 1.21 | 1.19 | 1.27 | 1.25 | 1.33 | 1.26 | 0 | 5 | 0 | 0 | 2 | 1 | 0 | 0.99 | 0.01 | 0.97 | 0.04 | 0.97 | 0.05 | 0.94 | 0.08 | 0.89 | 0.16 | 0.84 | 0.22 | 0.79 | 0.28 | 0.75 | 0.34 | 0.71 | 0.39 | 0.7 | 0.38 |
6251 | exec - fix_nve_intel.cpp:79-81 | LAMMPS_NS::FixNVEIntel::initial_integrate(int) | Single | 1.29 | 2.64 | 3.21 | 3.64 | 3.64 | 3.81 | 4.26 | 4.15 | 4.46 | 4.70 | 3.27 | 1.29 | 2.64 | 3.21 | 3.64 | 3.64 | 3.81 | 4.26 | 4.15 | 4.46 | 4.70 | 3.27 | 10.73 | 2.23 | 2.11 | 2.06 | 2.13 | 2.00 | 2.02 | 2.02 | 1.93 | 1.89 | 1.77 | 10.73 | 2.23 | 2.11 | 2.06 | 2.13 | 2.00 | 2.02 | 2.02 | 1.93 | 1.89 | 1.77 | 10.48 | 2.01 | 1.92 | 1.85 | 1.84 | 1.75 | 1.81 | 1.70 | 1.70 | 1.70 | 1.20 | 10.48 | 2.01 | 1.92 | 1.85 | 1.84 | 1.75 | 1.81 | 1.70 | 1.70 | 1.70 | 1.20 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 23.92 | 124.81 | 130.99 | 135.45 | 136.12 | 143.34 | 138.97 | 147.06 | 147.76 | 146.76 | 207.74 | 90.91 | 92.05 | 1 | 1 | 1.06 | 1.03 | 1.11 | 1.1 | 1.12 | 1.16 | 1.15 | 1.13 | 1.19 | 1.14 | 1.13 | 1.49 | 1 | 3 | 0 | 0 | 0 | 1 | 0 | 0.43 | 1.49 | 0.34 | 2.11 | 0.28 | 2.61 | 0.27 | 2.65 | 0.25 | 2.86 | 0.21 | 3.38 | 0.19 | 3.36 | 0.17 | 3.7 | 0.15 | 3.98 | 0.2 | 2.6 |
27791 | exec - npair_intel.cpp:330-354 [...] | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | InBetween | 1.07 | 0.97 | 0.94 | 0.88 | 0.88 | 0.90 | 0.87 | 0.84 | 0.86 | 0.85 | 0.77 | 1.16 | 1.06 | 1.02 | 0.96 | 0.95 | 0.98 | 0.95 | 0.91 | 0.94 | 0.93 | 0.84 | 9.72 | 1.24 | 0.95 | 0.68 | 0.67 | 0.86 | 0.56 | 0.52 | 0.46 | 0.44 | 0.38 | 10.50 | 1.35 | 1.01 | 0.75 | 0.74 | 0.90 | 0.62 | 0.56 | 0.51 | 0.50 | 0.42 | 8.68 | 0.74 | 0.56 | 0.44 | 0.44 | 0.41 | 0.37 | 0.34 | 0.33 | 0.31 | 0.28 | 9.41 | 0.80 | 0.61 | 0.49 | 0.48 | 0.45 | 0.40 | 0.37 | 0.36 | 0.33 | 0.31 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.25 | 2.89 | 3.59 | 4.81 | 4.48 | 4.70 | 5.48 | 6.43 | 6.28 | 4.71 | 7.85 | 17.14 | 24.94 | 1.86 | 1 | 1.86 | 1.12 | 1.68 | 1.71 | 1.53 | 1.52 | 2.1 | 1.51 | 1.52 | 1.39 | 1.46 | 1.38 | NA | NA | NA | NA | NA | 1 | 0 | 0.98 | 0.02 | 0.97 | 0.03 | 0.98 | 0.02 | 0.93 | 0.06 | 0.87 | 0.11 | 0.84 | 0.14 | 0.79 | 0.18 | 0.73 | 0.23 | 0.71 | 0.25 | 0.72 | 0.21 |
6053 | exec - fix_intel.cpp:884-887 | void LAMMPS_NS::FixIntel::add_oresults<LAMMPS_NS::IntelBuffers<float, double>::vec3_acc_t, double>(LAMMPS_NS::IntelBuffers<float, double>::vec3_acc_t const*, double const*, int, int, int, int) [clone .extracted] | Single | 1.06 | 2.01 | 2.58 | 2.92 | 2.55 | 3.21 | 2.76 | 3.33 | 3.56 | 4.16 | 3.11 | 1.06 | 2.01 | 2.58 | 2.92 | 2.55 | 3.21 | 2.76 | 3.33 | 3.56 | 4.16 | 3.11 | 8.89 | 1.74 | 1.69 | 1.66 | 1.54 | 1.74 | 1.49 | 1.57 | 1.56 | 1.78 | 1.69 | 8.89 | 1.74 | 1.69 | 1.66 | 1.54 | 1.74 | 1.49 | 1.57 | 1.56 | 1.78 | 1.69 | 8.64 | 1.53 | 1.54 | 1.48 | 1.29 | 1.47 | 1.17 | 1.37 | 1.36 | 1.51 | 1.14 | 8.64 | 1.53 | 1.54 | 1.48 | 1.29 | 1.47 | 1.17 | 1.37 | 1.36 | 1.51 | 1.14 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 6.41 | 40.35 | 41.16 | 43.43 | 49.86 | 44.53 | 56.10 | 49.04 | 50.12 | 45.24 | 61.22 | 97.06 | 90.81 | 1 | 1 | 1.05 | 1.03 | 1.14 | 1.11 | 1.12 | 1.2 | 1.19 | 1.28 | 1.15 | 1.15 | 1.2 | 1.5 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0.47 | 1.06 | 0.35 | 1.67 | 0.29 | 2.07 | 0.32 | 1.74 | 0.24 | 2.42 | 0.26 | 2.04 | 0.2 | 2.67 | 0.18 | 2.93 | 0.14 | 3.56 | 0.18 | 2.56 |
6254 | exec - fix_nve_intel.cpp:135-135 | LAMMPS_NS::FixNVEIntel::final_integrate() | Single | 0.82 | 1.56 | 1.87 | 2.10 | 2.08 | 2.15 | 2.30 | 2.27 | 2.40 | 2.49 | 1.89 | 0.82 | 1.56 | 1.87 | 2.10 | 2.08 | 2.15 | 2.30 | 2.27 | 2.40 | 2.49 | 1.89 | 6.86 | 1.34 | 1.27 | 1.45 | 1.26 | 1.35 | 1.25 | 1.21 | 1.19 | 1.09 | 1.18 | 6.86 | 1.34 | 1.27 | 1.45 | 1.26 | 1.35 | 1.25 | 1.21 | 1.19 | 1.09 | 1.18 | 6.67 | 1.18 | 1.11 | 1.06 | 1.05 | 0.99 | 0.98 | 0.93 | 0.91 | 0.90 | 0.69 | 6.67 | 1.18 | 1.11 | 1.06 | 1.05 | 0.99 | 0.98 | 0.93 | 0.91 | 0.90 | 0.69 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 22.56 | 127.21 | 134.93 | 142.19 | 143.49 | 151.47 | 154.01 | 161.92 | 163.81 | 167.02 | 216.46 | 100 | 100 | 1 | 1 | 1 | 1.03 | 1.14 | 1.15 | 1.37 | 1.21 | 1.37 | 1.28 | 1.31 | 1.31 | 1.22 | 1.72 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0.47 | 0.82 | 0.37 | 1.17 | 0.31 | 1.44 | 0.3 | 1.45 | 0.28 | 1.55 | 0.24 | 1.74 | 0.22 | 1.76 | 0.2 | 1.91 | 0.18 | 2.03 | 0.22 | 1.46 |
31380 | exec - intel_buffers.h:228-231 | void LAMMPS_NS::PairEAMIntel::compute<float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&) [clone .extracted] | Single | 0.73 | 1.61 | 2.11 | 2.52 | 2.53 | 2.76 | 3.01 | 3.06 | 3.29 | 3.42 | 2.78 | 0.73 | 1.61 | 2.11 | 2.52 | 2.53 | 2.76 | 3.01 | 3.06 | 3.29 | 3.42 | 2.78 | 6.20 | 1.37 | 1.45 | 1.50 | 1.43 | 1.53 | 1.52 | 1.39 | 1.40 | 1.37 | 1.42 | 6.20 | 1.37 | 1.45 | 1.50 | 1.43 | 1.53 | 1.52 | 1.39 | 1.40 | 1.37 | 1.42 | 5.89 | 1.23 | 1.26 | 1.28 | 1.28 | 1.27 | 1.28 | 1.25 | 1.25 | 1.24 | 1.02 | 5.89 | 1.23 | 1.26 | 1.28 | 1.28 | 1.27 | 1.28 | 1.25 | 1.25 | 1.24 | 1.02 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 84.72 | 1 | 1 | 1 | 1.05 | 1.12 | 1.16 | 1.18 | 1.13 | 1.21 | 1.19 | 1.11 | 1.13 | 1.12 | 1.41 | 0 | 2 | 0 | 0 | 2 | 1 | 0 | 0.4 | 0.97 | 0.29 | 1.49 | 0.23 | 1.94 | 0.22 | 1.97 | 0.19 | 2.22 | 0.16 | 2.52 | 0.15 | 2.61 | 0.13 | 2.86 | 0.12 | 3.01 | 0.13 | 2.4 |
7840 | exec - comm_brick.cpp:841-844 | LAMMPS_NS::CommBrick::borders() | Innermost | 0.58 | 0.65 | 0.70 | 0.70 | 0.68 | 0.74 | 0.69 | 0.70 | 0.70 | 0.75 | 0.72 | 0.58 | 0.65 | 0.70 | 0.70 | 0.68 | 0.74 | 0.69 | 0.70 | 0.70 | 0.75 | 0.72 | 5.16 | 0.63 | 0.51 | 0.44 | 0.48 | 0.44 | 0.38 | 0.39 | 0.38 | 0.37 | 0.36 | 5.16 | 0.63 | 0.51 | 0.44 | 0.48 | 0.44 | 0.38 | 0.39 | 0.38 | 0.37 | 0.36 | 4.69 | 0.50 | 0.42 | 0.35 | 0.35 | 0.34 | 0.29 | 0.29 | 0.27 | 0.27 | 0.26 | 4.69 | 0.50 | 0.42 | 0.35 | 0.35 | 0.34 | 0.29 | 0.29 | 0.27 | 0.27 | 0.26 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 10.94 | 1.3 | 1 | 11.27 | 1.1 | 1.27 | 1.23 | 1.27 | 1.41 | 1.32 | 1.32 | 1.37 | 1.46 | 1.39 | 1.4 | 1.5 | 1 | 0 | 1.75 | 0.75 | 1 | 0 | 0.79 | 0.14 | 0.7 | 0.21 | 0.67 | 0.23 | 0.65 | 0.24 | 0.58 | 0.31 | 0.57 | 0.3 | 0.51 | 0.34 | 0.49 | 0.36 | 0.43 | 0.43 | 0.42 | 0.42 |
31384 | exec - pair_eam_intel.cpp:291-614 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Outermost | 0.54 | 0.48 | 0.46 | 0.45 | 0.44 | 0.45 | 0.44 | 0.42 | 0.43 | 0.43 | 0.39 | 1.27 | 1.14 | 1.11 | 1.06 | 1.03 | 1.04 | 1.02 | 0.98 | 1.00 | 0.98 | 0.92 | 4.50 | 0.44 | 0.35 | 0.29 | 0.29 | 0.29 | 0.24 | 0.25 | 0.21 | 0.21 | 0.19 | 10.28 | 0.89 | 0.69 | 0.56 | 0.56 | 0.49 | 0.45 | 0.44 | 0.40 | 0.37 | 0.46 | 4.36 | 0.37 | 0.28 | 0.23 | 0.22 | 0.21 | 0.19 | 0.17 | 0.16 | 0.16 | 0.14 | 10.28 | 0.87 | 0.66 | 0.54 | 0.52 | 0.48 | 0.43 | 0.40 | 0.38 | 0.35 | 0.34 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 90.99 | 1078.63 | 1445.14 | 1756.61 | 1790.35 | 1922.01 | 2154.06 | 2315.27 | 2440.07 | 2556.22 | 2769.94 | 86.72 | 61.14 | 1.16 | 1 | 1.14 | 1.04 | 1.21 | 1.29 | 1.28 | 1.29 | 1.43 | 1.28 | 1.46 | 1.32 | 1.33 | 1.34 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0.01 | 0.99 | 0.01 | 0.96 | 0.02 | 0.93 | 0.03 | 0.88 | 0.06 | 0.84 | 0.07 | 0.79 | 0.09 | 0.74 | 0.11 | 0.7 | 0.13 | 0.71 | 0.11 |
31386 | exec - pair_eam_intel.cpp:533-602 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 0.38 | 0.34 | 0.34 | 0.31 | 0.30 | 0.31 | 0.30 | 0.29 | 0.30 | 0.30 | 0.27 | 0.38 | 0.34 | 0.34 | 0.31 | 0.30 | 0.31 | 0.30 | 0.29 | 0.30 | 0.30 | 0.27 | 3.20 | 0.34 | 0.29 | 0.21 | 0.21 | 0.20 | 0.19 | 0.19 | 0.17 | 0.16 | 0.15 | 3.20 | 0.34 | 0.29 | 0.21 | 0.21 | 0.20 | 0.19 | 0.19 | 0.17 | 0.16 | 0.15 | 3.11 | 0.26 | 0.20 | 0.16 | 0.15 | 0.14 | 0.13 | 0.12 | 0.11 | 0.11 | 0.10 | 3.11 | 0.26 | 0.20 | 0.16 | 0.15 | 0.14 | 0.13 | 0.12 | 0.11 | 0.11 | 0.10 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 77.57 | 925.05 | 1180.74 | 1531.14 | 1585.42 | 1708.57 | 1884.70 | 2011.29 | 2137.24 | 2255.83 | 2414.48 | 99.14 | 80.67 | 1.11 | 1 | 1.07 | 1.03 | 1.3 | 1.43 | 1.3 | 1.35 | 1.43 | 1.45 | 1.55 | 1.52 | 1.51 | 1.52 | 2 | 5 | 0 | 0 | 4 | 1 | 0 | 0.99 | 0 | 0.95 | 0.02 | 0.98 | 0.01 | 0.97 | 0.01 | 0.92 | 0.03 | 0.86 | 0.04 | 0.81 | 0.06 | 0.76 | 0.07 | 0.73 | 0.08 | 0.73 | 0.07 |
31409 | exec - pair_eam_intel.cpp:440-464 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 0, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Single | 0.36 | 0.48 | 0.53 | 0.49 | 0.47 | 0.51 | 0.48 | 0.53 | 0.52 | 0.47 | 0.46 | 0.36 | 0.48 | 0.53 | 0.49 | 0.47 | 0.51 | 0.48 | 0.53 | 0.52 | 0.47 | 0.46 | 2.97 | 0.48 | 0.41 | 0.39 | 0.38 | 0.37 | 0.31 | 0.29 | 0.28 | 0.25 | 0.25 | 2.97 | 0.48 | 0.41 | 0.39 | 0.38 | 0.37 | 0.31 | 0.29 | 0.28 | 0.25 | 0.25 | 2.92 | 0.37 | 0.32 | 0.25 | 0.24 | 0.23 | 0.21 | 0.22 | 0.20 | 0.17 | 0.17 | 2.92 | 0.37 | 0.32 | 0.25 | 0.24 | 0.23 | 0.21 | 0.22 | 0.20 | 0.17 | 0.17 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 44.67 | 355.67 | 411.28 | 523.13 | 552.13 | 564.19 | 637.98 | 591.51 | 655.97 | 776.06 | 769.94 | 100 | 83.62 | 1 | 1 | 1 | 1.02 | 1.33 | 1.3 | 1.59 | 1.59 | 1.62 | 1.52 | 1.35 | 1.44 | 1.5 | 1.51 | 0 | 0 | 0 | 0 | 4 | 1 | 0 | 0.67 | 0.16 | 0.58 | 0.22 | 0.59 | 0.2 | 0.58 | 0.2 | 0.52 | 0.24 | 0.51 | 0.24 | 0.42 | 0.31 | 0.41 | 0.31 | 0.43 | 0.26 | 0.4 | 0.28 |
31385 | exec - pair_eam_intel.cpp:513-521 | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 0.35 | 0.31 | 0.30 | 0.30 | 0.29 | 0.29 | 0.28 | 0.27 | 0.27 | 0.25 | 0.26 | 0.35 | 0.31 | 0.30 | 0.30 | 0.29 | 0.29 | 0.28 | 0.27 | 0.27 | 0.25 | 0.26 | 3.04 | 0.31 | 0.26 | 0.20 | 0.22 | 0.19 | 0.17 | 0.16 | 0.15 | 0.14 | 0.16 | 3.04 | 0.31 | 0.26 | 0.20 | 0.22 | 0.19 | 0.17 | 0.16 | 0.15 | 0.14 | 0.16 | 2.81 | 0.24 | 0.18 | 0.15 | 0.15 | 0.13 | 0.12 | 0.11 | 0.10 | 0.09 | 0.09 | 2.81 | 0.24 | 0.18 | 0.15 | 0.15 | 0.13 | 0.12 | 0.11 | 0.10 | 0.09 | 0.09 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 54.30 | 628.99 | 840.97 | 991.50 | 1032.79 | 1152.72 | 1251.81 | 1386.06 | 1443.69 | 1671.18 | 1602.99 | 100 | 89.29 | 1 | 1 | 1 | 1.08 | 1.28 | 1.46 | 1.38 | 1.52 | 1.46 | 1.47 | 1.47 | 1.41 | 1.56 | 1.73 | 0 | 1 | 0 | 0 | 6 | 1 | 0 | 0.98 | 0.01 | 0.98 | 0.01 | 0.94 | 0.02 | 0.92 | 0.02 | 0.9 | 0.03 | 0.84 | 0.05 | 0.8 | 0.05 | 0.76 | 0.07 | 0.77 | 0.06 | 0.7 | 0.08 |
31395 | exec - pair_eam_intel.cpp:291-363 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Outermost | 0.30 | 0.27 | 0.26 | 0.25 | 0.24 | 0.25 | 0.24 | 0.23 | 0.24 | 0.23 | 0.21 | 0.76 | 0.68 | 0.66 | 0.63 | 0.62 | 0.63 | 0.62 | 0.60 | 0.62 | 0.60 | 0.58 | 2.56 | 0.27 | 0.21 | 0.19 | 0.18 | 0.16 | 0.14 | 0.16 | 0.15 | 0.12 | 0.13 | 6.16 | 0.54 | 0.41 | 0.34 | 0.34 | 0.31 | 0.28 | 0.27 | 0.26 | 0.24 | 0.25 | 2.47 | 0.21 | 0.15 | 0.13 | 0.12 | 0.11 | 0.10 | 0.09 | 0.09 | 0.08 | 0.08 | 6.16 | 0.52 | 0.39 | 0.32 | 0.31 | 0.29 | 0.26 | 0.25 | 0.23 | 0.22 | 0.21 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 66.10 | 799.60 | 1079.96 | 1276.89 | 1342.06 | 1442.05 | 1619.98 | 1764.25 | 1847.88 | 1974.99 | 2094.35 | 78.05 | 60.29 | 1.14 | 1 | 1.14 | 1.04 | 1.32 | 1.42 | 1.45 | 1.53 | 1.36 | 1.43 | 1.67 | 1.68 | 1.52 | 1.68 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1.01 | -0 | 0.96 | 0.01 | 0.96 | 0.01 | 0.9 | 0.03 | 0.87 | 0.03 | 0.83 | 0.04 | 0.76 | 0.06 | 0.74 | 0.06 | 0.74 | 0.06 |
7836 | exec - comm_brick.cpp:709-715 | LAMMPS_NS::CommBrick::exchange() | Innermost | 0.28 | 0.28 | 0.27 | 0.26 | 0.26 | 0.24 | 0.23 | 0.22 | 0.25 | 0.25 | 0.22 | 0.28 | 0.28 | 0.27 | 0.26 | 0.26 | 0.24 | 0.23 | 0.22 | 0.25 | 0.25 | 0.22 | 2.53 | 0.28 | 0.21 | 0.19 | 0.18 | 0.18 | 0.16 | 0.16 | 0.15 | 0.15 | 0.15 | 2.53 | 0.28 | 0.21 | 0.19 | 0.18 | 0.18 | 0.16 | 0.16 | 0.15 | 0.15 | 0.15 | 2.31 | 0.21 | 0.16 | 0.13 | 0.13 | 0.11 | 0.10 | 0.09 | 0.09 | 0.09 | 0.08 | 2.31 | 0.21 | 0.16 | 0.13 | 0.13 | 0.11 | 0.10 | 0.09 | 0.09 | 0.09 | 0.08 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 9.17 | 2.1 | 1 | 14.44 | 1.1 | 1.29 | 1.28 | 1.44 | 1.37 | 1.61 | 1.59 | 1.82 | 1.67 | 1.64 | 1.8 | NA | NA | NA | NA | NA | 1 | 0 | 0.9 | 0.03 | 0.89 | 0.03 | 0.87 | 0.03 | 0.85 | 0.04 | 0.88 | 0.03 | 0.84 | 0.04 | 0.79 | 0.05 | 0.69 | 0.08 | 0.64 | 0.09 | 0.66 | 0.08 |
5195 | exec - domain_omp.cpp:71-153 | LAMMPS_NS::DomainOMP::pbc() [clone .extracted] | Single | 0.28 | 0.21 | 0.19 | 0.17 | 0.17 | 0.16 | 0.15 | 0.13 | 0.12 | 0.12 | 0.10 | 0.28 | 0.21 | 0.19 | 0.17 | 0.17 | 0.16 | 0.15 | 0.13 | 0.12 | 0.12 | 0.10 | 2.56 | 0.22 | 0.17 | 0.14 | 0.13 | 0.13 | 0.12 | 0.10 | 0.09 | 0.08 | 0.07 | 2.56 | 0.22 | 0.17 | 0.14 | 0.13 | 0.13 | 0.12 | 0.10 | 0.09 | 0.08 | 0.07 | 2.23 | 0.16 | 0.11 | 0.09 | 0.09 | 0.07 | 0.06 | 0.05 | 0.05 | 0.04 | 0.04 | 2.23 | 0.16 | 0.11 | 0.09 | 0.09 | 0.07 | 0.06 | 0.05 | 0.05 | 0.04 | 0.04 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.05 | 0.07 | 0.07 | 0.07 | 0.10 | 0.08 | 0.11 | 0.11 | 0.16 | 0.11 | 13.64 | 11.93 | 1.35 | 1.61 | 10.29 | 1.15 | 1.45 | 1.5 | 1.64 | 1.54 | 1.71 | 1.95 | 1.86 | 1.8 | 1.99 | 2.08 | NA | NA | NA | NA | NA | 1 | 0 | 1.19 | 0 | 1.23 | 0 | 1.26 | 0 | 1.25 | 0 | 1.26 | 0 | 1.29 | 0 | 1.36 | 0 | 1.31 | 0 | 1.29 | 0 | 1.43 | 0 |
31396 | exec - pair_eam_intel.cpp:312-320 | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 0.27 | 0.24 | 0.24 | 0.23 | 0.23 | 0.24 | 0.23 | 0.23 | 0.23 | 0.23 | 0.23 | 0.27 | 0.24 | 0.24 | 0.23 | 0.23 | 0.24 | 0.23 | 0.23 | 0.23 | 0.23 | 0.23 | 2.23 | 0.24 | 0.20 | 0.18 | 0.16 | 0.16 | 0.13 | 0.14 | 0.14 | 0.13 | 0.13 | 2.23 | 0.24 | 0.20 | 0.18 | 0.16 | 0.16 | 0.13 | 0.14 | 0.14 | 0.13 | 0.13 | 2.20 | 0.18 | 0.14 | 0.12 | 0.12 | 0.11 | 0.10 | 0.09 | 0.09 | 0.08 | 0.09 | 2.20 | 0.18 | 0.14 | 0.12 | 0.12 | 0.11 | 0.10 | 0.09 | 0.09 | 0.08 | 0.09 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 53.64 | 649.43 | 834.01 | 1018.56 | 1029.93 | 1081.48 | 1235.15 | 1255.22 | 1325.95 | 1437.90 | 1368.09 | 100 | 87.5 | 1 | 1 | 1 | 1.02 | 1.33 | 1.38 | 1.57 | 1.4 | 1.48 | 1.42 | 1.5 | 1.64 | 1.65 | 1.58 | 0 | 1 | 0 | 0 | 3 | 1 | 0 | 1.02 | -0 | 0.97 | 0.01 | 0.95 | 0.01 | 0.91 | 0.02 | 0.85 | 0.04 | 0.82 | 0.04 | 0.73 | 0.06 | 0.69 | 0.07 | 0.67 | 0.08 | 0.6 | 0.09 |
7088 | exec - atom_vec.cpp:733-737 | LAMMPS_NS::AtomVec::unpack_reverse(int, int*, double*) | Single | 0.24 | 0.49 | 0.54 | 0.60 | 0.63 | 0.73 | 0.66 | 0.66 | 0.74 | 0.92 | 0.90 | 0.24 | 0.49 | 0.54 | 0.60 | 0.63 | 0.73 | 0.66 | 0.66 | 0.74 | 0.92 | 0.90 | 2.04 | 0.48 | 0.45 | 0.45 | 0.48 | 0.48 | 0.40 | 0.38 | 0.42 | 0.50 | 0.48 | 2.04 | 0.48 | 0.45 | 0.45 | 0.48 | 0.48 | 0.40 | 0.38 | 0.42 | 0.50 | 0.48 | 1.93 | 0.37 | 0.32 | 0.30 | 0.32 | 0.33 | 0.28 | 0.27 | 0.28 | 0.33 | 0.33 | 1.93 | 0.37 | 0.32 | 0.30 | 0.32 | 0.33 | 0.28 | 0.27 | 0.28 | 0.33 | 0.33 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 2.60 | 29.99 | 39.84 | 46.65 | 43.81 | 43.88 | 55.31 | 60.92 | 59.11 | 53.33 | 59.12 | 0 | 12.5 | 1.14 | 1.28 | 8 | 1.06 | 1.3 | 1.4 | 1.51 | 1.53 | 1.45 | 1.43 | 1.42 | 1.49 | 1.52 | 1.47 | 0 | 2 | 0 | 2 | 1 | 1 | 0 | 0.43 | 0.28 | 0.37 | 0.34 | 0.32 | 0.41 | 0.29 | 0.45 | 0.24 | 0.55 | 0.24 | 0.5 | 0.22 | 0.51 | 0.19 | 0.6 | 0.14 | 0.79 | 0.14 | 0.78 |
6728 | exec - atom.cpp:2361-2373 | LAMMPS_NS::Atom::sort() | Single | 0.21 | 0.19 | 0.18 | 0.17 | 0.16 | 0.16 | 0.14 | 0.14 | 0.14 | 0.13 | 0.13 | 0.21 | 0.19 | 0.18 | 0.17 | 0.16 | 0.16 | 0.14 | 0.14 | 0.14 | 0.13 | 0.13 | 1.72 | 0.20 | 0.17 | 0.15 | 0.12 | 0.13 | 0.10 | 0.11 | 0.09 | 0.08 | 0.09 | 1.72 | 0.20 | 0.17 | 0.15 | 0.12 | 0.13 | 0.10 | 0.11 | 0.09 | 0.08 | 0.09 | 1.68 | 0.14 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 1.68 | 0.14 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 10.31 | 119.70 | 163.54 | 200.05 | 218.21 | 232.81 | 283.56 | 310.83 | 335.21 | 376.74 | 363.74 | 0 | 8.84 | 1.7 | 2 | 12.8 | 1.03 | 1.39 | 1.57 | 1.74 | 1.59 | 1.83 | 1.64 | 1.98 | 1.76 | 1.64 | 1.82 | 1 | 0 | 0 | 4 | 0 | 1 | 0 | 0.97 | 0.01 | 0.99 | 0 | 0.97 | 0.01 | 1.01 | -0 | 0.94 | 0.01 | 0.98 | 0 | 0.94 | 0.01 | 0.91 | 0.01 | 0.91 | 0.01 | 0.83 | 0.02 |
730 | exec - neighbor.cpp:2479-2482 | LAMMPS_NS::Neighbor::build(int) | Single | 0.20 | 0.31 | 0.35 | 0.31 | 0.30 | 0.30 | 0.29 | 0.29 | 0.29 | 0.30 | 0.29 | 0.20 | 0.31 | 0.35 | 0.31 | 0.30 | 0.30 | 0.29 | 0.29 | 0.29 | 0.30 | 0.29 | 1.64 | 0.30 | 0.26 | 0.23 | 0.22 | 0.19 | 0.19 | 0.20 | 0.17 | 0.16 | 0.16 | 1.64 | 0.30 | 0.26 | 0.23 | 0.22 | 0.19 | 0.19 | 0.20 | 0.17 | 0.16 | 0.16 | 1.59 | 0.23 | 0.21 | 0.16 | 0.15 | 0.14 | 0.12 | 0.12 | 0.11 | 0.11 | 0.11 | 1.59 | 0.23 | 0.21 | 0.16 | 0.15 | 0.14 | 0.12 | 0.12 | 0.11 | 0.11 | 0.11 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 12.5 | 1.11 | 1 | 8 | 1.03 | 1.32 | 1.26 | 1.45 | 1.41 | 1.4 | 1.6 | 1.69 | 1.62 | 1.52 | 1.59 | 0 | 2 | 0 | 8 | 0 | 1 | 0 | 0.57 | 0.13 | 0.48 | 0.18 | 0.5 | 0.16 | 0.49 | 0.15 | 0.48 | 0.15 | 0.46 | 0.16 | 0.42 | 0.17 | 0.4 | 0.17 | 0.36 | 0.19 | 0.35 | 0.18 |
721 | exec - neighbor.cpp:2429-2434 | LAMMPS_NS::Neighbor::check_distance() | Single | 0.19 | 0.42 | 0.53 | 0.64 | 0.64 | 0.70 | 0.77 | 0.77 | 0.86 | 0.91 | 0.70 | 0.19 | 0.42 | 0.53 | 0.64 | 0.64 | 0.70 | 0.77 | 0.77 | 0.86 | 0.91 | 0.70 | 1.67 | 0.40 | 0.38 | 0.41 | 0.44 | 0.50 | 0.52 | 0.45 | 0.51 | 0.43 | 0.43 | 1.67 | 0.40 | 0.38 | 0.41 | 0.44 | 0.50 | 0.52 | 0.45 | 0.51 | 0.43 | 0.43 | 1.54 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.33 | 0.33 | 0.26 | 1.54 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.32 | 0.33 | 0.33 | 0.26 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 16.14 | 108.28 | 113.90 | 114.95 | 116.48 | 117.49 | 117.49 | 122.33 | 118.90 | 119.53 | 152.97 | 44.44 | 18.06 | 1 | 1.76 | 5.71 | 1.09 | 1.26 | 1.2 | 1.29 | 1.36 | 1.56 | 1.61 | 1.45 | 1.56 | 1.32 | 1.7 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | 0.4 | 0.25 | 0.3 | 0.37 | 0.24 | 0.49 | 0.23 | 0.49 | 0.2 | 0.56 | 0.17 | 0.64 | 0.15 | 0.65 | 0.13 | 0.75 | 0.12 | 0.8 | 0.14 | 0.61 |
31397 | exec - pair_eam_intel.cpp:340-361 [...] | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Innermost | 0.18 | 0.17 | 0.16 | 0.15 | 0.15 | 0.15 | 0.15 | 0.14 | 0.15 | 0.15 | 0.14 | 0.18 | 0.17 | 0.16 | 0.15 | 0.15 | 0.15 | 0.15 | 0.14 | 0.15 | 0.15 | 0.14 | 1.53 | 0.18 | 0.14 | 0.11 | 0.14 | 0.11 | 0.10 | 0.10 | 0.09 | 0.09 | 0.09 | 1.53 | 0.18 | 0.14 | 0.11 | 0.14 | 0.11 | 0.10 | 0.10 | 0.09 | 0.09 | 0.09 | 1.49 | 0.13 | 0.10 | 0.08 | 0.08 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 1.49 | 0.13 | 0.10 | 0.08 | 0.08 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 43.25 | 493.85 | 650.72 | 850.64 | 822.73 | 936.75 | 964.73 | 1066.98 | 1114.35 | 1176.91 | 1262.66 | 100 | 81.25 | 1.1 | 1 | 1.05 | 1.03 | 1.4 | 1.45 | 1.46 | 1.75 | 1.57 | 1.55 | 1.78 | 1.71 | 1.7 | 1.83 | 0 | 2 | 0 | 0 | 2 | 1 | 0 | 0.96 | 0.01 | 0.96 | 0.01 | 0.98 | 0 | 0.91 | 0.01 | 0.92 | 0.01 | 0.82 | 0.03 | 0.78 | 0.03 | 0.74 | 0.04 | 0.7 | 0.04 | 0.7 | 0.04 |
6218 | exec - nbin_intel.cpp:220-225 | void LAMMPS_NS::NBinIntel::bin_atoms<float, double>(LAMMPS_NS::IntelBuffers<float, double>*) | Single | 0.16 | 0.16 | 0.15 | 0.17 | 0.15 | 0.16 | 0.16 | 0.15 | 0.15 | 0.15 | 0.15 | 0.16 | 0.16 | 0.15 | 0.17 | 0.15 | 0.16 | 0.16 | 0.15 | 0.15 | 0.15 | 0.15 | 1.37 | 0.20 | 0.14 | 0.13 | 0.11 | 0.11 | 0.11 | 0.12 | 0.12 | 0.09 | 0.10 | 1.37 | 0.20 | 0.14 | 0.13 | 0.11 | 0.11 | 0.11 | 0.12 | 0.12 | 0.09 | 0.10 | 1.28 | 0.12 | 0.09 | 0.08 | 0.08 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.06 | 1.28 | 0.12 | 0.09 | 0.08 | 0.08 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.06 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 7.74 | 88.18 | 120.72 | 132.79 | 148.76 | 156.49 | 172.63 | 189.30 | 202.25 | 216.19 | 217.11 | 0 | 8.33 | 1 | 1 | 14.48 | 1.07 | 1.62 | 1.58 | 1.6 | 1.52 | 1.61 | 1.74 | 2.04 | 2.19 | 1.77 | 1.9 | 2 | 0 | 0 | 5 | 1 | 1 | 0 | 0.86 | 0.02 | 0.87 | 0.02 | 0.76 | 0.04 | 0.8 | 0.03 | 0.74 | 0.04 | 0.69 | 0.05 | 0.65 | 0.05 | 0.62 | 0.06 | 0.59 | 0.06 | 0.54 | 0.07 |
6226 | exec - intel_buffers.h:210-214 | void LAMMPS_NS::NBinIntel::bin_atoms<float, double>(LAMMPS_NS::IntelBuffers<float, double>*) | Single | 0.15 | 0.26 | 0.33 | 0.32 | 0.29 | 0.32 | 0.31 | 0.32 | 0.33 | 0.35 | 0.34 | 0.15 | 0.26 | 0.33 | 0.32 | 0.29 | 0.32 | 0.31 | 0.32 | 0.33 | 0.35 | 0.34 | 1.33 | 0.24 | 0.25 | 0.24 | 0.21 | 0.22 | 0.20 | 0.20 | 0.21 | 0.19 | 0.21 | 1.33 | 0.24 | 0.25 | 0.24 | 0.21 | 0.22 | 0.20 | 0.20 | 0.21 | 0.19 | 0.21 | 1.26 | 0.20 | 0.19 | 0.16 | 0.15 | 0.15 | 0.13 | 0.13 | 0.12 | 0.12 | 0.13 | 1.26 | 0.20 | 0.19 | 0.16 | 0.15 | 0.15 | 0.13 | 0.13 | 0.12 | 0.12 | 0.13 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 54.31 | 1 | 1 | 1.3 | 1.06 | 1.23 | 1.29 | 1.5 | 1.48 | 1.51 | 1.51 | 1.57 | 1.66 | 1.5 | 1.7 | 0 | 3 | 0 | 0 | 2 | 1 | 0 | 0.53 | 0.12 | 0.4 | 0.19 | 0.38 | 0.2 | 0.41 | 0.17 | 0.36 | 0.2 | 0.34 | 0.21 | 0.3 | 0.22 | 0.28 | 0.23 | 0.25 | 0.26 | 0.24 | 0.26 |
7005 | exec - atom_vec.cpp:360-364 | LAMMPS_NS::AtomVec::pack_comm(int, int*, double*, int, int*) | Single | 0.10 | 0.38 | 0.43 | 0.50 | 0.54 | 0.65 | 0.61 | 0.62 | 0.71 | 0.83 | 0.64 | 0.10 | 0.38 | 0.43 | 0.50 | 0.54 | 0.65 | 0.61 | 0.62 | 0.71 | 0.83 | 0.64 | 1.11 | 0.44 | 0.43 | 0.43 | 0.43 | 0.50 | 0.39 | 0.41 | 0.40 | 0.48 | 0.44 | 1.11 | 0.44 | 0.43 | 0.43 | 0.43 | 0.50 | 0.39 | 0.41 | 0.40 | 0.48 | 0.44 | 0.82 | 0.29 | 0.26 | 0.25 | 0.27 | 0.30 | 0.26 | 0.26 | 0.27 | 0.30 | 0.23 | 0.82 | 0.29 | 0.26 | 0.25 | 0.27 | 0.30 | 0.26 | 0.26 | 0.27 | 0.30 | 0.23 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 12.5 | 1.11 | 1 | 8 | 1.35 | 1.54 | 1.67 | 1.69 | 1.6 | 1.7 | 1.53 | 1.63 | 1.5 | 1.64 | 1.9 | 0 | 2 | 0 | 2 | 1 | 1 | 0 | 0.24 | 0.29 | 0.2 | 0.35 | 0.16 | 0.42 | 0.14 | 0.46 | 0.11 | 0.58 | 0.11 | 0.54 | 0.1 | 0.56 | 0.08 | 0.65 | 0.07 | 0.77 | 0.08 | 0.58 |
27792 | exec - npair_intel.cpp:353-354 | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 0.09 | 0.08 | 0.08 | 0.08 | 0.07 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.07 | 0.09 | 0.08 | 0.08 | 0.08 | 0.07 | 0.08 | 0.08 | 0.08 | 0.08 | 0.08 | 0.07 | 0.77 | 0.11 | 0.10 | 0.09 | 0.08 | 0.07 | 0.09 | 0.07 | 0.06 | 0.06 | 0.06 | 0.77 | 0.11 | 0.10 | 0.09 | 0.08 | 0.07 | 0.09 | 0.07 | 0.06 | 0.06 | 0.06 | 0.73 | 0.06 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.73 | 0.06 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 238 | 254 | 0.08 | 0.88 | 1.19 | 1.36 | 1.53 | 1.81 | 1.51 | 1.51 | 2.16 | 0.59 | 2.45 | 100 | 100 | 1 | 1 | 1 | 1.06 | 1.82 | 2.13 | 2.2 | 2.2 | 2.13 | 2.69 | 2.05 | 2.21 | 2.39 | 2.31 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0.96 | 0 | 0.97 | 0 | 0.89 | 0.01 | 0.95 | 0 | 0.86 | 0.01 | 0.82 | 0.01 | 0.72 | 0.02 | 0.69 | 0.02 | 0.67 | 0.02 | 0.66 | 0.02 |
7006 | exec - atom_vec.cpp:376-380 | LAMMPS_NS::AtomVec::pack_comm(int, int*, double*, int, int*) | Single | 0.09 | 0.11 | 0.12 | 0.13 | 0.12 | 0.12 | 0.12 | 0.12 | 0.13 | 0.13 | 0.10 | 0.09 | 0.11 | 0.12 | 0.13 | 0.12 | 0.12 | 0.12 | 0.12 | 0.13 | 0.13 | 0.10 | 0.91 | 0.28 | 0.26 | 0.29 | 0.31 | 0.21 | 0.25 | 0.23 | 0.25 | 0.22 | 0.12 | 0.91 | 0.28 | 0.26 | 0.29 | 0.31 | 0.21 | 0.25 | 0.23 | 0.25 | 0.22 | 0.12 | 0.73 | 0.08 | 0.07 | 0.06 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.73 | 0.08 | 0.07 | 0.06 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 6 | 64 | 80 | 96 | 106 | 116 | 128 | 144 | 160 | 180 | 184 | 3.26 | 30.54 | 35.38 | 38.86 | 39.63 | 46.36 | 47.75 | 52.35 | 52.47 | 54.71 | 72.44 | 0 | 12.5 | 1.11 | 1.11 | 8 | 1.26 | 3.08 | 3.16 | 3.66 | 4.16 | 3.12 | 3.67 | 3.55 | 3.91 | 3.56 | 2.49 | 0 | 2 | 0 | 2 | 1 | 1 | 0 | 0.74 | 0.03 | 0.66 | 0.04 | 0.57 | 0.05 | 0.56 | 0.05 | 0.54 | 0.06 | 0.51 | 0.06 | 0.48 | 0.06 | 0.42 | 0.07 | 0.39 | 0.08 | 0.48 | 0.05 |
27777 | exec - npair_intel.cpp:730-731 | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 0.09 | 0.08 | 0.08 | 0.07 | 0.07 | 0.07 | 0.07 | 0.06 | 0.07 | 0.07 | 0.06 | 0.09 | 0.08 | 0.08 | 0.07 | 0.07 | 0.07 | 0.07 | 0.06 | 0.07 | 0.07 | 0.06 | 0.77 | 0.09 | 0.08 | 0.09 | 0.08 | 0.08 | 0.08 | 0.06 | 0.07 | 0.06 | 0.06 | 0.77 | 0.09 | 0.08 | 0.09 | 0.08 | 0.08 | 0.08 | 0.06 | 0.07 | 0.06 | 0.06 | 0.71 | 0.06 | 0.05 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 0.71 | 0.06 | 0.05 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 58.84 | 696.19 | 906.45 | 1217.05 | 1210.28 | 1320.75 | 1339.94 | 1589.87 | 1636.82 | 1792.64 | 1824.17 | 0 | 6.25 | 1 | 1 | 16 | 1.09 | 1.41 | 1.77 | 2.48 | 2.46 | 2.5 | 2.59 | 2.28 | 2.89 | 2.35 | 2.83 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0.99 | 0 | 0.98 | 0 | 1.04 | -0 | 0.98 | 0 | 0.92 | 0.01 | 0.82 | 0.01 | 0.84 | 0.01 | 0.76 | 0.02 | 0.75 | 0.02 | 0.72 | 0.02 |
6217 | exec - nbin_intel.cpp:232-233 | void LAMMPS_NS::NBinIntel::bin_atoms<float, double>(LAMMPS_NS::IntelBuffers<float, double>*) | Innermost | 0.08 | 0.09 | 0.09 | 0.08 | 0.09 | 0.08 | 0.09 | 0.08 | 0.09 | 0.08 | 0.08 | 0.08 | 0.09 | 0.09 | 0.08 | 0.09 | 0.08 | 0.09 | 0.08 | 0.09 | 0.08 | 0.08 | 0.70 | 0.11 | 0.10 | 0.08 | 0.08 | 0.07 | 0.08 | 0.07 | 0.07 | 0.06 | 0.08 | 0.70 | 0.11 | 0.10 | 0.08 | 0.08 | 0.07 | 0.08 | 0.07 | 0.07 | 0.06 | 0.08 | 0.68 | 0.07 | 0.06 | 0.04 | 0.05 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.68 | 0.07 | 0.06 | 0.04 | 0.05 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 9.17 | 1 | 1 | 13.27 | 1.04 | 1.63 | 1.81 | 1.91 | 1.73 | 1.81 | 2.03 | 2.06 | 2.15 | 2.23 | 2.43 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0.8 | 0.02 | 0.77 | 0.02 | 0.81 | 0.02 | 0.7 | 0.03 | 0.73 | 0.02 | 0.65 | 0.03 | 0.62 | 0.03 | 0.57 | 0.04 | 0.58 | 0.03 | 0.51 | 0.04 |
6727 | exec - atom.cpp:2386-2387 | LAMMPS_NS::Atom::sort() | Innermost | 0.08 | 0.08 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.08 | 0.08 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.68 | 0.09 | 0.08 | 0.06 | 0.07 | 0.06 | 0.07 | 0.06 | 0.05 | 0.05 | 0.05 | 0.68 | 0.09 | 0.08 | 0.06 | 0.07 | 0.06 | 0.07 | 0.06 | 0.05 | 0.05 | 0.05 | 0.62 | 0.06 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.62 | 0.06 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 6 | 72 | 96 | 120 | 126 | 144 | 166 | 190 | 215 | 235 | 253 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 9.17 | 1 | 1 | 13.27 | 1.09 | 1.6 | 1.83 | 2.02 | 1.92 | 2.09 | 2.53 | 2.28 | 2.01 | 2.31 | 2.18 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0.87 | 0.01 | 0.95 | 0 | 0.96 | 0 | 0.87 | 0.01 | 0.9 | 0.01 | 0.81 | 0.01 | 0.75 | 0.02 | 0.77 | 0.01 | 0.73 | 0.02 | 0.71 | 0.02 |
5197 | exec - domain_omp.cpp:56-57 | LAMMPS_NS::DomainOMP::pbc() [clone .extracted.19] | Single | 0.07 | 0.13 | 0.16 | 0.17 | 0.17 | 0.18 | 0.20 | 0.19 | 0.19 | 0.20 | 0.15 | 0.07 | 0.13 | 0.16 | 0.17 | 0.17 | 0.18 | 0.20 | 0.19 | 0.19 | 0.20 | 0.15 | 0.59 | 0.15 | 0.15 | 0.14 | 0.13 | 0.13 | 0.14 | 0.13 | 0.12 | 0.13 | 0.10 | 0.59 | 0.15 | 0.15 | 0.14 | 0.13 | 0.13 | 0.14 | 0.13 | 0.12 | 0.13 | 0.10 | 0.53 | 0.10 | 0.10 | 0.09 | 0.09 | 0.08 | 0.08 | 0.08 | 0.07 | 0.07 | 0.05 | 0.53 | 0.10 | 0.10 | 0.09 | 0.09 | 0.08 | 0.08 | 0.08 | 0.07 | 0.07 | 0.05 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 66.67 | 68.75 | 1.5 | 1 | 1 | 1.11 | 1.58 | 1.57 | 1.6 | 1.5 | 1.63 | 1.73 | 1.71 | 1.6 | 1.79 | 1.87 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0.47 | 0.07 | 0.35 | 0.1 | 0.3 | 0.12 | 0.29 | 0.12 | 0.28 | 0.13 | 0.23 | 0.15 | 0.22 | 0.15 | 0.2 | 0.15 | 0.18 | 0.17 | 0.23 | 0.11 |
6216 | exec - nbin_intel.cpp:229-233 | void LAMMPS_NS::NBinIntel::bin_atoms<float, double>(LAMMPS_NS::IntelBuffers<float, double>*) | Outermost | 0.06 | 0.07 | 0.07 | 0.07 | 0.07 | 0.06 | 0.07 | 0.06 | 0.07 | 0.07 | 0.06 | 0.15 | 0.16 | 0.16 | 0.15 | 0.16 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.53 | 0.09 | 0.08 | 0.08 | 0.07 | 0.06 | 0.05 | 0.06 | 0.08 | 0.05 | 0.06 | 1.21 | 0.19 | 0.15 | 0.13 | 0.13 | 0.11 | 0.10 | 0.11 | 0.11 | 0.10 | 0.10 | 0.51 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 1.18 | 0.12 | 0.10 | 0.08 | 0.08 | 0.07 | 0.06 | 0.06 | 0.06 | 0.05 | 0.05 | 6 | 72 | 96 | 120 | 126 | 144 | 166 | 192 | 215 | 238 | 254 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 7.5 | 1 | 1 | 15.36 | 1.06 | 1.77 | 1.8 | 2.39 | 2.13 | 2.07 | 1.98 | 2.55 | 3.09 | 2.2 | 2.64 | NA | NA | NA | NA | NA | 1 | 0 | 0.82 | 0.01 | 0.75 | 0.02 | 0.75 | 0.02 | 0.73 | 0.02 | 0.72 | 0.02 | 0.65 | 0.02 | 0.62 | 0.02 | 0.54 | 0.03 | 0.5 | 0.03 | 0.52 | 0.03 |
6723 | exec - atom.cpp:2409-2411 | LAMMPS_NS::Atom::sort() | Innermost | 0.06 | 0.05 | 0.05 | 0.06 | 0.04 | 0.06 | 0.05 | 0.06 | 0.05 | 0.06 | 0.06 | 0.06 | 0.05 | 0.05 | 0.06 | 0.04 | 0.06 | 0.05 | 0.06 | 0.05 | 0.06 | 0.06 | 0.57 | 0.07 | 0.06 | 0.06 | 0.05 | 0.07 | 0.06 | 0.05 | 0.05 | 0.07 | 0.05 | 0.57 | 0.07 | 0.06 | 0.06 | 0.05 | 0.07 | 0.06 | 0.05 | 0.05 | 0.07 | 0.05 | 0.51 | 0.04 | 0.03 | 0.03 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.51 | 0.04 | 0.03 | 0.03 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 6 | 72 | 96 | 119 | 124 | 142 | 168 | 190 | 213 | 236 | 252 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 7.03 | 1 | 1 | 15.38 | 1.12 | 2 | 2.09 | 2.11 | 2.56 | 2.49 | 2.83 | 2.41 | 2.64 | 3.41 | 2.23 | 1 | 0 | 0 | 4 | 3 | 1 | 0 | 1.12 | 0 | 1.01 | -0 | 0.89 | 0.01 | 1.13 | 0 | 0.81 | 0.01 | 0.85 | 0.01 | 0.69 | 0.02 | 0.74 | 0.01 | 0.62 | 0.02 | 0.53 | 0.03 |
20951 | exec - pair_eam.cpp:912-914 | LAMMPS_NS::PairEAM::unpack_reverse_comm(int, int*, double*) | Single | 0.06 | 0.13 | 0.14 | 0.14 | 0.14 | 0.16 | 0.13 | 0.14 | 0.14 | 0.18 | 0.13 | 0.06 | 0.13 | 0.14 | 0.14 | 0.14 | 0.16 | 0.13 | 0.14 | 0.14 | 0.18 | 0.13 | 0.55 | 0.16 | 0.13 | 0.12 | 0.13 | 0.12 | 0.10 | 0.10 | 0.11 | 0.12 | 0.11 | 0.55 | 0.16 | 0.13 | 0.12 | 0.13 | 0.12 | 0.10 | 0.10 | 0.11 | 0.12 | 0.11 | 0.45 | 0.10 | 0.08 | 0.07 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.06 | 0.05 | 0.45 | 0.10 | 0.08 | 0.07 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.06 | 0.05 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 3.68 | 36.92 | 51.11 | 64.22 | 67.71 | 67.95 | 92.36 | 98.01 | 104.21 | 93.08 | 129.33 | 0 | 12.5 | 1 | 1.41 | 8 | 1.21 | 1.55 | 1.57 | 1.66 | 1.84 | 1.75 | 1.81 | 1.88 | 2.06 | 1.91 | 2.35 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 0.38 | 0.08 | 0.34 | 0.09 | 0.31 | 0.1 | 0.32 | 0.09 | 0.26 | 0.12 | 0.29 | 0.09 | 0.25 | 0.1 | 0.23 | 0.11 | 0.18 | 0.14 | 0.21 | 0.11 |
6726 | exec - atom.cpp:2384-2387 | LAMMPS_NS::Atom::sort() | Outermost | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.13 | 0.13 | 0.12 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.10 | 0.10 | 0.10 | 0.50 | 0.06 | 0.06 | 0.05 | 0.06 | 0.05 | 0.05 | 0.04 | 0.04 | 0.05 | 0.04 | 1.08 | 0.16 | 0.11 | 0.10 | 0.11 | 0.09 | 0.09 | 0.08 | 0.08 | 0.07 | 0.08 | 0.43 | 0.04 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 1.05 | 0.10 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 6 | 72 | 96 | 120 | 126 | 143 | 165 | 190 | 204 | 225 | 247 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 7.5 | 1 | 1 | 15.36 | 1.18 | 1.76 | 2.2 | 2.01 | 2.53 | 2.45 | 2.63 | 2.54 | 2.4 | 3.25 | 2.86 | NA | NA | NA | NA | NA | 1 | 0 | 0.97 | 0 | 0.91 | 0 | 0.86 | 0.01 | 0.94 | 0 | 0.8 | 0.01 | 0.74 | 0.01 | 0.76 | 0.01 | 0.75 | 0.01 | 0.74 | 0.01 | 0.66 | 0.01 |
6134 | exec - intel_buffers.cpp:624-624 | LAMMPS_NS::IntelBuffers<float, double>::fdotr_reduce_l5(int, int, int, int, double&, double&, double&, double&, double&, double&) | Single | 0.05 | 0.06 | 0.06 | 0.06 | 0.06 | 0.07 | 0.07 | 0.07 | 0.08 | 0.09 | 0.07 | 0.05 | 0.06 | 0.06 | 0.06 | 0.06 | 0.07 | 0.07 | 0.07 | 0.08 | 0.09 | 0.07 | 0.44 | 0.06 | 0.05 | 0.06 | 0.05 | 0.06 | 0.06 | 0.05 | 0.07 | 0.06 | 0.05 | 0.44 | 0.06 | 0.05 | 0.06 | 0.05 | 0.06 | 0.06 | 0.05 | 0.07 | 0.06 | 0.05 | 0.41 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.41 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 17.36 | 186.86 | 232.46 | 257.96 | 261.58 | 255.15 | 274.83 | 285.05 | 287.51 | 268.72 | 336.28 | 10.34 | 18.1 | 1 | 1.08 | 3.71 | 1.07 | 1.54 | 1.57 | 2.04 | 1.59 | 1.83 | 1.8 | 1.86 | 2.12 | 1.77 | 1.9 | 3 | 2 | 0 | 0 | 0 | 1 | 0 | 0.8 | 0.01 | 0.72 | 0.02 | 0.64 | 0.02 | 0.61 | 0.02 | 0.51 | 0.03 | 0.47 | 0.04 | 0.43 | 0.04 | 0.37 | 0.05 | 0.32 | 0.06 | 0.36 | 0.05 |
31372 | exec - pair_eam_intel.cpp:830-832 | LAMMPS_NS::PairEAMIntel::pack_forward_comm(int, int*, double*, int, int*) | Single | 0.05 | 0.06 | 0.06 | 0.07 | 0.07 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.05 | 0.06 | 0.06 | 0.07 | 0.07 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.43 | 0.08 | 0.07 | 0.06 | 0.09 | 0.06 | 0.07 | 0.06 | 0.05 | 0.05 | 0.05 | 0.43 | 0.08 | 0.07 | 0.06 | 0.09 | 0.06 | 0.07 | 0.06 | 0.05 | 0.05 | 0.05 | 0.39 | 0.05 | 0.04 | 0.03 | 0.04 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.39 | 0.05 | 0.04 | 0.03 | 0.04 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 6 | 72 | 96 | 119 | 126 | 144 | 168 | 189 | 215 | 239 | 248 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 55 | 1 | 1 | 1.5 | 1.12 | 1.79 | 1.85 | 1.88 | 2.41 | 2.2 | 2.48 | 2.67 | 2.23 | 2.06 | 2.42 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 0.68 | 0.02 | 0.63 | 0.02 | 0.56 | 0.03 | 0.52 | 0.03 | 0.54 | 0.03 | 0.52 | 0.03 | 0.5 | 0.03 | 0.48 | 0.03 | 0.44 | 0.03 | 0.41 | 0.04 |
27787 | exec - npair_intel.cpp:392-398 | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.41 | 0.06 | 0.05 | 0.04 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.05 | 0.04 | 0.41 | 0.06 | 0.05 | 0.04 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.05 | 0.04 | 0.38 | 0.04 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.38 | 0.04 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 6 | 72 | 95 | 117 | 123 | 136 | 161 | 182 | 209 | 218 | 231 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 6.25 | 1.78 | 1 | 16 | 1.1 | 1.7 | 1.87 | 2.25 | 2.62 | 2.6 | 2.84 | 2.66 | 2.66 | 3.41 | 2.75 | 1 | 5 | 0 | 0 | 1 | 1 | 0 | 0.89 | 0 | 0.89 | 0 | 0.97 | 0 | 0.88 | 0.01 | 0.87 | 0.01 | 0.72 | 0.01 | 0.73 | 0.01 | 0.64 | 0.02 | 0.7 | 0.01 | 0.67 | 0.01 |
6722 | exec - atom.cpp:2405-2414 | LAMMPS_NS::Atom::sort() | Outermost | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.10 | 0.08 | 0.08 | 0.08 | 0.07 | 0.09 | 0.08 | 0.08 | 0.08 | 0.08 | 0.09 | 0.36 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.88 | 0.10 | 0.09 | 0.09 | 0.07 | 0.09 | 0.09 | 0.07 | 0.07 | 0.07 | 0.07 | 0.27 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.78 | 0.06 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 6 | 71 | 95 | 109 | 119 | 137 | 154 | 173 | 194 | 214 | 222 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 6.82 | 9.52 | 1 | 1 | 13.47 | 1.31 | 2.21 | 2.49 | 2.56 | 2.6 | 3.11 | 2.85 | 2.67 | 3.3 | 4.16 | 2.41 | NA | NA | NA | NA | NA | 1 | 0 | 1.01 | -0 | 0.94 | 0 | 0.95 | 0 | 0.88 | 0 | 0.81 | 0.01 | 0.74 | 0.01 | 0.71 | 0.01 | 0.68 | 0.01 | 0.69 | 0.01 | 0.7 | 0.01 |
7141 | exec - atom_vec.cpp:1033-1039 [...] | LAMMPS_NS::AtomVec::unpack_border(int, int, double*) | Single | 0.03 | 0.07 | 0.06 | 0.08 | 0.08 | 0.08 | 0.09 | 0.09 | 0.10 | 0.09 | 0.12 | 0.03 | 0.07 | 0.06 | 0.08 | 0.08 | 0.08 | 0.09 | 0.09 | 0.10 | 0.09 | 0.12 | 0.32 | 0.09 | 0.08 | 0.07 | 0.08 | 0.07 | 0.08 | 0.08 | 0.07 | 0.07 | 0.08 | 0.32 | 0.09 | 0.08 | 0.07 | 0.08 | 0.07 | 0.08 | 0.08 | 0.07 | 0.07 | 0.08 | 0.24 | 0.06 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.04 | 0.03 | 0.04 | 0.24 | 0.06 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.04 | 0.03 | 0.04 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 238 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 10.42 | 2 | 1 | 10.67 | 1.34 | 1.61 | 2.11 | 1.86 | 2.07 | 1.96 | 2.29 | 2.45 | 2.08 | 2.22 | 1.87 | 0 | 4 | 1 | 1 | 0 | 1 | 0 | 0.36 | 0.05 | 0.39 | 0.04 | 0.3 | 0.06 | 0.29 | 0.05 | 0.28 | 0.06 | 0.23 | 0.07 | 0.21 | 0.07 | 0.18 | 0.08 | 0.18 | 0.08 | 0.13 | 0.1 |
7105 | exec - atom_vec.cpp:802-809 [...] | LAMMPS_NS::AtomVec::pack_border(int, int*, double*, int, int*) | Single | 0.03 | 0.07 | 0.08 | 0.09 | 0.10 | 0.10 | 0.10 | 0.10 | 0.10 | 0.11 | 0.09 | 0.03 | 0.07 | 0.08 | 0.09 | 0.10 | 0.10 | 0.10 | 0.10 | 0.10 | 0.11 | 0.09 | 0.28 | 0.11 | 0.10 | 0.09 | 0.10 | 0.09 | 0.09 | 0.10 | 0.08 | 0.08 | 0.08 | 0.28 | 0.11 | 0.10 | 0.09 | 0.10 | 0.09 | 0.09 | 0.10 | 0.08 | 0.08 | 0.08 | 0.21 | 0.06 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.21 | 0.06 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 12.5 | 1.29 | 1 | 8 | 1.34 | 1.97 | 2.02 | 2.06 | 2.04 | 2.05 | 2.07 | 2.25 | 2.15 | 2.02 | 2.32 | 0 | 1 | 0 | 2 | 4 | 1 | 0 | 0.31 | 0.05 | 0.26 | 0.06 | 0.23 | 0.07 | 0.2 | 0.08 | 0.19 | 0.08 | 0.18 | 0.08 | 0.15 | 0.09 | 0.16 | 0.08 | 0.13 | 0.1 | 0.14 | 0.08 |
7106 | exec - atom_vec.cpp:821-828 [...] | LAMMPS_NS::AtomVec::pack_border(int, int*, double*, int, int*) | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.30 | 0.06 | 0.05 | 0.04 | 0.05 | 0.05 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 0.30 | 0.06 | 0.05 | 0.04 | 0.05 | 0.05 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 0.20 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.20 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 6 | 64 | 80 | 96 | 106 | 116 | 127 | 144 | 160 | 179 | 183 | 2.53 | 28.71 | 38.03 | 44.97 | 43.56 | 54.52 | 53.15 | 67.41 | 76.04 | 77.18 | 88.96 | 0 | 12.5 | 1.29 | 1 | 8 | 1.51 | 3.04 | 3 | 2.75 | 3.67 | 3.6 | 5.77 | 5.56 | 5.85 | 4.49 | 4.65 | 0 | 1 | 0 | 2 | 4 | 1 | 0 | 0.92 | 0 | 0.88 | 0 | 0.84 | 0 | 0.81 | 0 | 0.8 | 0 | 0.71 | 0.01 | 0.75 | 0 | 0.77 | 0 | 0.73 | 0.01 | 0.73 | 0 |
31387 | exec - pair_eam_intel.cpp:440-464 | void LAMMPS_NS::PairEAMIntel::eval<1, 1, 1, float, double>(int, int, LAMMPS_NS::IntelBuffers<float, double>*, LAMMPS_NS::PairEAMIntel::ForceConst<float> const&, int, int) [clone .extracted] | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.17 | 0.03 | 0.04 | 0.03 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.17 | 0.03 | 0.04 | 0.03 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.15 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.15 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 56.53 | 601.78 | 605.11 | 981.04 | 981.87 | 924.62 | 909.96 | 987.03 | 967.52 | 1373.64 | 1296.59 | 100 | 79.76 | 1.07 | 1 | 1.07 | 1.1 | 2.39 | 2.56 | 3.29 | 3.92 | 3.04 | 3.7 | 2.83 | 3.56 | 3.73 | 3.8 | 0 | 0 | 0 | 0 | 5 | 1 | 0 | 0.88 | 0 | 0.7 | 0.01 | 0.84 | 0 | 0.82 | 0 | 0.65 | 0.01 | 0.58 | 0.01 | 0.54 | 0.01 | 0.51 | 0.01 | 0.57 | 0.01 | 0.54 | 0.01 |
9521 | exec - compute_temp.cpp:90-92 | LAMMPS_NS::ComputeTemp::compute_scalar() | Single | 0.02 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.05 | 0.05 | 0.02 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.05 | 0.05 | 0.16 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.16 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.15 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.15 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 6 | 72 | 96 | 120 | 126 | 144 | 168 | 192 | 216 | 240 | 256 | 23.06 | 149.00 | 165.90 | 163.96 | 179.37 | 159.30 | 165.99 | 170.82 | 190.18 | 205.13 | 193.90 | 100 | 75.69 | 1.01 | 1 | 1.01 | 1.08 | 1.69 | 2.1 | 2.01 | 2.75 | 2.01 | 2.38 | 2.15 | 2.38 | 2.9 | 2.67 | 0 | 2 | 0 | 0 | 2 | 1 | 0 | 0.52 | 0.01 | 0.43 | 0.02 | 0.37 | 0.02 | 0.39 | 0.02 | 0.31 | 0.03 | 0.28 | 0.03 | 0.25 | 0.03 | 0.24 | 0.03 | 0.21 | 0.04 | 0.2 | 0.04 |
31375 | exec - pair_eam_intel.cpp:847-847 | LAMMPS_NS::PairEAMIntel::unpack_forward_comm(int, int, double*) | Single | 0.02 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.02 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.17 | 0.05 | 0.06 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.17 | 0.05 | 0.06 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.14 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.14 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 6 | 72 | 96 | 119 | 124 | 138 | 160 | 188 | 204 | 224 | 236 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100 | 75 | 1 | 1 | 1 | 1.22 | 2.12 | 2.56 | 2.55 | 2.61 | 2.88 | 2.35 | 2.62 | 2.78 | 2.73 | 2.75 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0.46 | 0.02 | 0.38 | 0.02 | 0.33 | 0.03 | 0.33 | 0.03 | 0.36 | 0.02 | 0.31 | 0.03 | 0.3 | 0.03 | 0.29 | 0.03 | 0.26 | 0.03 | 0.22 | 0.03 |
27789 | exec - npair_intel.cpp:358-371 [...] | void LAMMPS_NS::NPairIntel::bin_newton<float, double, 0, 0, 0, 0, 0>(int, LAMMPS_NS::NeighList*, LAMMPS_NS::IntelBuffers<float, double>*, int, int, int) [clone .extracted] | Innermost | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.12 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.02 | 0.12 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.02 | 0.09 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.09 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 6 | 65 | 79 | 81 | 95 | 93 | 110 | 117 | 137 | 165 | 140 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 6.25 | 1.44 | 1 | 16 | 1.41 | 2.61 | 3.95 | 2.87 | 2.91 | 3.3 | 2.76 | 3.62 | 3.04 | 3.33 | 3.32 | 1 | 5 | 0 | 0 | 0 | 1 | 0 | 0.59 | 0.01 | 0.64 | 0.01 | 0.72 | 0 | 0.62 | 0 | 0.6 | 0.01 | 0.51 | 0.01 | 0.52 | 0.01 | 0.45 | 0.01 | 0.41 | 0.01 | 0.48 | 0.01 |