* [MAQAO] Info: Detected 96 Lprof instances in ip-172-31-47-249.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
[0mLAMMPS (22 Jul 2025)
OMP_NUM_THREADS environment is not set. Defaulting to 1 thread.
using 1 OpenMP thread(s) per MPI task
Lattice spacing in x,y,z = 3.615 3.615 3.615
Created orthogonal box = (0 0 0) to (1156.8 578.4 578.4)
6 by 4 by 4 MPI processor grid
Created 32768000 atoms
using lattice units in orthogonal box = (0 0 0) to (1156.8 578.4 578.4)
create_atoms CPU = 0.063 seconds
Neighbor list info ...
update: every = 1 steps, delay = 5 steps, check = yes
max neighbors/atom: 2000, page size: 100000
master list distance cutoff = 5.95
ghost atom cutoff = 5.95
binsize = 2.975, bins = 389 195 195
1 neighbor lists, perpetual/occasional/extra = 1 0 0
(1) pair eam, perpetual
attributes: half, newton on
pair build: half/bin/atomonly/newton
stencil: half/bin/3d
bin: standard
Setting up Verlet run ...
Unit style : metal
Current step : 0
Time step : 0.005
Per MPI rank memory allocation (min/avg/max) = 133.3 | 135.2 | 136.8 Mbytes
Step Temp E_pair E_mol TotEng Press
0 1600 -1.1599872e+08 0 -1.0922177e+08 18704.157
10 475.61659 -1.1120972e+08 0 -1.091952e+08 64949.732
Loop time of 2.80526 on 96 procs for 10 steps with 32768000 atoms
Performance: 1.540 ns/day, 15.585 hours/ns, 3.565 timesteps/s, 116.809 Matom-step/s
99.3% CPU use with 96 MPI tasks x 1 OpenMP threads
MPI task timing breakdown:
Section | min time | avg time | max time |%varavg| %total
---------------------------------------------------------------
Pair | 2.1123 | 2.1565 | 2.1979 | 1.5 | 76.87
Neigh | 0.30416 | 0.30781 | 0.31577 | 0.4 | 10.97
Comm | 0.069474 | 0.11578 | 0.16113 | 6.7 | 4.13
Output | 0.0027881 | 0.0029144 | 0.0032262 | 0.2 | 0.10
Modify | 0.1632 | 0.17887 | 0.1868 | 1.1 | 6.38
Other | | 0.04342 | | | 1.55
Nlocal: 341333 ave 342550 max 340635 min
Histogram: 39 25 0 0 0 0 0 0 11 21
Nghost: 87173.1 ave 88346 max 85099 min
Histogram: 32 0 0 0 0 0 0 0 2 62
Neighs: 1.28138e+07 ave 1.28704e+07 max 1.27717e+07 min
Histogram: 12 18 14 17 3 0 4 12 5 11
Total # of neighbors = 1.2301205e+09
Ave neighs/atom = 37.540299
Neighbor list builds = 1
Dangerous builds = 0
Setting up Verlet run ...
Unit style : metal
Current step : 10
Time step : 0.005
Per MPI rank memory allocation (min/avg/max) = 133.3 | 135.3 | 137.2 Mbytes
Step Temp E_pair E_mol TotEng Press
10 475.61659 -1.1120972e+08 0 -1.091952e+08 64949.732
50 780.66035 -1.1250592e+08 0 -1.0919935e+08 52288.914
100 798.44003 -1.1258168e+08 0 -1.0919981e+08 51469.262
110 797.58056 -1.1257807e+08 0 -1.0919984e+08 51503.229
Loop time of 38.7391 on 96 procs for 100 steps with 32768000 atoms
Performance: 1.115 ns/day, 21.522 hours/ns, 2.581 timesteps/s, 84.586 Matom-step/s
99.7% CPU use with 96 MPI tasks x 1 OpenMP threads
MPI task timing breakdown:
Section | min time | avg time | max time |%varavg| %total
---------------------------------------------------------------
Pair | 29.374 | 29.742 | 29.977 | 2.6 | 76.77
Neigh | 5.4213 | 5.5519 | 5.6539 | 2.5 | 14.33
Comm | 0.98518 | 1.3146 | 1.8004 | 17.2 | 3.39
Output | 0.0084401 | 0.0088049 | 0.0096531 | 0.3 | 0.02
Modify | 1.5898 | 1.7448 | 1.8137 | 5.3 | 4.50
Other | | 0.3775 | | | 0.97
Nlocal: 341333 ave 342676 max 340281 min
Histogram: 6 21 22 14 1 0 0 2 15 15
Nghost: 87085.4 ave 88566 max 84969 min
Histogram: 21 11 0 0 0 0 1 15 36 12
Neighs: 1.28772e+07 ave 1.29441e+07 max 1.2825e+07 min
Histogram: 7 11 20 16 9 2 8 8 11 4
Total # of neighbors = 1.2362091e+09
Ave neighs/atom = 37.726109
Neighbor list builds = 17
Dangerous builds = 4
Total wall time: 0:00:43
[MAQAO] Info: 95/96 lprof instances finished
Your experiment path is /home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0
To display your profiling results:
################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/Tools/QaaS/qaas_runs/175-397-2310/intel/LAMMPS/run/base_runs/compilers/armclang_4/run_1753973878/OV1_armclang_4/tools/lprof_npsu_run_0 #
################################################################################################################################################################################################################