   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev
LLNL-CODE-775068
Copyright (c) 2014-23, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 1 threads on rank 0
0-> 0
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.03330
LPlusTimes 10 5.45965
LTimes 10 7.74801
Population 10 1.24607
Scattering 10 255.14928
Solve 1 275.37393
Source 10 0.01263
SweepSolver 10 5.46533
SweepSubdomain 160 4.65820
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.033301,5.459647,7.748011,1.246070,255.149281,275.373925,0.012626,5.465332,4.658199
Figures of Merit
================
Throughput: 2.193308e+07 [unknowns/(second/iteration)]
Grind time : 4.559324e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 85.23175 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0 #
##############################################################################################################################################
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 2 threads on rank 0
0-> 0 1-> 12
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.01342
LPlusTimes 10 2.94013
LTimes 10 4.06437
Population 10 0.46831
Scattering 10 127.55165
Solve 1 138.54869
Source 10 0.00692
SweepSolver 10 3.19927
SweepSubdomain 160 2.42637
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.013423,2.940127,4.064372,0.468314,127.551653,138.548686,0.006925,3.199269,2.426366
Figures of Merit
================
Throughput: 4.359332e+07 [unknowns/(second/iteration)]
Grind time : 2.293929e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 75.84126 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1 #
##############################################################################################################################################
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 4 threads on rank 0
0-> 0 1-> 6 2-> 12 3-> 18
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.01269
LPlusTimes 10 1.56666
LTimes 10 2.09938
Population 10 0.36848
Scattering 10 64.29046
Solve 1 70.52734
Source 10 0.00359
SweepSolver 10 1.88282
SweepSubdomain 160 1.23904
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.012690,1.566664,2.099383,0.368484,64.290465,70.527335,0.003592,1.882816,1.239037
Figures of Merit
================
Throughput: 8.563769e+07 [unknowns/(second/iteration)]
Grind time : 1.167710e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 65.80765 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2 #
##############################################################################################################################################
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 8 threads on rank 0
0-> 0 1-> 3 2-> 6 3-> 9 4-> 12 5-> 15 6-> 18 7-> 21
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.03197
LPlusTimes 10 1.06065
LTimes 10 1.12910
Population 10 0.18079
Scattering 10 33.24805
Solve 1 37.48769
Source 10 0.00199
SweepSolver 10 1.55031
SweepSubdomain 160 0.63917
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.031971,1.060648,1.129102,0.180785,33.248052,37.487690,0.001992,1.550315,0.639170
Figures of Merit
================
Throughput: 1.611142e+08 [unknowns/(second/iteration)]
Grind time : 6.206779e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 41.22841 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3 #
##############################################################################################################################################
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 16 threads on rank 0
0-> 0 1->193 2-> 3 3->196 4-> 6 5->199 6-> 9 7->202
8-> 12 9->205 10-> 15 11->208 12-> 18 13->211 14-> 21 15->214
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.01359
LPlusTimes 10 0.94547
LTimes 10 0.82533
Population 10 0.16172
Scattering 10 20.35638
Solve 1 23.21753
Source 10 0.00101
SweepSolver 10 0.61617
SweepSubdomain 160 0.35685
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.013589,0.945470,0.825329,0.161724,20.356383,23.217532,0.001011,0.616170,0.356851
Figures of Merit
================
Throughput: 2.601395e+08 [unknowns/(second/iteration)]
Grind time : 3.844091e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 57.91442 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4 #
##############################################################################################################################################
Compilation Options:
Architecture: OpenMP
Compiler: /home/eoseret/aocc-compiler-5.0.0/bin/clang++
Compiler Flags: "-g -grecord-gcc-switches -fno-omit-frame-pointer "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 24 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 7
8-> 8 9-> 9 10-> 10 11-> 11 12-> 12 13-> 13 14-> 14 15-> 15
16-> 16 17-> 17 18-> 18 19-> 19 20-> 20 21-> 21 22-> 22 23-> 23
Input Parameters
================
Problem Size:
Zones: 24 x 16 x 16 (6144 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 8
Spatial decomp: 2 x 2 x 2 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 8 1 / 8
(Rx,Ry,Rz) R in XYZ: 2x2x2 1x1x1 / 2x2x2
(PQR) TOTAL: 8 16 / 128
Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 24 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 37748736 288.000
k_plane 37748736 288.000
mixelem_to_fraction 6320 0.048
phi 157286400 1200.000
phi_out 157286400 1200.000
psi 603979776 4608.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 603979776 4608.000
sigt_zonal 6291456 48.000
volume 6144 0.047
-------- ------------ ---------
TOTAL 1645233448 12552.135
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.289596e+09, change=1.000000e+00
iter 1: particle count=1.938605e+09, change=3.347815e-01
iter 2: particle count=2.261901e+09, change=1.429312e-01
iter 3: particle count=2.422372e+09, change=6.624562e-02
iter 4: particle count=2.501793e+09, change=3.174531e-02
iter 5: particle count=2.540981e+09, change=1.542247e-02
iter 6: particle count=2.560258e+09, change=7.529487e-03
iter 7: particle count=2.569713e+09, change=3.679282e-03
iter 8: particle count=2.574337e+09, change=1.796232e-03
iter 9: particle count=2.576593e+09, change=8.754707e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.02158
LPlusTimes 10 0.55113
LTimes 10 0.56632
Population 10 0.09146
Scattering 10 16.87966
Solve 1 19.13998
Source 10 0.00076
SweepSolver 10 0.74002
SweepSubdomain 160 0.28208
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.021584,0.551130,0.566323,0.091459,16.879655,19.139984,0.000764,0.740015,0.282082
Figures of Merit
================
Throughput: 3.155592e+08 [unknowns/(second/iteration)]
Grind time : 3.168978e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 38.11847 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 603979776
END
Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5
To display your profiling results:
##############################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5 #
##############################################################################################################################################
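
The "Figures of Merit" blocks above can be re-derived from the TIMER_NAMES / TIMER_DATA lines. The following is a minimal Python sketch, not part of Kripke or MAQAO. The helper names (parse_timers, figures_of_merit) are hypothetical. The sweep-efficiency formula is the one printed in the log; the throughput formula (unknowns * iterations / Solve time) is an assumption that happens to reproduce the logged values. The Solve times are copied from the six runs, keyed by OpenMP threads per rank.

```python
# Sketch: recompute Kripke figures of merit and thread scaling from the log.
# All helper names here are illustrative, not Kripke or MAQAO APIs.

UNKNOWNS = 603_979_776   # "Number of unknowns" from the log
ITERATIONS = 10          # "Number iterations" from the log


def parse_timers(names_line, data_line):
    """Pair a TIMER_NAMES line with its TIMER_DATA line into a dict."""
    names = names_line.split(":", 1)[1].split(",")
    values = [float(v) for v in data_line.split(":", 1)[1].split(",")]
    return dict(zip(names, values))


def figures_of_merit(timers, unknowns=UNKNOWNS, iterations=ITERATIONS):
    """Recompute the three figures of merit from the timer values.

    Throughput formula is an assumption inferred from the logged numbers;
    sweep efficiency uses the formula printed in the log itself.
    """
    throughput = unknowns * iterations / timers["Solve"]
    return {
        "throughput": throughput,
        "grind_time": 1.0 / throughput,
        "sweep_efficiency": 100.0 * timers["SweepSubdomain"] / timers["SweepSolver"],
    }


# Timer lines copied verbatim from the 1-thread run.
names = ("TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,"
         "Solve,Source,SweepSolver,SweepSubdomain")
data = ("TIMER_DATA:0.033301,5.459647,7.748011,1.246070,255.149281,"
        "275.373925,0.012626,5.465332,4.658199")

timers = parse_timers(names, data)
fom = figures_of_merit(timers)

# Solve times (seconds) from the six runs, keyed by OpenMP threads per rank.
solve_seconds = {
    1: 275.373925,
    2: 138.548686,
    4: 70.527335,
    8: 37.487690,
    16: 23.217532,
    24: 19.139984,
}

# Speedup relative to the 1-thread run, and parallel efficiency in percent.
speedup = {n: solve_seconds[1] / t for n, t in solve_seconds.items()}
efficiency = {n: 100.0 * s / n for n, s in speedup.items()}

if __name__ == "__main__":
    print(fom)
    for n in solve_seconds:
        print(f"{n:2d} threads: speedup {speedup[n]:.2f}, efficiency {efficiency[n]:.1f}%")
```

The same parse_timers call can be applied to the TIMER lines of any other run to compare per-kernel times such as Scattering, which accounts for most of the Solve time in every run.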