Executable Output


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 1 threads on rank 0
    0->  0

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.03330
  LPlusTimes                  10       5.45965
  LTimes                      10       7.74801
  Population                  10       1.24607
  Scattering                  10     255.14928
  Solve                        1     275.37393
  Source                      10       0.01263
  SweepSolver                 10       5.46533
  SweepSubdomain             160       4.65820

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.033301,5.459647,7.748011,1.246070,255.149281,275.373925,0.012626,5.465332,4.658199

Figures of Merit
================

  Throughput:         2.193308e+07 [unknowns/(second/iteration)]
  Grind time :        4.559324e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  85.23175 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END
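
The figures of merit above can be cross-checked from the TIMER_NAMES / TIMER_DATA lines. The following is a minimal sketch, not Kripke's own code: the sweep efficiency formula is the one printed in the log, while the throughput formula (unknowns x iterations / Solve time) is an assumption inferred from the printed units that happens to reproduce the reported value. The variable names are illustrative.

```python
# Values copied verbatim from the 1-thread run above.
timer_names = ("Generate,LPlusTimes,LTimes,Population,Scattering,"
               "Solve,Source,SweepSolver,SweepSubdomain").split(",")
timer_data = ("0.033301,5.459647,7.748011,1.246070,255.149281,"
              "275.373925,0.012626,5.465332,4.658199").split(",")

# Pair each timer name with its elapsed seconds.
timers = dict(zip(timer_names, map(float, timer_data)))

unknowns = 603979776   # "Number of unknowns" from the log
iterations = 10        # "Number iterations" from the log

# Assumption: throughput is unknowns processed per second of Solve time,
# summed over all iterations. Grind time is its reciprocal.
throughput = unknowns * iterations / timers["Solve"]
grind_time = 1.0 / throughput

# Formula as printed in the log: 100.0 * SweepSubdomain time / SweepSolver time
sweep_efficiency = 100.0 * timers["SweepSubdomain"] / timers["SweepSolver"]

print(f"Throughput:       {throughput:.6e}")
print(f"Grind time:       {grind_time:.6e}")
print(f"Sweep efficiency: {sweep_efficiency:.5f}")
```

The recomputed values agree with the printed ones only up to the rounding of the six-decimal TIMER_DATA entries.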


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_0  #
##############################################################################################################################################


Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 2 threads on rank 0
    0->  0    1-> 12

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01342
  LPlusTimes                  10       2.94013
  LTimes                      10       4.06437
  Population                  10       0.46831
  Scattering                  10     127.55165
  Solve                        1     138.54869
  Source                      10       0.00692
  SweepSolver                 10       3.19927
  SweepSubdomain             160       2.42637

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.013423,2.940127,4.064372,0.468314,127.551653,138.548686,0.006925,3.199269,2.426366

Figures of Merit
================

  Throughput:         4.359332e+07 [unknowns/(second/iteration)]
  Grind time :        2.293929e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  75.84126 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_1  #
##############################################################################################################################################


Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 4 threads on rank 0
    0->  0    1->  6    2-> 12    3-> 18

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01269
  LPlusTimes                  10       1.56666
  LTimes                      10       2.09938
  Population                  10       0.36848
  Scattering                  10      64.29046
  Solve                        1      70.52734
  Source                      10       0.00359
  SweepSolver                 10       1.88282
  SweepSubdomain             160       1.23904

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.012690,1.566664,2.099383,0.368484,64.290465,70.527335,0.003592,1.882816,1.239037

Figures of Merit
================

  Throughput:         8.563769e+07 [unknowns/(second/iteration)]
  Grind time :        1.167710e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  65.80765 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_2  #
##############################################################################################################################################


Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 8 threads on rank 0
    0->  0    1->  3    2->  6    3->  9    4-> 12    5-> 15    6-> 18    7-> 21

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.03197
  LPlusTimes                  10       1.06065
  LTimes                      10       1.12910
  Population                  10       0.18079
  Scattering                  10      33.24805
  Solve                        1      37.48769
  Source                      10       0.00199
  SweepSolver                 10       1.55031
  SweepSubdomain             160       0.63917

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.031971,1.060648,1.129102,0.180785,33.248052,37.487690,0.001992,1.550315,0.639170

Figures of Merit
================

  Throughput:         1.611142e+08 [unknowns/(second/iteration)]
  Grind time :        6.206779e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  41.22841 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END
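
The Solve timers of the runs logged so far (1, 2, 4 and 8 OpenMP threads per rank) can be turned into a strong-scaling summary. This is a rough sketch under the assumption that the runs differ only in thread count, which the identical input parameters suggest but the log does not state explicitly. The dictionary layout and names are illustrative.

```python
# Solve time in seconds, keyed by OpenMP threads per rank,
# copied from the "Solve" rows of the Timers tables above.
solve_seconds = {
    1: 275.37393,
    2: 138.54869,
    4: 70.52734,
    8: 37.48769,
}

baseline = solve_seconds[1]

# Speedup relative to the 1-thread run, and parallel efficiency
# (speedup divided by thread count).
speedup = {n: baseline / t for n, t in solve_seconds.items()}
efficiency = {n: speedup[n] / n for n in solve_seconds}

print(f"{'Threads':>7}  {'Solve (s)':>10}  {'Speedup':>8}  {'Efficiency':>10}")
for n in sorted(solve_seconds):
    print(f"{n:>7}  {solve_seconds[n]:>10.5f}  "
          f"{speedup[n]:>8.3f}  {efficiency[n]:>10.3f}")
```

Solve time is dominated by the Scattering timer in every run, so this summary mostly reflects how that kernel scales with thread count.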


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_3  #
##############################################################################################################################################


Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 16 threads on rank 0
    0->  0    1->193    2->  3    3->196    4->  6    5->199    6->  9    7->202
    8-> 12    9->205   10-> 15   11->208   12-> 18   13->211   14-> 21   15->214

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01359
  LPlusTimes                  10       0.94547
  LTimes                      10       0.82533
  Population                  10       0.16172
  Scattering                  10      20.35638
  Solve                        1      23.21753
  Source                      10       0.00101
  SweepSolver                 10       0.61617
  SweepSubdomain             160       0.35685

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.013589,0.945470,0.825329,0.161724,20.356383,23.217532,0.001011,0.616170,0.356851

Figures of Merit
================

  Throughput:         2.601395e+08 [unknowns/(second/iteration)]
  Grind time :        3.844091e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  57.91442 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_4  #
##############################################################################################################################################


Compilation Options:
  Architecture:           OpenMP
  Compiler:               /home/eoseret/aocc-compiler-5.0.0/bin/clang++
  Compiler Flags:         "-g -grecord-gcc-switches -fno-omit-frame-pointer    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 24 threads on rank 0
    0->  0    1->  1    2->  2    3->  3    4->  4    5->  5    6->  6    7->  7
    8->  8    9->  9   10-> 10   11-> 11   12-> 12   13-> 13   14-> 14   15-> 15
   16-> 16   17-> 17   18-> 18   19-> 19   20-> 20   21-> 21   22-> 22   23-> 23

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.02158
  LPlusTimes                  10       0.55113
  LTimes                      10       0.56632
  Population                  10       0.09146
  Scattering                  10      16.87966
  Solve                        1      19.13998
  Source                      10       0.00076
  SweepSolver                 10       0.74002
  SweepSubdomain             160       0.28208

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.021584,0.551130,0.566323,0.091459,16.879655,19.139984,0.000764,0.740015,0.282082

Figures of Merit
================

  Throughput:         3.155592e+08 [unknowns/(second/iteration)]
  Grind time :        3.168978e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  38.11847 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5

To display your profiling results:
##############################################################################################################################################
#    LEVEL    |     REPORT     |                                                   COMMAND                                                   #
##############################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_npsu_run_5  #
##############################################################################################################################################
