-
GPU Acceleration of Monte Carlo Tallies on Unstructured Meshes in OpenMC with PUMI-Tally
Authors:
Fuad Hasan,
Cameron W. Smith,
Mark S. Shephard,
R. Michael Churchill,
George J. Wilkie,
Paul K. Romano,
Patrick C. Shriwise,
Jacob S. Merson
Abstract:
Unstructured mesh tallies are a bottleneck in Monte Carlo neutral particle transport simulations of fusion reactors. This paper introduces the PUMI-Tally library that takes advantage of mesh adjacency information to accelerate these tallies on CPUs and GPUs. For a fixed source simulation using track-length tallies, we achieved a speed-up of 19.7X on an NVIDIA A100, and 9.2X using OpenMP on 128 thr…
▽ More
Unstructured mesh tallies are a bottleneck in Monte Carlo neutral particle transport simulations of fusion reactors. This paper introduces the PUMI-Tally library that takes advantage of mesh adjacency information to accelerate these tallies on CPUs and GPUs. For a fixed source simulation using track-length tallies, we achieved a speed-up of 19.7X on an NVIDIA A100, and 9.2X using OpenMP on 128 threads of two AMD EPYC 7763 CPUs on NERSC Perlmutter. On the Empire AI alpha system, we achieved a speed-up of 20X using an NVIDIA H100 and 96 threads of an Intel Xenon 8568Y+. Our method showed better scaling with number of particles and number of elements. Additionally, we observed a 199X reduction in the number of allocations during initialization and the first three iterations, with a similar overall memory consumption. And, our hybrid CPU/GPU method demonstrated a 6.69X improvement in the energy consumption over the current approach.
△ Less
Submitted 26 April, 2025;
originally announced April 2025.
-
Demonstration of OpenMC as a framework for atomic transport and plasma interaction
Authors:
George J. Wilkie,
Paul K. Romano,
R. Michael Churchill
Abstract:
Modern tooling is demanded for predicting the transport and reaction characteristics of atoms and molecules, especially in the context of magnetic confinement fusion. DEGAS2, among the most common and capable tools currently in use, shares many fundamental similarities with the OpenMC framework, which was primarily developed for neutron and photon transport. In this work, we demonstrate that OpenM…
▽ More
Modern tooling is demanded for predicting the transport and reaction characteristics of atoms and molecules, especially in the context of magnetic confinement fusion. DEGAS2, among the most common and capable tools currently in use, shares many fundamental similarities with the OpenMC framework, which was primarily developed for neutron and photon transport. In this work, we demonstrate that OpenMC is suitable for atomic transport calculations. The relative error between the models is small, and the performance of OpenMC is at least comparable to DEGAS2. This is the case even without taking advantage of heterogeneous computing architecture, which is only one of the several remarkable new capabilities that this demonstration heralds.
△ Less
Submitted 19 November, 2024;
originally announced November 2024.
-
Point containment algorithms for constructive solid geometry with unbounded primitives
Authors:
Paul K. Romano,
Patrick A. Myers,
Seth R. Johnson,
Aljaž Kolšek,
Patrick C. Shriwise
Abstract:
We present several algorithms for evaluating point containment in constructive solid geometry (CSG) trees with unbounded primitives. Three algorithms are presented based on postfix, prefix, and infix notations of the CSG binary expression tree. We show that prefix and infix notations enable short-circuiting logic, which reduces the number of primitives that must be checked during point containment…
▽ More
We present several algorithms for evaluating point containment in constructive solid geometry (CSG) trees with unbounded primitives. Three algorithms are presented based on postfix, prefix, and infix notations of the CSG binary expression tree. We show that prefix and infix notations enable short-circuiting logic, which reduces the number of primitives that must be checked during point containment. To evaluate the performance of the algorithms, each algorithm was implemented in the OpenMC Monte Carlo particle transport code, which relies on CSG to represent solid bodies through which subatomic particles travel. Two sets of tests were carried out. First, the execution time to generate a high-resolution rasterized image of a 2D slice of a detailed CSG model of the ITER tokamak was measured. Use of both prefix and infix notations offered significant speedup over the postfix notation that has traditionally been used in particle transport codes, with infix resulting in a 6$\times$ reduction in execution time relative to postfix. We then measured the execution time of neutron transport simulations of the same ITER model using each of the algorithms. The results and performance improvements reveal the same trends as for the rasterization test, with a 4.59$\times$ overall speedup using the infix notation relative to the original postfix notation in OpenMC.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Termination of Picard Iteration for Coupled Neutronics/Thermal-Hydraulics Simulations
Authors:
Dean Wang,
Paul K. Romano
Abstract:
In this paper, we consider the coupled N/TH problem, in which the termination criterion for the neutronics iteration adopts an adaptive tolerance with respect to the fuel temperature residual at each Picard iteration. We refer to this coupling scheme as the inexact Picard iteration method. Fourier analysis is performed to investigate how the convergence behavior of Picard iteration is influenced b…
▽ More
In this paper, we consider the coupled N/TH problem, in which the termination criterion for the neutronics iteration adopts an adaptive tolerance with respect to the fuel temperature residual at each Picard iteration. We refer to this coupling scheme as the inexact Picard iteration method. Fourier analysis is performed to investigate how the convergence behavior of Picard iteration is influenced by the inexact neutronics solution. It is found that if the convergence of the inner neutronics iteration is slow, Picard coupling may become unstable unless a tighter tolerance is used for the neutronics iteration. Nevertheless, our analysis indicates that a certain amount of over-solving is necessary for maintaining the stability of Picard iteration if the iterative solution of the subproblem is not fast enough. However, this issue has not been addressed in the previous studies.
△ Less
Submitted 28 July, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Celeritas: GPU-accelerated particle transport for detector simulation in High Energy Physics experiments
Authors:
S. C. Tognini,
P. Canal,
T. M. Evans,
G. Lima,
A. L. Lund,
S. R. Johnson,
S. Y. Jun,
V. R. Pascuzzi,
P. K. Romano
Abstract:
Within the next decade, experimental High Energy Physics (HEP) will enter a new era of scientific discovery through a set of targeted programs recommended by the Particle Physics Project Prioritization Panel (P5), including the upcoming High Luminosity Large Hadron Collider (LHC) HL-LHC upgrade and the Deep Underground Neutrino Experiment (DUNE). These efforts in the Energy and Intensity Frontiers…
▽ More
Within the next decade, experimental High Energy Physics (HEP) will enter a new era of scientific discovery through a set of targeted programs recommended by the Particle Physics Project Prioritization Panel (P5), including the upcoming High Luminosity Large Hadron Collider (LHC) HL-LHC upgrade and the Deep Underground Neutrino Experiment (DUNE). These efforts in the Energy and Intensity Frontiers will require an unprecedented amount of computational capacity on many fronts including Monte Carlo (MC) detector simulation. In order to alleviate this impending computational bottleneck, the Celeritas MC particle transport code is designed to leverage the new generation of heterogeneous computer architectures, including the exascale computing power of U.S. Department of Energy (DOE) Leadership Computing Facilities (LCFs), to model targeted HEP detector problems at the full fidelity of Geant4. This paper presents the planned roadmap for Celeritas, including its proposed code architecture, physics capabilities, and strategies for integrating it with existing and future experimental HEP computing workflows.
△ Less
Submitted 22 March, 2022; v1 submitted 16 March, 2022;
originally announced March 2022.