-
Experiences Readying Applications for Exascale
Authors:
Paul T. Bauman,
Reuben D. Budiardja,
Dmytro Bykov,
Noel Chalmers,
Jacqueline Chen,
Nicholas Curtis,
Marc Day,
Markus Eisenbach,
Lucas Esclapez,
Alessandro Fanfarillo,
William Freitag,
Nicholas Frontiere,
Antigoni Georgiadou,
Joseph Glenski,
Kalyana Gottiparthi,
Marc T. Henry de Frahan,
Gustav R. Jansen,
Wayne Joubert,
Justin G. Lietz,
Jakub Kurzak,
Nicholas Malaya,
Bronson Messer,
Damon McDougall,
Paul Mullowney,
Stephen Nichols
, et al. (7 additional authors not shown)
Abstract:
The advent of exascale computing invites an assessment of existing best practices for developing application readiness on the world's largest supercomputers. This work details observations from the last four years in preparing scientific applications to run on the Oak Ridge Leadership Computing Facility's (OLCF) Frontier system. This paper addresses a range of topics in software including programm…
▽ More
The advent of exascale computing invites an assessment of existing best practices for developing application readiness on the world's largest supercomputers. This work details observations from the last four years in preparing scientific applications to run on the Oak Ridge Leadership Computing Facility's (OLCF) Frontier system. This paper addresses a range of topics in software including programmability, tuning, and portability considerations that are key to moving applications from existing systems to future installations. A set of representative workloads provides case studies for general system and software testing. We evaluate the use of early access systems for development across several generations of hardware. Finally, we discuss how best practices were identified and disseminated to the community through a wide range of activities including user-guides and trainings. We conclude with recommendations for ensuring application readiness on future leadership computing systems.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Optimizing High-Performance Linpack for Exascale Accelerated Architectures
Authors:
Noel Chalmers,
Jakub Kurzak,
Damon McDougall,
Paul T. Bauman
Abstract:
We detail the performance optimizations made in rocHPL, AMD's open-source implementation of the High-Performance Linpack (HPL) benchmark targeting accelerated node architectures designed for exascale systems such as the Frontier supercomputer. The implementation leverages the high-throughput GPU accelerators on the node via highly optimized linear algebra libraries, as well as the entire CPU socke…
▽ More
We detail the performance optimizations made in rocHPL, AMD's open-source implementation of the High-Performance Linpack (HPL) benchmark targeting accelerated node architectures designed for exascale systems such as the Frontier supercomputer. The implementation leverages the high-throughput GPU accelerators on the node via highly optimized linear algebra libraries, as well as the entire CPU socket to perform latency-sensitive factorization phases. We detail novel performance improvements such as a multi-threaded approach to computing the panel factorization phase on the CPU, time-sharing of CPU cores between processes on the node, as well as several optimizations which hide MPI communication. We present some performance results of this implementation of the HPL benchmark on a single node of the Frontier early access cluster at Oak Ridge National Laboratory, as well as scaling to multiple nodes.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
GRINS: A Multiphysics Framework Based on the libMesh Finite Element Library
Authors:
Paul T. Bauman,
Roy H. Stogner
Abstract:
The progression of scientific computing resources has enabled the numerical approximation of mathematical models describing complex physical phenomena. A significant portion of researcher time is typically dedicated to the development of software to compute the numerical solutions. This work describes a flexible C++ software framework, built on the libMesh finite element library, designed to allev…
▽ More
The progression of scientific computing resources has enabled the numerical approximation of mathematical models describing complex physical phenomena. A significant portion of researcher time is typically dedicated to the development of software to compute the numerical solutions. This work describes a flexible C++ software framework, built on the libMesh finite element library, designed to alleviate developer burden and provide easy access to modern computational algorithms, including quantity-of-interest-driven parallel adaptive mesh refinement on unstructured grids and adjoint-based sensitivities. Other software environments are highlighted and the current work motivated; in particular, the present work is an attempt to balance software infrastructure and user flexibility. The applicable class of problems and design of the software components is discussed in detail. Several examples demonstrate the effectiveness of the design, including applications that incorporate uncertainty. Current and planned developments are discussed.
△ Less
Submitted 19 June, 2015;
originally announced June 2015.