Showing 1–6 of 6 results for author: Poeschel, F

  1. arXiv:2501.03383

    physics.comp-ph cs.DC cs.LG

    The Artificial Scientist -- in-transit Machine Learning of Plasma Simulations

    Authors: Jeffrey Kelling, Vicente Bolea, Michael Bussmann, Ankush Checkervarty, Alexander Debus, Jan Ebert, Greg Eisenhauer, Vineeth Gutta, Stefan Kesselheim, Scott Klasky, Richard Pausch, Norbert Podhorszki, Franz Poschel, David Rogers, Jeyhun Rustamov, Steve Schmerler, Ulrich Schramm, Klaus Steiniger, Rene Widera, Anna Willmann, Sunita Chandrasekaran

    Abstract: Increasing HPC cluster sizes and large-scale simulations that produce petabytes of data per run create massive IO and storage challenges for analysis. Deep learning-based techniques, in particular, make use of these amounts of domain data to extract patterns that help build scientific understanding. Here, we demonstrate a streaming workflow in which simulation data is streamed directly to a machi…

    Submitted 15 January, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

    Comments: 12 pages, 9 figures

  2. arXiv:2410.00178

    cs.PF

    Streaming Data in HPC Workflows Using ADIOS

    Authors: Greg Eisenhauer, Norbert Podhorszki, Ana Gainaru, Scott Klasky, Philip E. Davis, Manish Parashar, Matthew Wolf, Eric Suchtya, Erick Fredj, Vicente Bolea, Franz Pöschel, Klaus Steiniger, Michael Bussmann, Richard Pausch, Sunita Chandrasekaran

    Abstract: The "IO Wall" problem, in which the gap between computation rate and data access rate grows continuously, poses significant problems to scientific workflows which have traditionally relied upon using the filesystem for intermediate storage between workflow stages. One way to avoid this problem in scientific workflows is to stream data directly from producers to consumers and avoiding storage entir…

    Submitted 30 September, 2024; originally announced October 2024.

  3. arXiv:2408.02869

    cs.DC cs.PF physics.plasm-ph

    Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring

    Authors: Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis

    Abstract: Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information. Such complex simulations lacking parallel I/O capabilities may encounter performance bottlenecks, hindering their effectiveness in data-intensive computing tasks. In this work, we focus on introducing and enh…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE Cluster workshop 2024 (REX-IO 2024); prepared in the standardized IEEE conference format; 10 pages, including the main text, references, and figures

  4. arXiv:2211.04770

    cs.LG cs.AI hep-th

    Continual learning autoencoder training for a particle-in-cell simulation via streaming

    Authors: Patrick Stiller, Varun Makdani, Franz Pöschel, Richard Pausch, Alexander Debus, Michael Bussmann, Nico Hoffmann

    Abstract: The upcoming exascale era will provide a new generation of physics simulations. These simulations will have a high spatiotemporal resolution, which will impact the training of machine learning models since storing a high amount of simulation data on disk is nearly impossible. Therefore, we need to rethink the training of machine learning models for simulations for the upcoming exascale era. This w…

    Submitted 9 November, 2022; originally announced November 2022.

  5. Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization

    Authors: Lipeng Wan, Axel Huebl, Junmin Gu, Franz Poeschel, Ana Gainaru, Ruonan Wang, Jieyang Chen, Xin Liang, Dmitry Ganyushin, Todd Munson, Ian Foster, Jean-Luc Vay, Norbert Podhorszki, Kesheng Wu, Scott Klasky

    Abstract: The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Y…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: 12 pages, 15 figures, accepted by IEEE Transactions on Parallel and Distributed Systems

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, 2021

  6. Transitioning from file-based HPC workflows to streaming data pipelines with openPMD and ADIOS2

    Authors: Franz Poeschel, Juncheng E, William F. Godoy, Norbert Podhorszki, Scott Klasky, Greg Eisenhauer, Philip E. Davis, Lipeng Wan, Ana Gainaru, Junmin Gu, Fabian Koller, René Widera, Michael Bussmann, Axel Huebl

    Abstract: This paper aims to create a transition path from file-based IO to streaming-based workflows for scientific applications in an HPC environment. By using the openPMD-api, traditional workflows limited by filesystem bottlenecks can be overcome and flexibly extended for in situ analysis. The openPMD-api is a library for the description of scientific data according to the Open Standard for Particle-Mes…

    Submitted 19 January, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 18 pages, 9 figures, SMC2021, supplementary material at https://zenodo.org/record/4906276