Skip to main content

Showing 1–4 of 4 results for author: John, C M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.10013  [pdf, ps, other

    cs.DC

    Training LLMs on HPC Systems: Best Practices from the OpenGPT-X Project

    Authors: Carolin Penke, Chelsea Maria John, Jan Ebert, Stefan Kesselheim, Andreas Herten

    Abstract: The training of large language models (LLMs) requires substantial computational resources, complex software stacks, and carefully designed workflows to achieve scalability and efficiency. This report presents best practices and insights gained from the OpenGPT-X project, a German initiative focused on developing open, multilingual LLMs optimized for European languages. We detail the use of high-pe… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    ACM Class: C.4; I.2.11; I.2.7; K.6

  2. arXiv:2409.12994  [pdf, other

    cs.AR cs.AI cs.DC cs.LG cs.PF

    Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML

    Authors: Chelsea Maria John, Stepan Nassyr, Carolin Penke, Andreas Herten

    Abstract: The rapid advancement of machine learning (ML) technologies has driven the development of specialized hardware accelerators designed to facilitate more efficient model training. This paper introduces the CARAML benchmark suite, which is employed to assess performance and energy consumption during the training of transformer-based large language models and computer vision models on a range of hardw… ▽ More

    Submitted 29 October, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: To be published in Workshop Proceedings of The International Conference for High Performance Computing Networking, Storage, and Analysis (SC-W '24) (2024)

  3. Application-Driven Exascale: The JUPITER Benchmark Suite

    Authors: Andreas Herten, Sebastian Achilles, Damian Alvarez, Jayesh Badwaik, Eric Behle, Mathis Bode, Thomas Breuer, Daniel Caviedes-Voullième, Mehdi Cherti, Adel Dabah, Salem El Sayed, Wolfgang Frings, Ana Gonzalez-Nicolas, Eric B. Gregory, Kaveh Haghighi Mood, Thorsten Hater, Jenia Jitsev, Chelsea Maria John, Jan H. Meinke, Catrin I. Meyer, Pavel Mezentsev, Jan-Oliver Mirus, Stepan Nassyr, Carolin Penke, Manoel Römmer , et al. (6 additional authors not shown)

    Abstract: Benchmarks are essential in the design of modern HPC installations, as they define key aspects of system components. Beyond synthetic workloads, it is crucial to include real applications that represent user requirements into benchmark suites, to guarantee high usability and widespread adoption of a new system. Given the significant investments in leadership-class supercomputers of the exascale er… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: To be published in Proceedings of The International Conference for High Performance Computing Networking, Storage, and Analysis (SC '24) (2024)

    ACM Class: B.8.2; C.0; C.5.1; D.1.0; C.4

    Journal ref: 2024 SC24: International Conference for High Performance Computing, Networking, Storage and Analysis SC

  4. arXiv:2403.17757  [pdf, other

    cs.CV cs.LG

    Noise2Noise Denoising of CRISM Hyperspectral Data

    Authors: Robert Platt, Rossella Arcucci, Cédric M. John

    Abstract: Hyperspectral data acquired by the Compact Reconnaissance Imaging Spectrometer for Mars (CRISM) have allowed for unparalleled mapping of the surface mineralogy of Mars. Due to sensor degradation over time, a significant portion of the recently acquired data is considered unusable. Here a new data-driven model architecture, Noise2Noise4Mars (N2N4M), is introduced to remove noise from CRISM images.… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures. Accepted as a conference paper at the ICLR 2024 ML4RS Workshop