Skip to main content

Showing 1–5 of 5 results for author: Mohror, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.11141  [pdf, other

    cs.CE

    Kilometer-Scale E3SM Land Model Simulation over North America

    Authors: Dali Wang, Chen Wang, Qinglei Cao, Peter Schwartz, Fengming Yuan, Jayesh Krishna, Danqing Wu, Danial Ricciuto, Peter Thornton, Shih-Chieh Kao, Michele Thornton, Kathryn Mohror

    Abstract: The development of a kilometer-scale E3SM Land Model (km-scale ELM) is an integral part of the E3SM project, which seeks to advance energy-related Earth system science research with state-of-the-art modeling and simulation capabilities on exascale computing systems. Through the utilization of high-fidelity data products, such as atmospheric forcing and soil properties, the km-scale ELM plays a cri… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

  2. arXiv:2501.04654  [pdf, other

    cs.DC cs.PF

    Recorder: Comprehensive Parallel I/O Tracing and Analysis

    Authors: Chen Wang, Izzet Yildirim, Hariharan Devarajan, Kathryn Mohror, Marc Snir

    Abstract: This paper presents Recorder, a parallel I/O tracing tool designed to capture comprehensive I/O information on HPC applications. Recorder traces I/O calls across various I/O layers, storing all function parameters for each captured call. The volume of stored information scales linearly the application's execution scale. To address this, we present a sophisticated pattern-recognition-based compress… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 29 pages. Under Review. Submitted to the Journal of Supercomputing

  3. Formal Definitions and Performance Comparison of Consistency Models for Parallel File Systems

    Authors: Chen Wang, Kathryn Mohror, Marc Snir

    Abstract: The semantics of HPC storage systems are defined by the consistency models to which they abide. Storage consistency models have been less studied than their counterparts in memory systems, with the exception of the POSIX standard and its strict consistency model. The use of POSIX consistency imposes a performance penalty that becomes more significant as the scale of parallel file systems increases… ▽ More

    Submitted 26 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 15 pages. Submitted to IEEE TPDS

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, 2024, Volume 35, Issue 6, Pages 937-951

  4. arXiv:2312.06131  [pdf, other

    cs.DC

    ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems

    Authors: Yiheng Xu, Pranav Sivaraman, Hariharan Devarajan, Kathryn Mohror, Abhinav Bhatele

    Abstract: Parallel applications can spend a significant amount of time performing I/O on large-scale supercomputers. Fast near-compute storage accelerators called burst buffers can reduce the time a processor spends performing I/O and mitigate I/O bottlenecks. However, determining if a given application could be accelerated using burst buffers is not straightforward even for storage experts. The relationshi… ▽ More

    Submitted 11 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  5. arXiv:2103.02131  [pdf, ps, other

    cs.DC

    VELOC: VEry Low Overhead Checkpointing in the Age of Exascale

    Authors: Bogdan Nicolae, Adam Moody, Gregory Kosinovsky, Kathryn Mohror, Franck Cappello

    Abstract: Checkpointing large amounts of related data concurrently to stable storage is a common I/O pattern of many HPC applications. However, such a pattern frequently leads to I/O bottlenecks that lead to poor scalability and performance. As modern HPC infrastructures continue to evolve, there is a growing gap between compute capacity vs. I/O capabilities. Furthermore, the storage hierarchy is becoming i… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Journal ref: SuperCheck'21: First International Symposium on Checkpointing for Supercomputing, 2021