Skip to main content

Showing 1–3 of 3 results for author: Moses, W

Searching in archive math. Search in all archives.
.
  1. arXiv:2507.04647  [pdf, ps, other

    cs.DC math.NA

    RAPTOR: Practical Numerical Profiling of Scientific Applications

    Authors: Faveo Hoerold, Ivan R. Ivanov, Akash Dhruv, William S. Moses, Anshu Dubey, Mohamed Wahib, Jens Domke

    Abstract: The proliferation of low-precision units in modern high-performance architectures increasingly burdens domain scientists. Historically, the choice in HPC was easy: can we get away with 32 bit floating-point operations and lower bandwidth requirements, or is FP64 necessary? Driven by Artificial Intelligence, vendors introduced novel low-precision units for vector and tensor operations, and FP64 cap… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 12 pages, 8 figures, to be published in SC'25

  2. arXiv:2305.07546  [pdf, other

    math.NA cs.AI cs.CE

    Understanding Automatic Differentiation Pitfalls

    Authors: Jan Hückelheim, Harshitha Menon, William Moses, Bruce Christianson, Paul Hovland, Laurent Hascoët

    Abstract: Automatic differentiation, also known as backpropagation, AD, autodiff, or algorithmic differentiation, is a popular technique for computing derivatives of computer programs accurately and efficiently. Sometimes, however, the derivatives computed by AD could be interpreted as incorrect. These pitfalls occur systematically across tools and approaches. In this paper we broadly categorize problematic… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  3. arXiv:2204.01722  [pdf, other

    cs.MS cs.CE cs.DC math.NA

    Performance Portable Solid Mechanics via Matrix-Free $p$-Multigrid

    Authors: Jed Brown, Valeria Barra, Natalie Beams, Leila Ghaffari, Matthew Knepley, William Moses, Rezgar Shakeri, Karen Stengel, Jeremy L. Thompson, Junchao Zhang

    Abstract: Finite element analysis of solid mechanics is a foundational tool of modern engineering, with low-order finite element methods and assembled sparse matrices representing the industry standard for implicit analysis. We use performance models and numerical experiments to demonstrate that high-order methods greatly reduce the costs to reach engineering tolerances while enabling effective use of GPUs;… ▽ More

    Submitted 23 May, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    ACM Class: G.1.8; G.1.5; G.1.10; G.4; J.2; J.6; D.1.3