Skip to main content

Showing 1–8 of 8 results for author: Clark, M A

Searching in archive physics. Search in all archives.
.
  1. arXiv:2502.00152  [pdf, other

    hep-lat physics.comp-ph

    Improving HISQ propagator solves using deflation

    Authors: Leon Hostetler, M. A. Clark, Carleton DeTar, Steven Gottlieb, Evan Weinberg

    Abstract: Typically, the conjugate gradient (CG) algorithm employs mixed precision and even-odd preconditioning to compute propagators for highly improved staggered quarks (HISQ). This approach suffers from critical slowing down as the light quark mass is decreased to its physical value. Multigrid is one alternative to combat critical slowing down; however, it involves setup costs that are not always easy t… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

    Comments: 10 pages, 3 Figures; Talk presented at the 41st International Symposium on Lattice Field theory (LATTICE2024), July 28th - August 3rd, 2024, Liverpool, UK

    Journal ref: PoS(LATTICE2024)046

  2. arXiv:1810.01609  [pdf, other

    hep-lat cs.DC nucl-th physics.comp-ph

    Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing

    Authors: Evan Berkowitz, M. A. Clark, Arjun Gambhir, Ken McElvain, Amy Nicholson, Enrico Rinaldi, Pavlos Vranas, André Walker-Loud, Chia Cheng Chang, Bálint Joó, Thorsten Kurth, Kostas Orginos

    Abstract: The fundamental particle theory called Quantum Chromodynamics (QCD) dictates everything about protons and neutrons, from their intrinsic properties to interactions that bind them into atomic nuclei. Quantities that cannot be fully resolved through experiment, such as the neutron lifetime (whose precise value is important for the existence of light-atomic elements that make the sun shine and life p… ▽ More

    Submitted 10 October, 2018; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: 2018 Gordon Bell Finalist: 9 pages, 9 figures; v2: fixed 2 typos and appended acknowledgements

    Report number: LLNL-JRNL-749850, RIKEN-iTHEMS-Report-18 ACM Class: C.1.4; D.1.3

    Journal ref: Supercomputing 2018, pp. 697-705

  3. arXiv:1710.09745  [pdf, other

    hep-lat physics.comp-ph

    Pushing Memory Bandwidth Limitations Through Efficient Implementations of Block-Krylov Space Solvers on GPUs

    Authors: M. A. Clark, Alexei Strelchenko, Alejandro Vaquero, Mathias Wagner, Evan Weinberg

    Abstract: Lattice quantum chromodynamics simulations in nuclear physics have benefited from a tremendous number of algorithmic advances such as multigrid and eigenvector deflation. These improve the time to solution but do not alleviate the intrinsic memory-bandwidth constraints of the matrix-vector operation dominating iterative solvers. Batching this operation for multiple vectors and exploiting cache and… ▽ More

    Submitted 7 August, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: 15 pages, 14 figures, in press

    Report number: FERMILAB-PUB-17-592-CD

    Journal ref: Comp. Phys. Comm. 2018

  4. arXiv:1612.07873  [pdf, other

    hep-lat physics.comp-ph

    Accelerating Lattice QCD Multigrid on GPUs Using Fine-Grained Parallelization

    Authors: M. A. Clark, Bálint Joó, Alexei Strelchenko, Michael Cheng, Arjun Gambhir, Richard Brower

    Abstract: The past decade has witnessed a dramatic acceleration of lattice quantum chromodynamics calculations in nuclear and particle physics. This has been due to both significant progress in accelerating the iterative linear solvers using multi-grid algorithms, and due to the throughput improvements brought by GPUs. Deploying hierarchical algorithms optimally on GPUs is non-trivial owing to the lack of p… ▽ More

    Submitted 22 December, 2016; originally announced December 2016.

    Comments: http://dl.acm.org/citation.cfm?id=3014904.3014995}

    Journal ref: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '16), Article 68 (November, 2016)

  5. arXiv:1408.5925  [pdf, other

    hep-lat cs.MS physics.comp-ph

    A Framework for Lattice QCD Calculations on GPUs

    Authors: F. T. Winter, M. A. Clark, R. G. Edwards, B. Joó

    Abstract: Computing platforms equipped with accelerators like GPUs have proven to provide great computational power. However, exploiting such platforms for existing scientific applications is not a trivial task. Current GPU programming frameworks such as CUDA C/C++ require low-level programming from the developer in order to achieve high performance code. As a result porting of applications to GPUs is typic… ▽ More

    Submitted 25 August, 2014; originally announced August 2014.

    Comments: 10 pages, 6 figures, as published in the proceedings of IPDPS '14

  6. arXiv:1109.2935  [pdf, other

    hep-lat physics.comp-ph

    Scaling Lattice QCD beyond 100 GPUs

    Authors: R. Babich, M. A. Clark, B. Joó, G. Shi, R. C. Brower, S. Gottlieb

    Abstract: Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations in nuclear and particle physics. While GPUs have been applied with great success to the post-Monte Carlo "analysis" phase which accounts for a substantial fraction of the workload in a typical LQCD calculation, the initial Monte Carlo "gauge… ▽ More

    Submitted 13 September, 2011; originally announced September 2011.

    Comments: 11 pages, 10 figures, to appear in the proceedings of the 2011 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11)

  7. arXiv:1011.0024  [pdf, other

    hep-lat physics.comp-ph

    Parallelizing the QUDA Library for Multi-GPU Calculations in Lattice Quantum Chromodynamics

    Authors: Ronald Babich, Michael A. Clark, Bálint Joó

    Abstract: Graphics Processing Units (GPUs) are having a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations of importance in nuclear and particle physics. The QUDA library provides a package of mixed precision sparse matrix linear solvers for LQCD applications, supporting single GPUs based on NVIDIA's Compute Unified Device Architecture (CUDA). This library, interfaced to… ▽ More

    Submitted 29 October, 2010; originally announced November 2010.

    Comments: 11 pages, 7 figures, to appear in the Proceedings of Supercomputing 2010 (submitted April 12, 2010)

  8. arXiv:1003.5575  [pdf, other

    astro-ph.IM physics.comp-ph

    Enabling a High Throughput Real Time Data Pipeline for a Large Radio Telescope Array with GPUs

    Authors: R. G. Edgar, M. A. Clark, K. Dale, D. A. Mitchell, S. M. Ord, R. B. Wayth, H. Pfister, L. J. Greenhill

    Abstract: The Murchison Widefield Array (MWA) is a next-generation radio telescope currently under construction in the remote Western Australia Outback. Raw data will be generated continuously at 5GiB/s, grouped into 8s cadences. This high throughput motivates the development of on-site, real time processing and reduction in preference to archiving, transport and off-line processing. Each batch of 8s data m… ▽ More

    Submitted 14 June, 2010; v1 submitted 29 March, 2010; originally announced March 2010.

    Comments: Version accepted by Comp. Phys. Comm