Skip to main content

Showing 1–10 of 10 results for author: Springer, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.01999  [pdf, other

    quant-ph cs.PF cs.SE

    cuQuantum SDK: A High-Performance Library for Accelerating Quantum Science

    Authors: Harun Bayraktar, Ali Charara, David Clark, Saul Cohen, Timothy Costa, Yao-Lung L. Fang, Yang Gao, Jack Guan, John Gunnels, Azzam Haidar, Andreas Hehn, Markus Hohnerbach, Matthew Jones, Tom Lubowe, Dmitry Lyakh, Shinya Morino, Paul Springer, Sam Stanwyck, Igor Terentyev, Satya Varadhan, Jonathan Wong, Takuma Yamaguchi

    Abstract: We present the NVIDIA cuQuantum SDK, a state-of-the-art library of composable primitives for GPU-accelerated quantum circuit simulations. As the size of quantum devices continues to increase, making their classical simulation progressively more difficult, the availability of fast and scalable quantum circuit simulators becomes vital for quantum algorithm developers, as well as quantum hardware eng… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: paper accepted at QCE 2023, journal reference will be updated whenever available

    MSC Class: 68Q12; 68Q09; 81P68;

  2. arXiv:2211.15460  [pdf

    cs.GR

    Fragment-History Volumes

    Authors: Francisco InĂ¡cio, Jan P. Springer

    Abstract: Hardware-based triangle rasterization is still the prevalent method for generating images at real-time interactive frame rates. With the availability of a programmable graphics pipeline a large variety of techniques are supported for evaluating lighting and material properties of fragments. However, these techniques are usually restricted to evaluating local lighting and material effects. In addit… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    ACM Class: I.3.7

  3. arXiv:1805.10121  [pdf, other

    cs.MS cs.CE cs.DS

    ChASE: Chebyshev Accelerated Subspace iteration Eigensolver for sequences of Hermitian eigenvalue problems

    Authors: Jan Winkelmann, Paul Springer, Edoardo Di Napoli

    Abstract: Solving dense Hermitian eigenproblems arranged in a sequence with direct solvers fails to take advantage of those spectral properties which are pertinent to the entire sequence, and not just to the single problem. When such features take the form of correlations between the eigenvectors of consecutive problems, as is the case in many real-world applications, the potential benefit of exploiting the… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 33 pages. Submitted to ACM TOMS

  4. arXiv:1705.06661  [pdf, other

    cs.MS cs.PF

    Spin Summations: A High-Performance Perspective

    Authors: Paul Springer, Devin Matthews, Paolo Bientinesi

    Abstract: Besides tensor contractions, one of the most pronounced computational bottlenecks in the non-orthogonally spin-adapted forms of the quantum chemistry methods CCSDT and CCSDTQ, and their approximate forms---including CCSD(T) and CCSDT(Q)---are spin summations. At a first sight, spin summations are operations similar to tensor transpositions; a closer look instead reveals additional challenges to hi… ▽ More

    Submitted 18 May, 2017; originally announced May 2017.

    ACM Class: G.4; D.1.3

  5. arXiv:1704.04374  [pdf, other

    cs.MS cs.DC cs.PF

    HPTT: A High-Performance Tensor Transposition C++ Library

    Authors: Paul Springer, Tong Su, Paolo Bientinesi

    Abstract: Recently we presented TTC, a domain-specific compiler for tensor transpositions. Despite the fact that the performance of the generated code is nearly optimal, due to its offline nature, TTC cannot be utilized in all the application codes in which the tensor sizes and the necessary tensor permutations are determined at runtime. To overcome this limitation, we introduce the open-source C++ library… ▽ More

    Submitted 10 May, 2017; v1 submitted 14 April, 2017; originally announced April 2017.

    ACM Class: G.4; D.1.3

  6. arXiv:1701.05242  [pdf, other

    physics.comp-ph cs.CE

    A Scalable, Linear-Time Dynamic Cutoff Algorithm for Molecular Dynamics

    Authors: Paul Springer, Ahmed E. Ismail, Paolo Bientinesi

    Abstract: Recent results on supercomputers show that beyond 65K cores, the efficiency of molecular dynamics simulations of interfacial systems decreases significantly. In this paper, we introduce a dynamic cutoff method (DCM) for interfacial systems of arbitrarily large size. The idea consists in adopting a cutoff-based method in which the cutoff is cho- sen on a particle-by-particle basis, according to the… ▽ More

    Submitted 18 January, 2017; originally announced January 2017.

    Comments: in ISC High Performance 2015

  7. TTC: A Tensor Transposition Compiler for Multiple Architectures

    Authors: Paul Springer, Aravind Sankaran, Paolo Bientinesi

    Abstract: We consider the problem of transposing tensors of arbitrary dimension and describe TTC, an open source domain-specific parallel compiler. TTC generates optimized parallel C++/CUDA C code that achieves a significant fraction of the system's peak memory bandwidth. TTC exhibits high performance across multiple architectures, including modern AVX-based systems (e.g.,~Intel Haswell, AMD Steamroller), I… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

    ACM Class: G.4; D.3.4; I.1.2; I.1.3

  8. arXiv:1607.00145  [pdf, other

    cs.MS cs.PF

    Design of a high-performance GEMM-like Tensor-Tensor Multiplication

    Authors: Paul Springer, Paolo Bientinesi

    Abstract: We present "GEMM-like Tensor-Tensor multiplication" (GETT), a novel approach to tensor contractions that mirrors the design of a high-performance general matrix-matrix multiplication (GEMM). The critical insight behind GETT is the identification of three index sets, involved in the tensor contraction, which enable us to systematically reduce an arbitrary tensor contraction to loops around a highly… ▽ More

    Submitted 7 November, 2017; v1 submitted 1 July, 2016; originally announced July 2016.

    ACM Class: G.4; D.3.4; I.1.2; I.1.3

  9. arXiv:1603.02297  [pdf, other

    cs.MS cs.DC cs.PF

    TTC: A high-performance Compiler for Tensor Transpositions

    Authors: Paul Springer, Jeff R. Hammond, Paolo Bientinesi

    Abstract: We present TTC, an open-source parallel compiler for multidimensional tensor transpositions. In order to generate high-performance C++ code, TTC explores a number of optimizations, including software prefetching, blocking, loop-reordering, and explicit vectorization. To evaluate the performance of multidimensional transpositions across a range of possible use-cases, we also release a benchmark cov… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

  10. arXiv:1502.03234  [pdf, other

    cs.DC cs.CE

    A Scalable, Linear-Time Dynamic Cutoff Algorithm for Molecular Simulations of Interfacial Systems

    Authors: Paul Springer

    Abstract: This master thesis introduces the idea of dynamic cutoffs in molecular dynamics simulations, based on the distance between particles and the interface, and presents a solution for detecting interfaces in real-time. Our dynamic cutoff method (DCM) exhibits a linear-time complexity as well as nearly ideal weak and strong scaling. The DCM is tailored for massively parallel architectures and for large… ▽ More

    Submitted 11 February, 2015; originally announced February 2015.