Skip to main content

Showing 1–8 of 8 results for author: Valeev, E F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12501  [pdf, ps, other

    physics.chem-ph cs.MS physics.comp-ph

    Efficient vectorized evaluation of Gaussian AO integrals on modern central processing units

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: We report an implementation of the McMurchie-Davidson evaluation scheme for 1- and 2-particle Gaussian AO integrals designed for efficient execution on modern central processing units (CPUs) with Single Instruction Multiple Data (SIMD) instruction sets. Like in our recent MD implementation for graphical processing units (GPUs) [J. Chem. Phys. 160, 244109 (2024)], variable-sized batches of shellset… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2405.01834  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.DC physics.chem-ph

    3-center and 4-center 2-particle Gaussian AO integrals on modern accelerated processors

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: We report an implementation of the McMurchie-Davidson (MD) algorithm for 3-center and 4-center 2-particle integrals over Gaussian atomic orbitals (AOs) with low and high angular momenta $l$ and varying degrees of contraction for graphical processing units (GPUs). This work builds upon our recent implementation of a matrix form of the MD algorithm that is efficient for GPU evaluation of 4-center 2-… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2401.04836  [pdf, other

    cs.PL cs.DC cs.PF

    CoNST: Code Generator for Sparse Tensor Networks

    Authors: Saurabh Raje, Yufan Xu, Atanas Rountev, Edward F. Valeev, Saday Sadayappan

    Abstract: Sparse tensor networks are commonly used to represent contractions over sparse tensors. Tensor contractions are higher-order analogs of matrix multiplication. Tensor networks arise commonly in many domains of scientific computing and data science. After a transformation into a tree of binary contractions, the network is implemented as a sequence of individual contractions. Several critical aspects… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  4. arXiv:2307.03452  [pdf, ps, other

    physics.comp-ph cs.CE physics.chem-ph

    High-performance evaluation of high angular momentum 4-center Gaussian integrals on modern accelerated processors

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: We present a high-performance evaluation method for 4-center 2-particle integrals over Gaussian atomic orbitals with high angular momenta ($l\geq4$) and arbitrary contraction degrees on graphical processing units (GPUs) and other accelerators. The implementation uses the matrix form of McMurchie-Davidson recurrences. Evaluation of the 4-center integrals over four $l=6$ ($i$) Gaussian AOs in the do… ▽ More

    Submitted 19 December, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 23 pages

  5. arXiv:2210.03192  [pdf, other

    physics.comp-ph cs.MS physics.chem-ph

    Memory-Efficient Recursive Evaluation of 3-Center Gaussian Integrals

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: To improve the efficiency of Gaussian integral evaluation on modern accelerated architectures FLOP-efficient Obara-Saika-based recursive evaluation schemes are optimized for the memory footprint. For the 3-center 2-particle integrals that are key for the evaluation of Coulomb and other 2-particle interactions in the density-fitting approximation the use of multi-quantal recurrences (in which multi… ▽ More

    Submitted 16 January, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 37 pages, 2 figures, 6 tables

  6. Scalable Task-Based Algorithm for Multiplication of Block-Rank-Sparse Matrices

    Authors: Justus A. Calvin, Cannada A. Lewis, Edward F. Valeev

    Abstract: A task-based formulation of Scalable Universal Matrix Multiplication Algorithm (SUMMA), a popular algorithm for matrix multiplication (MM), is applied to the multiplication of hierarchy-free, rank-structured matrices that appear in the domain of quantum chemistry (QC). The novel features of our formulation are: (1) concurrent scheduling of multiple SUMMA iterations, and (2) fine-grained task-based… ▽ More

    Submitted 9 October, 2015; v1 submitted 1 September, 2015; originally announced September 2015.

    Comments: 8 pages, 6 figures, accepted to IA3 2015. arXiv admin note: text overlap with arXiv:1504.05046

  7. arXiv:1507.01888  [pdf, ps, other

    cs.MS cs.CE math.NA

    MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation

    Authors: Robert J. Harrison, Gregory Beylkin, Florian A. Bischoff, Justus A. Calvin, George I. Fann, Jacob Fosso-Tande, Diego Galindo, Jeff R. Hammond, Rebecca Hartman-Baker, Judith C. Hill, Jun Jia, Jakob S. Kottmann, M-J. Yvonne Ou, Laura E. Ratcliff, Matthew G. Reuter, Adam C. Richie-Halford, Nichols A. Romero, Hideo Sekino, William A. Shelton, Bryan E. Sundahl, W. Scott Thornton, Edward F. Valeev, Álvaro Vázquez-Mayagoitia, Nicholas Vence, Yukina Yokoi

    Abstract: MADNESS (multiresolution adaptive numerical environment for scientific simulation) is a high-level software environment for solving integral and differential equations in many dimensions that uses adaptive and fast harmonic analysis methods with guaranteed precision based on multiresolution analysis and separated representations. Underpinning the numerical capabilities is a powerful petascale para… ▽ More

    Submitted 5 July, 2015; originally announced July 2015.

    Journal ref: SIAM SISC 38, S123-S142 (2016)

  8. arXiv:1504.05046  [pdf, other

    cs.DC

    Task-Based Algorithm for Matrix Multiplication: A Step Towards Block-Sparse Tensor Computing

    Authors: Justus A. Calvin, Edward F. Valeev

    Abstract: Distributed-memory matrix multiplication (MM) is a key element of algorithms in many domains (machine learning, quantum physics). Conventional algorithms for dense MM rely on regular/uniform data decomposition to ensure load balance. These traits conflict with the irregular structure (block-sparse or rank-sparse within blocks) that is increasingly relevant for fast methods in quantum physics. To d… ▽ More

    Submitted 20 April, 2015; originally announced April 2015.

    Comments: submitted to SC15 (9 pages, 8 figures)