Showing 1–3 of 3 results for author: Verite, M

  1. arXiv:2506.15488  [pdf, ps, other]

    cs.DC

    Minimizing Communication for Parallel Symmetric Tensor Times Same Vector Computation

    Authors: Hussam Al Daas, Grey Ballard, Laura Grigori, Suraj Kumar, Kathryn Rouse, Mathieu Vérité

    Abstract: In this article, we focus on the parallel communication cost of multiplying the same vector along two modes of a $3$-dimensional symmetric tensor. This is a key computation in the higher-order power method for determining eigenpairs of a $3$-dimensional symmetric tensor and in gradient-based methods for computing a symmetric CP decomposition. We establish communication lower bounds that determine…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 19 pages, 1 figure

  2. arXiv:2409.11304  [pdf, ps, other]

    cs.DC

    Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations

    Authors: Hussam Al Daas, Grey Ballard, Laura Grigori, Suraj Kumar, Kathryn Rouse, Mathieu Verite

    Abstract: In this article, we focus on the communication costs of three symmetric matrix computations: i) multiplying a matrix with its transpose, known as a symmetric rank-k update (SYRK); ii) adding the result of the multiplication of a matrix with the transpose of another matrix and the transpose of that result, known as a symmetric rank-2k update (SYR2K); iii) performing matrix multiplication with a symme…

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: 43 pages, 6 figures. To be published in ACM Transactions on Parallel Computing

  3. arXiv:2202.10217  [pdf, ps, other]

    cs.DC

    I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels

    Authors: Olivier Beaumont, Lionel Eyraud-Dubois, Mathieu Vérité, Julien Langou

    Abstract: In this paper, we consider two fundamental symmetric kernels in linear algebra: the Cholesky factorization and the symmetric rank-$k$ update (SYRK), with the classical three nested loops algorithms for these kernels. In addition, we consider a machine model with a fast memory of size $S$ and an unbounded slow memory. In this model, all computations must be performed on operands in fast memory, and…

    Submitted 21 February, 2022; originally announced February 2022.