Skip to main content

Showing 1–1 of 1 results for author: Carrica, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.13821  [pdf, other

    cs.MS cs.DC

    Toward Portable GPU Performance: Julia Recursive Implementation of TRMM and TRSM

    Authors: Vicki Carrica, Maxwell Onyango, Rabab Alomairy, Evelyne Ringoot, James Schloss, Alan Edelman

    Abstract: This paper presents a performant and portable recursive implementation of triangular matrix-matrix multiplication (TRMM) and triangular solve (TRSM) in Julia for GPUs, two kernels that underlie many linear-algebra algorithms. We restructure TRMM and TRSM so that most work is executed as general matrix-matrix multiplication (GEMM), improving use of the GPU memory hierarchy and reducing latency. Exp… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.