Showing 1–2 of 2 results for author: Block, C

  1. arXiv:2506.00424  [pdf, ps, other]

    cs.LG cs.AI cs.AR cs.ET

    COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning

    Authors: Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora, Charles Block, Josep Torrellas, Charith Mendis

    Abstract: Sparse tensor programs are essential in deep learning and graph analytics, driving the need for optimized processing. To meet this demand, specialized hardware accelerators are being developed. Optimizing these programs for accelerators is challenging for two reasons: program performance is highly sensitive to variations in sparse inputs, and early-stage accelerators rely on expensive simulators.…

    Submitted 14 June, 2025; v1 submitted 31 May, 2025; originally announced June 2025.

    Comments: Accepted at the 42nd International Conference on Machine Learning

  2. arXiv:2408.11988  [pdf, other]

    cs.DC

    Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication

    Authors: Isuru Ranawaka, Md Taufique Hussain, Charles Block, Gerasimos Gerogiannis, Josep Torrellas, Ariful Azad

    Abstract: We consider a sparse matrix-matrix multiplication (SpGEMM) setting where one matrix is square and the other is tall and skinny. This special variant, called TS-SpGEMM, has important applications in multi-source breadth-first search, influence maximization, sparse graph embedding, and algebraic multigrid solvers. Unfortunately, popular distributed algorithms like sparse SUMMA deliver suboptimal per…

    Submitted 21 August, 2024; originally announced August 2024.