Skip to main content

Showing 1–4 of 4 results for author: Siracusa, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.09870  [pdf, other

    cs.AR cs.LG cs.PL

    Ember: A Compiler for Efficient Embedding Operations on Decoupled Access-Execute Architectures

    Authors: Marco Siracusa, Olivia Hsu, Victor Soria-Pardos, Joshua Randall, Arnaud Grasset, Eric Biscondi, Doug Joseph, Randy Allen, Fredrik Kjolstad, Miquel Moretó Planas, Adrià Armejach

    Abstract: Irregular embedding lookups are a critical bottleneck in recommender models, sparse large language models, and graph learning models. In this paper, we first demonstrate that, by offloading these lookups to specialized access units, Decoupled Access-Execute (DAE) processors achieve 2.6$\times$ higher performance and 6.4$\times$ higher performance/watt than GPUs on end-to-end models. Then, we propo… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 14 pages, 19 figures, under review

    ACM Class: C.1.2; C.1.3; D.3.4

  2. arXiv:2304.06944  [pdf, other

    cs.AR

    SpChar: Characterizing the Sparse Puzzle via Decision Trees

    Authors: Francesco Sgherzi, Marco Siracusa, Ivan Fernandez, Adrià Armejach, Miquel Moretó

    Abstract: Sparse matrix computation is crucial in various modern applications, including large-scale graph analytics, deep learning, and recommender systems. The performance of sparse kernels varies greatly depending on the structure of the input matrix, making it difficult to gain a comprehensive understanding of sparse computation and its relationship to inputs, algorithms, and target machine architecture… ▽ More

    Submitted 30 July, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 27 pages, 17 figures

    ACM Class: B.8.2

  3. arXiv:2103.10040  [pdf, other

    cs.AR

    Solving Large Top-K Graph Eigenproblems with a Memory and Compute-optimized FPGA Design

    Authors: Francesco Sgherzi, Alberto Parravicini, Marco Siracusa, Marco Domenico Santambrogio

    Abstract: Large-scale eigenvalue computations on sparse matrices are a key component of graph analytics techniques based on spectral methods. In such applications, an exhaustive computation of all eigenvalues and eigenvectors is impractical and unnecessary, as spectral methods can retrieve the relevant properties of enormous graphs using just the eigenvectors associated with the Top-K largest eigenvalues.… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  4. arXiv:2103.04808  [pdf, other

    cs.AR cs.IR

    Scaling up HBM Efficiency of Top-K SpMV for Approximate Embedding Similarity on FPGAs

    Authors: Alberto Parravicini, Luca Giuseppe Cellamare, Marco Siracusa, Marco Domenico Santambrogio

    Abstract: Top-K SpMV is a key component of similarity-search on sparse embeddings. This sparse workload does not perform well on general-purpose NUMA systems that employ traditional caching strategies. Instead, modern FPGA accelerator cards have a few tricks up their sleeve. We introduce a Top-K SpMV FPGA design that leverages reduced precision and a novel packet-wise CSR matrix compression, enabling custom… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of the 58th Design Automation Conference (DAC)