Skip to main content

Showing 1–2 of 2 results for author: Graening, A

.
  1. arXiv:2503.15753  [pdf, other

    cs.AR

    CATCH: a Cost Analysis Tool for Co-optimization of chiplet-based Heterogeneous systems

    Authors: Alexander Graening, Jonti Talukdar, Saptadeep Pal, Krishnendu Chakrabarty, Puneet Gupta

    Abstract: With the increasing prevalence of chiplet systems in high-performance computing applications, the number of design options has increased dramatically. Instead of chips defaulting to a single die design, now there are options for 2.5D and 3D stacking along with a plethora of choices regarding configurations and processes. For chiplet-based designs, high-impact decisions such as those regarding the… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 13 pages, 21 figures

  2. arXiv:2103.01308  [pdf, other

    cs.LG

    SWIS -- Shared Weight bIt Sparsity for Efficient Neural Network Acceleration

    Authors: Shurui Li, Wojciech Romaszkan, Alexander Graening, Puneet Gupta

    Abstract: Quantization is spearheading the increase in performance and efficiency of neural network computing systems making headway into commodity hardware. We present SWIS - Shared Weight bIt Sparsity, a quantization framework for efficient neural network inference acceleration delivering improved performance and storage compression through an offline weight decomposition and scheduling algorithm. SWIS ca… ▽ More

    Submitted 2 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: 8 pages, 6 figures, accepted as a full-length paper at the 2021 TinyML Research Symposium (https://openreview.net/group?id=tinyml.org/tinyML/2021/Research_Symposium)