Skip to main content

Showing 1–3 of 3 results for author: Reggiani, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.08137  [pdf, other

    cs.AR

    Empowering Vector Architectures for ML: The CAMP Architecture for Matrix Multiplication

    Authors: Mohammadreza Esmali Nojehdeh, Hossein Mokhtarnia, Julian Pavon Rivera, Narcis Rodas Quiroga, Roger Figueras Bagué, Enrico Reggiani, Miquel Moreto, Osman Unsal, Adrian Cristal, Eduard Ayguade

    Abstract: This study presents the Cartesian Accumulative Matrix Pipeline (CAMP) architecture, a novel approach designed to enhance matrix multiplication in Vector Architectures (VAs) and Single Instruction Multiple Data (SIMD) units. CAMP improves the processing efficiency of Quantized Neural Networks (QNNs). Matrix multiplication is a cornerstone of machine learning applications, and its quantized versions… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  2. arXiv:2305.04546  [pdf, other

    cs.AR

    Flex-SFU: Accelerating DNN Activation Functions by Non-Uniform Piecewise Approximation

    Authors: Enrico Reggiani, Renzo Andri, Lukas Cavigelli

    Abstract: Modern DNN workloads increasingly rely on activation functions consisting of computationally complex operations. This poses a challenge to current accelerators optimized for convolutions and matrix-matrix multiplications. This work presents Flex-SFU, a lightweight hardware accelerator for activation functions implementing non-uniform piecewise interpolation supporting multiple data formats. Non-Un… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 6 pages, 6 figures, 3 tables

  3. Adaptable Register File Organization for Vector Processors

    Authors: Cristóbal Ramírez Lazo, Enrico Reggiani, Carlos Rojas Morales, Roger Figueras Bagué, Luis Alfonso Villa Vargas, Marco Antonio Ramírez Salinas, Mateo Valero Cortés, Osman Sabri Unsal, Adrián Cristal

    Abstract: Modern scientific applications are getting more diverse, and the vector lengths in those applications vary widely. Contemporary Vector Processors (VPs) are designed either for short vector lengths, e.g., Fujitsu A64FX with 512-bit ARM SVE vector support, or long vectors, e.g., NEC Aurora Tsubasa with 16Kbits Maximum Vector Length (MVL). Unfortunately, both approaches have drawbacks. On the one han… ▽ More

    Submitted 29 May, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022)