Showing 1–15 of 15 results for author: Schiavone, P D

Searching in archive cs.

  1. arXiv:2505.08421  [pdf, ps, other]

    cs.AR

    e-GPU: An Open-Source and Configurable RISC-V Graphic Processing Unit for TinyAI Applications

    Authors: Simone Machetti, Pasquale Davide Schiavone, Lara Orlandic, Darong Huang, Deniz Kasap, Giovanni Ansaloni, David Atienza

    Abstract: Graphics processing units (GPUs) excel at parallel processing, but remain largely unexplored in ultra-low-power edge devices (TinyAI) due to their power and area limitations, as well as the lack of suitable programming frameworks. To address these challenges, this work introduces embedded GPU (e-GPU), an open-source and configurable RISC-V GPU platform designed for TinyAI devices. Its extensive co…

    Submitted 5 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

  2. Quadrilatero: A RISC-V programmable matrix coprocessor for low-power edge applications

    Authors: Danilo Cammarata, Matteo Perotti, Marco Bertuletti, Angelo Garofalo, Pasquale Davide Schiavone, David Atienza, Luca Benini

    Abstract: The rapid growth of AI-based Internet-of-Things applications increased the demand for high-performance edge processing engines on a low-power budget and tight area constraints. As a consequence, vector processor architectures, traditionally designed for high-performance computing (HPC), made their way into edge devices, promising high utilization of floating-point units (FPUs) and low power consum…

    Submitted 10 April, 2025; originally announced April 2025.

  3. arXiv:2504.07555  [pdf, other]

    cs.AR

    Just TestIt! An SBST Approach To Automate System-Integration Testing

    Authors: Tommaso Terzano, Luigi Giuffrida, Juan Sapriza, Pasquale Davide Schiavone, Guido Masera, David Atienza, Luciano Lavagno, Maurizio Martina

    Abstract: This paper introduces TestIt, an open-source Python package designed to automate full-system integration testing using a Software-Based Self-Test (SBST) approach. By dynamically generating test vectors and golden references, TestIt significantly reduces development time and complexity while supporting both simulation and FPGA environments. Its flexible design positions TestIt as a key enabler for…

    Submitted 10 April, 2025; originally announced April 2025.

  4. arXiv:2504.02533  [pdf, other]

    cs.AR

    ARCANE: Adaptive RISC-V Cache Architecture for Near-memory Extensions

    Authors: Vincenzo Petrolo, Flavia Guella, Michele Caon, Pasquale Davide Schiavone, Guido Masera, Maurizio Martina

    Abstract: Modern data-driven applications expose limitations of von Neumann architectures - extensive data movement, low throughput, and poor energy efficiency. Accelerators improve performance but lack flexibility and require data transfers. Existing compute in- and near-memory solutions mitigate these issues but face usability challenges due to data placement constraints. We propose a novel cache architec…

    Submitted 7 April, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: 6 pages, 4 figures, accepted at the Design Automation Conference (DAC) 2025

  5. arXiv:2501.18253  [pdf, other]

    cs.AR

    Increasing the Energy-Efficiency of Wearables Using Low-Precision Posit Arithmetic with PHEE

    Authors: David Mallasén, Pasquale Davide Schiavone, Alberto A. Del Barrio, Manuel Prieto-Matias, David Atienza

    Abstract: Wearable biomedical devices are increasingly being used for continuous patient health monitoring, enabling real-time insights and extended data collection without the need for prolonged hospital stays. These devices must be energy efficient to minimize battery size, improve comfort, and reduce recharging intervals. This paper investigates the use of specialized low-precision arithmetic formats to…

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 9 pages. Code available at https://github.com/esl-epfl/PHEE

  6. arXiv:2408.01988  [pdf, other]

    cs.LG cs.AI cs.AR

    MetaWearS: A Shortcut in Wearable Systems Lifecycle with Only a Few Shots

    Authors: Alireza Amirshahi, Maedeh H. Toosi, Siamak Mohammadi, Stefano Albini, Pasquale Davide Schiavone, Giovanni Ansaloni, Amir Aminifar, David Atienza

    Abstract: Wearable systems provide continuous health monitoring and can lead to early detection of potential health issues. However, the lifecycle of wearable systems faces several challenges. First, effective model training for new wearable devices requires substantial labeled data from various subjects collected directly by the wearable. Second, subsequent model updates require further extensive labeled d…

    Submitted 4 August, 2024; originally announced August 2024.

  7. Scalable and RISC-V Programmable Near-Memory Computing Architectures for Edge Nodes

    Authors: Michele Caon, Clément Choné, Pasquale Davide Schiavone, Alexandre Levisse, Guido Masera, Maurizio Martina, David Atienza

    Abstract: The widespread adoption of data-centric algorithms, particularly Artificial Intelligence (AI) and Machine Learning (ML), has exposed the limitations of centralized processing infrastructures, driving a shift towards edge computing. This necessitates stringent constraints on energy efficiency, which traditional von Neumann architectures struggle to meet. The Compute-In-Memory (CIM) paradigm has eme…

    Submitted 27 March, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 13 figures, accepted in IEEE Transactions on Emerging Topics in Computing

  8. arXiv:2406.03886  [pdf, other]

    cs.LG eess.SP

    BiomedBench: A benchmark suite of TinyML biomedical applications for low-power wearables

    Authors: Dimitrios Samakovlis, Stefano Albini, Rubén Rodríguez Álvarez, Denisa-Andreea Constantinescu, Pasquale Davide Schiavone, Miguel Peón Quirós, David Atienza

    Abstract: The design of low-power wearables for the biomedical domain has received a lot of attention in recent decades, as technological advances in chip manufacturing have allowed real-time monitoring of patients using low-complexity ML within the mW range. Despite advances in application and hardware design research, the domain lacks a systematic approach to hardware evaluation. In this work, we propose…

    Submitted 11 October, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures. Accepted for publication to Design & Test Special Issue TinyML

  9. arXiv:2404.12503  [pdf, other]

    cs.AR

    STRELA: STReaming ELAstic CGRA Accelerator for Embedded Systems

    Authors: Daniel Vazquez, Jose Miranda, Alfonso Rodriguez, Andres Otero, Pasquale Davide Schiavone, David Atienza

    Abstract: Reconfigurable computing offers a good balance between flexibility and energy efficiency. When combined with software-programmable devices such as CPUs, it is possible to obtain higher performance by spatially distributing the parallelizable sections of an application throughout the reconfigurable device while the CPU is in charge of control-intensive sections. This work introduces an elastic Coar…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  10. arXiv:2401.05548  [pdf, other]

    cs.AR

    X-HEEP: An Open-Source, Configurable and Extendible RISC-V Microcontroller for the Exploration of Ultra-Low-Power Edge Accelerators

    Authors: Simone Machetti, Pasquale Davide Schiavone, Thomas Christoph Müller, Miguel Peón-Quirós, David Atienza

    Abstract: The field of edge computing has witnessed remarkable growth owing to the increasing demand for real-time processing of data in applications. However, challenges persist due to limitations in performance and power consumption. To overcome these challenges, heterogeneous architectures have emerged that combine host processors with specialized accelerators tailored to specific applications, leading t…

    Submitted 8 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  11. arXiv:2007.08952  [pdf, other]

    cs.AR cs.LG eess.SP

    Always-On 674uW @ 4GOP/s Error Resilient Binary Neural Networks with Aggressive SRAM Voltage Scaling on a 22nm IoT End-Node

    Authors: Alfio Di Mauro, Francesco Conti, Pasquale Davide Schiavone, Davide Rossi, Luca Benini

    Abstract: Binary Neural Networks (BNNs) have been shown to be robust to random bit-level noise, making aggressive voltage scaling attractive as a power-saving technique for both logic and SRAMs. In this work, we introduce the first fully programmable IoT end-node system-on-chip (SoC) capable of executing software-defined, hardware-accelerated BNNs at ultra-low voltage. Our SoC exploits a hybrid memory schem…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Submitted to ISICAS2020 journal special issue

  12. arXiv:2006.14256  [pdf, other]

    cs.AR

    Arnold: an eFPGA-Augmented RISC-V SoC for Flexible and Low-Power IoT End-Nodes

    Authors: Pasquale Davide Schiavone, Davide Rossi, Alfio Di Mauro, Frank Gurkaynak, Timothy Saxe, Mao Wang, Ket Chong Yap, Luca Benini

    Abstract: A wide range of Internet of Things (IoT) applications require powerful, energy-efficient and flexible end-nodes to acquire data from multiple sources, process and distill the sensed data through near-sensor data analytics algorithms, and transmit it wirelessly. This work presents Arnold: a 0.5 V to 0.8 V, 46.83 uW/MHz, 600 MOPS fully programmable RISC-V Microcontroller unit (MCU) fabricated in 22…

    Submitted 25 June, 2020; originally announced June 2020.

  13. XNOR Neural Engine: a Hardware Accelerator IP for 21.6 fJ/op Binary Neural Network Inference

    Authors: Francesco Conti, Pasquale Davide Schiavone, Luca Benini

    Abstract: Binary Neural Networks (BNNs) are promising to deliver accuracy comparable to conventional deep neural networks at a fraction of the cost in terms of memory and energy. In this paper, we introduce the XNOR Neural Engine (XNE), a fully digital configurable hardware accelerator IP for BNNs, integrated within a microcontroller unit (MCU) equipped with an autonomous I/O subsystem and hybrid SRAM / sta…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: 11 pages, 8 figures, 2 tables, 3 listings. Accepted for presentation at CODES'18 and for publication in IEEE Transactions on Computer-Aided Design of Circuits and Systems (TCAD) as part of the ESWEEK-TCAD special issue

  14. arXiv:1612.05974  [pdf, other]

    cs.AR cs.CR cs.LG cs.NE

    An IoT Endpoint System-on-Chip for Secure and Energy-Efficient Near-Sensor Analytics

    Authors: Francesco Conti, Robert Schilling, Pasquale Davide Schiavone, Antonio Pullini, Davide Rossi, Frank Kagan Gürkaynak, Michael Muehlberghuber, Michael Gautschi, Igor Loi, Germain Haugou, Stefan Mangard, Luca Benini

    Abstract: Near-sensor data analytics is a promising direction for IoT endpoints, as it minimizes energy spent on communication and reduces network load - but it also poses security concerns, as valuable data is stored or sent over the network at various stages of the analytics pipeline. Using encryption to protect sensitive data at the boundary of the on-chip analytics engine is a way to address data securi…

    Submitted 23 April, 2017; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: 15 pages, 12 figures, accepted for publication to the IEEE Transactions on Circuits and Systems - I: Regular Papers

  15. arXiv:1608.08376  [pdf, other]

    cs.AR

    A near-threshold RISC-V core with DSP extensions for scalable IoT Endpoint Devices

    Authors: Michael Gautschi, Pasquale Davide Schiavone, Andreas Traber, Igor Loi, Antonio Pullini, Davide Rossi, Eric Flamand, Frank K. Gurkaynak, Luca Benini

    Abstract: Endpoint devices for Internet-of-Things not only need to work under extremely tight power envelope of a few milliwatts, but also need to be flexible in their computing capabilities, from a few kOPS to GOPS. Near-threshold (NT) operation can achieve higher energy efficiency, and the performance scalability can be gained through parallelism. In this paper we describe the design of an open-source RISC…

    Submitted 30 August, 2016; originally announced August 2016.