Skip to main content

Showing 1–4 of 4 results for author: Ghanathe, N P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.10692  [pdf, other

    cs.LG cs.AI cs.NE

    DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing

    Authors: Nikhil P Ghanathe, Steven J E Wilton

    Abstract: TinyML models often operate in remote, dynamic environments without cloud connectivity, making them prone to failures. Ensuring reliability in such scenarios requires not only detecting model failures but also identifying their root causes. However, transient failures, privacy concerns, and the safety-critical nature of many applications-where systems cannot be interrupted for debugging-complicate… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

    Comments: Accepted at the Machine Learning for Systems Workshop at NeurIPS 2024

  2. arXiv:2404.12599  [pdf, other

    cs.LG cs.CV

    QUTE: Quantifying Uncertainty in TinyML with Early-exit-assisted ensembles for model-monitoring

    Authors: Nikhil P Ghanathe, Steven J E Wilton

    Abstract: Uncertainty quantification (UQ) provides a resource-efficient solution for on-device monitoring of tinyML models deployed without access to true labels. However, existing UQ methods impose significant memory and compute demands, making them impractical for ultra-low-power, KB-sized TinyML devices. Prior work has attempted to reduce overhead by using early-exit ensembles to quantify uncertainty in… ▽ More

    Submitted 16 November, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2207.06613  [pdf, other

    cs.LG cs.CV eess.IV

    T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit

    Authors: Nikhil P Ghanathe, Steve Wilton

    Abstract: Deploying Machine learning (ML) on milliwatt-scale edge devices (tinyML) is gaining popularity due to recent breakthroughs in ML and Internet of Things (IoT). Most tinyML research focuses on model compression techniques that trade accuracy (and model capacity) for compact models to fit into the KB-sized tiny-edge devices. In this paper, we show how such models can be enhanced by the addition of an… ▽ More

    Submitted 26 April, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at 20th ACM International Conference on Computing Frontiers

  4. arXiv:2107.03653  [pdf, ps, other

    cs.AR cs.DC cs.LG cs.PL

    MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

    Authors: Nikhil Pratap Ghanathe, Vivek Seshadri, Rahul Sharma, Steve Wilton, Aayan Kumar

    Abstract: Recent breakthroughs in ML have produced new classes of models that allow ML inference to run directly on milliwatt-powered IoT devices. On one hand, existing ML-to-FPGA compilers are designed for deep neural-networks on large FPGAs. On the other hand, general-purpose HLS tools fail to exploit properties specific to ML inference, thereby resulting in suboptimal performance. We propose MAFIA, a too… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Accepted at The International Conference on Field-Programmable Logic and Applications (FPL), 2021