Showing 1–6 of 6 results for author: Amarnath, A

  1. arXiv:2503.11663

    cs.AR cs.AI cs.LG

    MEADOW: Memory-efficient Dataflow and Data Packing for Low Power Edge LLMs

    Authors: Abhishek Moitra, Arkapravo Ghosh, Shrey Agarwal, Aporva Amarnath, Karthik Swaminathan, Priyadarshini Panda

    Abstract: The computational and memory challenges of large language models (LLMs) have sparked several optimization approaches towards their efficient implementation. While prior LLM-targeted quantization and prior work on sparse acceleration have significantly mitigated the memory and computation bottlenecks, they do so assuming high-power platforms such as GPUs and server-class FPGAs with large off-chip…

    Submitted 14 February, 2025; originally announced March 2025.

    Comments: 12 pages, 13 figures. Accepted to The Eighth Annual Conference on Machine Learning and Systems (MLSys), 2025

  2. arXiv:2410.07364

    physics.optics cs.AI cs.DC cs.LG

    Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing

    Authors: Ismail Erbas, Aporva Amarnath, Vikas Pandey, Karthik Swaminathan, Naigang Wang, Xavier Intes

    Abstract: Fluorescence lifetime imaging (FLI) is a widely used technique in the biomedical field for measuring the decay times of fluorescent molecules, providing insights into metabolic states, protein interactions, and ligand-receptor bindings. However, its broader application in fast biological processes, such as dynamic activity monitoring, and clinical use, such as in guided surgery, is limited by long…

    Submitted 15 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 7 pages, 6 figures

  3. arXiv:2410.00948

    eess.IV cs.LG q-bio.QM

    Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging

    Authors: Ismail Erbas, Vikas Pandey, Aporva Amarnath, Naigang Wang, Karthik Swaminathan, Stefan T. Radev, Xavier Intes

    Abstract: Fluorescence lifetime imaging (FLI) is an important technique for studying cellular environments and molecular interactions, but its real-time application is limited by slow data acquisition, which requires capturing large time-resolved images and complex post-processing using iterative fitting algorithms. Deep learning (DL) models enable real-time inference, but can be computationally demanding d…

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 8 pages, 2 figures

  4. arXiv:2402.13440

    cs.AI cs.NE

    A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making

    Authors: Chitra Subramanian, Miao Liu, Naweed Khan, Jonathan Lenchner, Aporva Amarnath, Sarathkrishna Swaminathan, Ryan Riegel, Alexander Gray

    Abstract: Multi-agent reinforcement learning (MARL) is well-suited for runtime decision-making in optimizing the performance of systems where multiple agents coexist and compete for shared resources. However, applying common deep learning-based MARL solutions to real-world problems suffers from issues of interpretability, sample efficiency, partial observability, etc. To address these challenges, we present…

    Submitted 20 February, 2024; originally announced February 2024.

    ACM Class: I.2.6

  5. arXiv:2203.13396

    cs.AR cs.DC cs.OS

    HetSched: Quality-of-Mission Aware Scheduling for Autonomous Vehicle SoCs

    Authors: Aporva Amarnath, Subhankar Pal, Hiwot Kassa, Augusto Vega, Alper Buyuktosunoglu, Hubertus Franke, John-David Wellman, Ronald Dreslinski, Pradip Bose

    Abstract: Systems-on-Chips (SoCs) that power autonomous vehicles (AVs) must meet stringent performance and safety requirements prior to deployment. With increasing complexity in AV applications, the system needs to meet these real-time demands of multiple safety-critical applications simultaneously. A typical AV-SoC is a heterogeneous multiprocessor consisting of accelerators supported by general-purpose co…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: 14 pages, 11 figures, 4 tables

  6. arXiv:2007.14371

    cs.AR cs.DC

    STOMP: A Tool for Evaluation of Scheduling Policies in Heterogeneous Multi-Processors

    Authors: Augusto Vega, Aporva Amarnath, John-David Wellman, Hiwot Kassa, Subhankar Pal, Hubertus Franke, Alper Buyuktosunoglu, Ronald Dreslinski, Pradip Bose

    Abstract: The proliferation of heterogeneous chip multiprocessors in recent years has reached unprecedented levels. Traditional homogeneous platforms have shown fundamental limitations when it comes to enabling high-performance yet ultra-low-power computing, in particular in application domains with real-time execution deadlines or criticality constraints. By combining the right set of general-purpose cores…

    Submitted 28 July, 2020; originally announced July 2020.