Skip to main content

Showing 1–5 of 5 results for author: Manasi, S D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.12120  [pdf, other

    cs.LG cs.AR

    An Open-Source ML-Based Full-Stack Optimization Framework for Machine Learning Accelerators

    Authors: Hadi Esmaeilzadeh, Soroush Ghodrati, Andrew B. Kahng, Joon Kyung Kim, Sean Kinzer, Sayak Kundu, Rohan Mahapatra, Susmita Dey Manasi, Sachin Sapatnekar, Zhiang Wang, Ziqing Zeng

    Abstract: Parameterizable machine learning (ML) accelerators are the product of recent breakthroughs in ML. To fully enable their design space exploration (DSE), we propose a physical-design-driven, learning-based prediction framework for hardware-accelerated deep neural network (DNN) and non-DNN ML algorithms. It adopts a unified approach that combines backend power, performance, and area (PPA) analysis wi… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: This is an extended version of our work titled "Physically Accurate Learning-based Performance Prediction of Hardware-accelerated ML Algorithms" published in MLCAD 2022

  2. arXiv:2306.16767  [pdf, other

    cs.AR cs.LG

    Performance Analysis of DNN Inference/Training with Convolution and non-Convolution Operations

    Authors: Hadi Esmaeilzadeh, Soroush Ghodrati, Andrew B. Kahng, Sean Kinzer, Susmita Dey Manasi, Sachin S. Sapatnekar, Zhiang Wang

    Abstract: Today's performance analysis frameworks for deep learning accelerators suffer from two significant limitations. First, although modern convolutional neural network (CNNs) consist of many types of layers other than convolution, especially during training, these frameworks largely focus on convolution layers only. Second, these frameworks are generally targeted towards inference, and lack support fo… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Journal ref: ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 30, Issue 1, Article No.: 3, Pages 1 - 34, Oct. 2024

  3. arXiv:2105.10554  [pdf, other

    cs.AR cs.LG

    GNNIE: GNN Inference Engine with Load-balancing and Graph-Specific Caching

    Authors: Sudipta Mondal, Susmita Dey Manasi, Kishor Kunal, S. Ramprasath, Sachin S. Sapatnekar

    Abstract: Graph neural networks (GNN) analysis engines are vital for real-world problems that use large graph models. Challenges for a GNN hardware platform include the ability to (a) host a variety of GNNs, (b) handle high sparsity in input vertex feature vectors and the graph adjacency matrix and the accompanying random memory access patterns, and (c) maintain load-balanced computation in the face of unev… ▽ More

    Submitted 7 August, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  4. NeuPart: Using Analytical Models to Drive Energy-Efficient Partitioning of CNN Computations on Cloud-Connected Mobile Clients

    Authors: Susmita Dey Manasi, Farhana Sharmin Snigdha, Sachin S. Sapatnekar

    Abstract: Data processing on convolutional neural networks (CNNs) places a heavy burden on energy-constrained mobile platforms. This work optimizes energy on a mobile client by partitioning CNN computations between in situ processing on the client and offloaded computations in the cloud. A new analytical CNN energy model is formulated, capturing all major components of the in situ computation, for ASIC-base… ▽ More

    Submitted 25 June, 2020; v1 submitted 9 May, 2019; originally announced May 2019.

    Comments: Published in IEEE Transactions on Very Large Scale Integration (VLSI) Systems, April 2020

    Journal ref: IEEE Transactions on Very Large Scale Integration Systems (TVLSI), vol. 28, no. 8, pp. 1844-1857, Aug. 2020

  5. arXiv:1610.03902  [pdf, other

    cs.ET

    Straintronic magneto-tunneling-junction based ternary content addressable memory

    Authors: S. Dey Manasi, M. M. Al Rashid, J. Atulasimha, S. Bandyopadhyay, A. R. Trivedi

    Abstract: Straintronic magneto-tunneling junction (s-MTJ) switches, whose resistances are controlled with voltage-generated strain in the magnetostrictive free layer of the MTJ, are extremely energy-efficient switches that would dissipate a few aJ of energy during switching. Unfortunately, they are also relatively error-prone and have low resistance on/off ratio. This suggests that as computing elements, th… ▽ More

    Submitted 21 October, 2016; v1 submitted 12 October, 2016; originally announced October 2016.

    Comments: 8 pages, 11 figures

    Journal ref: Part I: IEEE Transactions on Electron Devices (Volume: 64, Issue: 7, Page(s): 2835-2841, July 2017), Part II: IEEE Transactions on Electron Devices (Volume: 64, Issue: 7, Page(s): 2842-2848, July 2017)