Skip to main content

Showing 1–6 of 6 results for author: Modha, D S

.
  1. arXiv:2301.13330  [pdf, other

    cs.LG cs.CV

    Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference

    Authors: Deepika Bablani, Jeffrey L. Mckinstry, Steven K. Esser, Rathinakumar Appuswamy, Dharmendra S. Modha

    Abstract: For efficient neural network inference, it is desirable to achieve state-of-the-art accuracy with the simplest networks requiring the least computation, memory, and power. Quantizing networks to lower precision is a powerful technique for simplifying networks. As each layer of a network may have different sensitivity to quantization, mixed precision quantization methods selectively tune the precis… ▽ More

    Submitted 10 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  2. arXiv:1902.08153  [pdf, other

    cs.LG stat.ML

    Learned Step Size Quantization

    Authors: Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha

    Abstract: Deep networks run with low precision operations at inference time offer power and space advantages over high precision alternatives, but need to overcome the challenge of maintaining high accuracy as precision decreases. Here, we present a method for training such networks, Learned Step Size Quantization, that achieves the highest accuracy to date on the ImageNet dataset when using models, from a… ▽ More

    Submitted 6 May, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: International Conference on Learning Representations (2020)

  3. arXiv:1809.09260  [pdf, other

    cs.LG cs.NE stat.ML

    Low Precision Policy Distillation with Application to Low-Power, Real-time Sensation-Cognition-Action Loop with Neuromorphic Computing

    Authors: Jeffrey L Mckinstry, Davis R. Barch, Deepika Bablani, Michael V. Debole, Steven K. Esser, Jeffrey A. Kusnitz, John V. Arthur, Dharmendra S. Modha

    Abstract: Low precision networks in the reinforcement learning (RL) setting are relatively unexplored because of the limitations of binary activations for function approximation. Here, in the discrete action ATARI domain, we demonstrate, for the first time, that low precision policy distillation from a high precision network provides a principled, practical way to train an RL agent. As an application, on 10… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

  4. arXiv:1809.04191  [pdf, other

    cs.CV

    Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference

    Authors: Jeffrey L. McKinstry, Steven K. Esser, Rathinakumar Appuswamy, Deepika Bablani, John V. Arthur, Izzet B. Yildiz, Dharmendra S. Modha

    Abstract: To realize the promise of ubiquitous embedded deep network inference, it is essential to seek limits of energy and area efficiency. To this end, low-precision networks offer tremendous promise because both energy and area scale down quadratically with the reduction in precision. Here we demonstrate ResNet-18, -34, -50, -152, Inception-v3, Densenet-161, and VGG-16bn networks on the ImageNet classif… ▽ More

    Submitted 24 February, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

  5. Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing

    Authors: Steven K. Esser, Paul A. Merolla, John V. Arthur, Andrew S. Cassidy, Rathinakumar Appuswamy, Alexander Andreopoulos, David J. Berg, Jeffrey L. McKinstry, Timothy Melano, Davis R. Barch, Carmelo di Nolfo, Pallab Datta, Arnon Amir, Brian Taba, Myron D. Flickner, Dharmendra S. Modha

    Abstract: Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural pri… ▽ More

    Submitted 24 May, 2016; v1 submitted 27 March, 2016; originally announced March 2016.

    Comments: 7 pages, 6 figures

    Journal ref: PNAS 113 (2016) 11441-11446

  6. Mapping Generative Models onto a Network of Digital Spiking Neurons

    Authors: Bruno U. Pedroni, Srinjoy Das, John V. Arthur, Paul A. Merolla, Bryan L. Jackson, Dharmendra S. Modha, Kenneth Kreutz-Delgado, Gert Cauwenberghs

    Abstract: Stochastic neural networks such as Restricted Boltzmann Machines (RBMs) have been successfully used in applications ranging from speech recognition to image classification. Inference and learning in these algorithms use a Markov Chain Monte Carlo procedure called Gibbs sampling, where a logistic function forms the kernel of this sampler. On the other side of the spectrum, neuromorphic systems have… ▽ More

    Submitted 9 October, 2015; v1 submitted 24 September, 2015; originally announced September 2015.

    Comments: A similar version of this manuscript has been submitted to IEEE TBioCAS for revision in October 2015