
Showing 1–10 of 10 results for author: Modha, D

  1. arXiv:2301.13330  [pdf, other]

    cs.LG cs.CV

    Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference

    Authors: Deepika Bablani, Jeffrey L. Mckinstry, Steven K. Esser, Rathinakumar Appuswamy, Dharmendra S. Modha

    Abstract: For efficient neural network inference, it is desirable to achieve state-of-the-art accuracy with the simplest networks requiring the least computation, memory, and power. Quantizing networks to lower precision is a powerful technique for simplifying networks. As each layer of a network may have different sensitivity to quantization, mixed precision quantization methods selectively tune the precis…

    Submitted 10 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  2. arXiv:1902.08153  [pdf, other]

    cs.LG stat.ML

    Learned Step Size Quantization

    Authors: Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha

    Abstract: Deep networks run with low precision operations at inference time offer power and space advantages over high precision alternatives, but need to overcome the challenge of maintaining high accuracy as precision decreases. Here, we present a method for training such networks, Learned Step Size Quantization, that achieves the highest accuracy to date on the ImageNet dataset when using models, from a…

    Submitted 6 May, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: International Conference on Learning Representations (2020)

  3. arXiv:1809.09260  [pdf, other]

    cs.LG cs.NE stat.ML

    Low Precision Policy Distillation with Application to Low-Power, Real-time Sensation-Cognition-Action Loop with Neuromorphic Computing

    Authors: Jeffrey L Mckinstry, Davis R. Barch, Deepika Bablani, Michael V. Debole, Steven K. Esser, Jeffrey A. Kusnitz, John V. Arthur, Dharmendra S. Modha

    Abstract: Low precision networks in the reinforcement learning (RL) setting are relatively unexplored because of the limitations of binary activations for function approximation. Here, in the discrete action ATARI domain, we demonstrate, for the first time, that low precision policy distillation from a high precision network provides a principled, practical way to train an RL agent. As an application, on 10…

    Submitted 24 September, 2018; originally announced September 2018.

  4. arXiv:1809.04191  [pdf, other]

    cs.CV

    Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference

    Authors: Jeffrey L. McKinstry, Steven K. Esser, Rathinakumar Appuswamy, Deepika Bablani, John V. Arthur, Izzet B. Yildiz, Dharmendra S. Modha

    Abstract: To realize the promise of ubiquitous embedded deep network inference, it is essential to seek limits of energy and area efficiency. To this end, low-precision networks offer tremendous promise because both energy and area scale down quadratically with the reduction in precision. Here we demonstrate ResNet-18, -34, -50, -152, Inception-v3, Densenet-161, and VGG-16bn networks on the ImageNet classif…

    Submitted 24 February, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

  5. arXiv:1606.02407  [pdf, other]

    cs.NE cs.AI cs.CV cs.LG

    Structured Convolution Matrices for Energy-efficient Deep learning

    Authors: Rathinakumar Appuswamy, Tapan Nayak, John Arthur, Steven Esser, Paul Merolla, Jeffrey Mckinstry, Timothy Melano, Myron Flickner, Dharmendra Modha

    Abstract: We derive a relationship between network representation in energy-efficient neuromorphic architectures and block Toeplitz convolutional matrices. Inspired by this connection, we develop deep convolutional networks using a family of structured convolutional matrices and achieve state-of-the-art trade-off between energy efficiency and classification accuracy for well-known image recognition tasks. We…

    Submitted 8 June, 2016; originally announced June 2016.

  6. arXiv:1606.01981  [pdf, other]

    cs.NE cs.CV cs.LG

    Deep neural networks are robust to weight binarization and other non-linear distortions

    Authors: Paul Merolla, Rathinakumar Appuswamy, John Arthur, Steve K. Esser, Dharmendra Modha

    Abstract: Recent results show that deep neural networks achieve excellent performance even when, during training, weights are quantized and projected to a binary representation. Here, we show that this is just the tip of the iceberg: these same networks, during testing, also exhibit a remarkable robustness to distortions beyond quantization, including additive and multiplicative noise, and a class of non-li…

    Submitted 6 June, 2016; originally announced June 2016.

  7. Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing

    Authors: Steven K. Esser, Paul A. Merolla, John V. Arthur, Andrew S. Cassidy, Rathinakumar Appuswamy, Alexander Andreopoulos, David J. Berg, Jeffrey L. McKinstry, Timothy Melano, Davis R. Barch, Carmelo di Nolfo, Pallab Datta, Arnon Amir, Brian Taba, Myron D. Flickner, Dharmendra S. Modha

    Abstract: Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural pri…

    Submitted 24 May, 2016; v1 submitted 27 March, 2016; originally announced March 2016.

    Comments: 7 pages, 6 figures

    Journal ref: PNAS 113 (2016) 11441-11446

  8. Mapping Generative Models onto a Network of Digital Spiking Neurons

    Authors: Bruno U. Pedroni, Srinjoy Das, John V. Arthur, Paul A. Merolla, Bryan L. Jackson, Dharmendra S. Modha, Kenneth Kreutz-Delgado, Gert Cauwenberghs

    Abstract: Stochastic neural networks such as Restricted Boltzmann Machines (RBMs) have been successfully used in applications ranging from speech recognition to image classification. Inference and learning in these algorithms use a Markov Chain Monte Carlo procedure called Gibbs sampling, where a logistic function forms the kernel of this sampler. On the other side of the spectrum, neuromorphic systems have…

    Submitted 9 October, 2015; v1 submitted 24 September, 2015; originally announced September 2015.

    Comments: A similar version of this manuscript has been submitted to IEEE TBioCAS for revision in October 2015

  9. arXiv:1503.07793  [pdf, other]

    cs.NE

    Gibbs Sampling with Low-Power Spiking Digital Neurons

    Authors: Srinjoy Das, Bruno Umbria Pedroni, Paul Merolla, John Arthur, Andrew S. Cassidy, Bryan L. Jackson, Dharmendra Modha, Gert Cauwenberghs, Ken Kreutz-Delgado

    Abstract: Restricted Boltzmann Machines and Deep Belief Networks have been successfully used in a wide variety of applications including image classification and speech recognition. Inference and learning in these algorithms use a Markov Chain Monte Carlo procedure called Gibbs sampling. A sigmoidal function forms the kernel of this sampler, which can be realized from the firing statistics of noisy integrat…

    Submitted 27 March, 2015; v1 submitted 26 March, 2015; originally announced March 2015.

    Comments: Accepted at ISCAS 2015

  10. arXiv:1210.4700  [pdf, ps, other]

    cs.IT

    Optimal Lempel-Ziv based lossy compression for memoryless data: how to make the right mistakes

    Authors: Narayana Santhanam, Dharmendra Modha

    Abstract: Compression refers to encoding data using bits, so that the representation uses as few bits as possible. Compression could be lossless (i.e. encoded data can be recovered exactly from its representation) or lossy, where the data is compressed more than in the lossless case but can still be recovered to within a prespecified distortion metric. In this paper, we prove the optimality of Codelet Parsing, a…

    Submitted 17 October, 2012; v1 submitted 17 October, 2012; originally announced October 2012.

    Comments: This file is not the final version, and will be updated for the next few days. (Edited 10/17)