Low Precision Policy Distillation with Application to Low-Power, Real-time Sensation-Cognition-Action Loop with Neuromorphic Computing
Authors:
Jeffrey L Mckinstry,
Davis R. Barch,
Deepika Bablani,
Michael V. Debole,
Steven K. Esser,
Jeffrey A. Kusnitz,
John V. Arthur,
Dharmendra S. Modha
Abstract:
Low precision networks in the reinforcement learning (RL) setting are relatively unexplored because of the limitations of binary activations for function approximation. Here, in the discrete action ATARI domain, we demonstrate, for the first time, that low precision policy distillation from a high precision network provides a principled, practical way to train an RL agent. As an application, on 10…
▽ More
Low precision networks in the reinforcement learning (RL) setting are relatively unexplored because of the limitations of binary activations for function approximation. Here, in the discrete action ATARI domain, we demonstrate, for the first time, that low precision policy distillation from a high precision network provides a principled, practical way to train an RL agent. As an application, on 10 different ATARI games, we demonstrate real-time end-to-end game playing on low-power neuromorphic hardware by converting a sequence of game frames into discrete actions.
△ Less
Submitted 24 September, 2018;
originally announced September 2018.
Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing
Authors:
Steven K. Esser,
Paul A. Merolla,
John V. Arthur,
Andrew S. Cassidy,
Rathinakumar Appuswamy,
Alexander Andreopoulos,
David J. Berg,
Jeffrey L. McKinstry,
Timothy Melano,
Davis R. Barch,
Carmelo di Nolfo,
Pallab Datta,
Arnon Amir,
Brian Taba,
Myron D. Flickner,
Dharmendra S. Modha
Abstract:
Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural pri…
▽ More
Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural primitives, can implement deep convolution networks that i) approach state-of-the-art classification accuracy across 8 standard datasets, encompassing vision and speech, ii) perform inference while preserving the hardware's underlying energy-efficiency and high throughput, running on the aforementioned datasets at between 1200 and 2600 frames per second and using between 25 and 275 mW (effectively > 6000 frames / sec / W) and iii) can be specified and trained using backpropagation with the same ease-of-use as contemporary deep learning. For the first time, the algorithmic power of deep learning can be merged with the efficiency of neuromorphic processors, bringing the promise of embedded, intelligent, brain-inspired computing one step closer.
△ Less
Submitted 24 May, 2016; v1 submitted 27 March, 2016;
originally announced March 2016.