Showing 1–1 of 1 results for author: Polino, A

  1. arXiv:1802.05668 [pdf, other]

    cs.NE cs.LG

    Model compression via distillation and quantization

    Authors: Antonio Polino, Razvan Pascanu, Dan Alistarh

    Abstract: Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classification to translation or reinforcement learning. One aspect of the field receiving considerable attention is efficiently executing deep models in resource-constrained environments, such as mobile or embedded devices. This paper focuses on this problem, and proposes two new compression methods, which…

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: 21 pages, published as a conference paper at ICLR 2018