Skip to main content

Showing 1–1 of 1 results for author: Liu, T W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.15703  [pdf, other

    cs.CV stat.ML

    Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks

    Authors: Julieta Martinez, Jashan Shewakramani, Ting Wei Liu, Ioan Andrei Bârsan, Wenyuan Zeng, Raquel Urtasun

    Abstract: Compressing large neural networks is an important step for their deployment in resource-constrained computational platforms. In this context, vector quantization is an appealing framework that expresses multiple parameters using a single code, and has recently achieved state-of-the-art network compression on a range of core vision and natural language processing tasks. Key to the success of vector… ▽ More

    Submitted 10 April, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: CVPR 21 Oral