Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Deng, Zhiwei; Russakovsky, Olga

Computer Science > Machine Learning

arXiv:2206.02916 (cs)

[Submitted on 6 Jun 2022 (v1), last revised 19 Nov 2022 (this version, v2)]

Title:Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Authors:Zhiwei Deng, Olga Russakovsky

View PDF

Abstract:We propose an algorithm that compresses the critical information of a large dataset into compact addressable memories. These memories can then be recalled to quickly re-train a neural network and recover the performance (instead of storing and re-training on the full original dataset). Building upon the dataset distillation framework, we make a key observation that a shared common representation allows for more efficient and effective distillation. Concretely, we learn a set of bases (aka ``memories'') which are shared between classes and combined through learned flexible addressing functions to generate a diverse set of training examples. This leads to several benefits: 1) the size of compressed data does not necessarily grow linearly with the number of classes; 2) an overall higher compression rate with more effective distillation is achieved; and 3) more generalized queries are allowed beyond recalling the original classes. We demonstrate state-of-the-art results on the dataset distillation task across six benchmarks, including up to 16.5% and 9.7% in retained accuracy improvement when distilling CIFAR10 and CIFAR100 respectively. We then leverage our framework to perform continual learning, achieving state-of-the-art results on four benchmarks, with 23.2% accuracy improvement on MANY. The code is released on our project webpage this https URL.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.02916 [cs.LG]
	(or arXiv:2206.02916v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.02916

Submission history

From: Zhiwei Deng [view email]
[v1] Mon, 6 Jun 2022 21:32:26 UTC (3,440 KB)
[v2] Sat, 19 Nov 2022 03:48:09 UTC (12,984 KB)

Computer Science > Machine Learning

Title:Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators