CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models

Kuznedelev, Denis; Kurtic, Eldar; Frantar, Elias; Alistarh, Dan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.09223 (cs)

[Submitted on 14 Oct 2022 (v1), last revised 31 May 2023 (this version, v2)]

Title:CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models

Authors:Denis Kuznedelev, Eldar Kurtic, Elias Frantar, Dan Alistarh

View PDF

Abstract:Driven by significant improvements in architectural design and training pipelines, computer vision has recently experienced dramatic progress in terms of accuracy on classic benchmarks such as ImageNet. These highly-accurate models are challenging to deploy, as they appear harder to compress using standard techniques such as pruning. We address this issue by introducing the Correlation Aware Pruner (CAP), a new unstructured pruning framework which significantly pushes the compressibility limits for state-of-the-art architectures. Our method is based on two technical advancements: a new theoretically-justified pruner, which can handle complex weight correlations accurately and efficiently during the pruning process itself, and an efficient finetuning procedure for post-compression recovery. We validate our approach via extensive experiments on several modern vision models such as Vision Transformers (ViT), modern CNNs, and ViT-CNN hybrids, showing for the first time that these can be pruned to high sparsity levels (e.g. $\geq 75$%) with low impact on accuracy ($\leq 1$% relative drop). Our approach is also compatible with structured pruning and quantization, and can lead to practical speedups of 1.5 to 2.4x without accuracy loss. To further showcase CAP's accuracy and scalability, we use it to show for the first time that extremely-accurate large vision models, trained via self-supervised techniques, can also be pruned to moderate sparsities, with negligible accuracy loss.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
MSC classes:	68T07
ACM classes:	I.m
Cite as:	arXiv:2210.09223 [cs.CV]
	(or arXiv:2210.09223v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.09223

Submission history

From: Denis Kuznedelev [view email]
[v1] Fri, 14 Oct 2022 12:19:09 UTC (3,368 KB)
[v2] Wed, 31 May 2023 09:59:46 UTC (5,026 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators