i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Wolfe, Cameron R.; Kyrillidis, Anastasios


arXiv:2112.04905 (cs)

[Submitted on 7 Dec 2021 (v1), last revised 29 Mar 2022 (this version, v2)]



Abstract: We propose a novel structured pruning algorithm for neural networks: the iterative Sparse Structured Pruning algorithm, dubbed i-SpaSP. Inspired by ideas from sparse signal recovery, i-SpaSP operates by iteratively identifying a larger set of important parameter groups (e.g., filters or neurons) within a network that contribute most to the residual between pruned and dense network output, then thresholding these groups based on a smaller, pre-defined pruning ratio. For both two-layer and multi-layer network architectures with ReLU activations, we show that the error induced by pruning with i-SpaSP decays polynomially, where the degree of this polynomial becomes arbitrarily large depending on the sparsity of the dense network's hidden representations. In our experiments, i-SpaSP is evaluated across a variety of datasets (i.e., MNIST, ImageNet, and XNLI) and architectures (i.e., feed-forward networks, ResNet34, MobileNetV2, and BERT), where it is shown to discover high-performing sub-networks and to improve upon the pruning efficiency of provable baseline methodologies by several orders of magnitude. Put simply, i-SpaSP is easy to implement with automatic differentiation, achieves strong empirical results, comes with theoretical convergence guarantees, and is efficient, which distinguishes it as one of the few computationally efficient, practical, and provable pruning algorithms.
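The identify-then-threshold loop described in the abstract can be illustrated with a minimal NumPy sketch for a two-layer ReLU network. This is not the authors' implementation: the function name `prune_hidden_neurons`, the importance score (inner product between a neuron's output contribution and the residual), the thresholding criterion (norm of each neuron's contribution), and the candidate-set size of `2 * keep` are all assumptions made for illustration, the last borrowed from common sparse-recovery practice.

```python
import numpy as np


def relu(z):
    return np.maximum(z, 0.0)


def prune_hidden_neurons(X, W1, W2, keep, num_iters=5):
    """Pick `keep` hidden neurons of the two-layer net relu(X @ W1) @ W2.

    Shapes: X is (n, d), W1 is (d, h), W2 is (h, k).
    Returns the sorted indices of the neurons that are kept.
    """
    H = relu(X @ W1)        # dense hidden activations, shape (n, h)
    dense_out = H @ W2      # dense network output, shape (n, k)
    num_hidden = H.shape[1]

    # Assumed thresholding criterion: size of each neuron's rank-one
    # contribution H[:, i] W2[i, :] to the output.
    contribution = np.linalg.norm(H, axis=0) * np.linalg.norm(W2, axis=1)

    active = np.array([], dtype=int)  # nothing is kept at the start,
    residual = dense_out.copy()       # so the residual is the full output

    for _ in range(num_iters):
        # Assumed importance score: |<contribution of neuron i, residual>|.
        score = np.abs(np.sum((H.T @ residual) * W2, axis=1))

        # Step 1: identify a larger candidate set and merge it with the
        # neurons that are currently kept.
        num_candidates = min(2 * keep, num_hidden)
        candidates = np.argsort(score)[-num_candidates:]
        merged = np.union1d(active, candidates)

        # Step 2: threshold the merged set back down to `keep` neurons.
        order = np.argsort(contribution[merged])
        active = np.sort(merged[order[-keep:]])

        # Residual between dense and pruned network output.
        residual = dense_out - H[:, active] @ W2[active, :]

    return active


# Toy usage: only hidden neurons 1, 4 and 6 influence the output.
rng = np.random.default_rng(0)
X = rng.normal(size=(32, 5))
W1 = rng.normal(size=(5, 8))
W2 = np.zeros((8, 2))
W2[[1, 4, 6]] = rng.normal(size=(3, 2))

kept = prune_hidden_neurons(X, W1, W2, keep=3)
dense_out = relu(X @ W1) @ W2
pruned_out = relu(X @ W1)[:, kept] @ W2[kept, :]
```

In this toy setup the output depends on only three hidden neurons, so keeping three neurons should reproduce the dense output. Real networks have no such exact sparsity, which is where the choice of importance score and thresholding rule starts to matter.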

Comments:	29 pages, 4 figures, 4th Annual Conference on Learning for Dynamics and Control
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
MSC classes:	68T07
ACM classes:	I.2.6; I.2.10; I.4.0
Cite as:	arXiv:2112.04905 [cs.LG]
	(or arXiv:2112.04905v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.04905

Submission history

From: Cameron R. Wolfe
[v1] Tue, 7 Dec 2021 05:26:45 UTC (718 KB)
[v2] Tue, 29 Mar 2022 20:40:50 UTC (709 KB)
