Combating Adversarial Attacks Using Sparse Representations

Gopalakrishnan, Soorya; Marzi, Zhinus; Madhow, Upamanyu; Pedarsani, Ramtin

Statistics > Machine Learning

arXiv:1803.03880 (stat)

[Submitted on 11 Mar 2018 (v1), last revised 13 Jul 2018 (this version, v3)]

Title:Combating Adversarial Attacks Using Sparse Representations

Authors:Soorya Gopalakrishnan, Zhinus Marzi, Upamanyu Madhow, Ramtin Pedarsani

View PDF

Abstract:It is by now well-known that small adversarial perturbations can induce classification errors in deep neural networks (DNNs). In this paper, we make the case that sparse representations of the input data are a crucial tool for combating such attacks. For linear classifiers, we show that a sparsifying front end is provably effective against $\ell_{\infty}$-bounded attacks, reducing output distortion due to the attack by a factor of roughly $K / N$ where $N$ is the data dimension and $K$ is the sparsity level. We then extend this concept to DNNs, showing that a "locally linear" model can be used to develop a theoretical foundation for crafting attacks and defenses. Experimental results for the MNIST dataset show the efficacy of the proposed sparsifying front end.

Comments:	Accepted at ICLR Workshop 2018
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:1803.03880 [stat.ML]
	(or arXiv:1803.03880v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1803.03880

Submission history

From: Soorya Gopalakrishnan [view email]
[v1] Sun, 11 Mar 2018 02:02:46 UTC (18 KB)
[v2] Sat, 28 Apr 2018 10:36:47 UTC (13 KB)
[v3] Fri, 13 Jul 2018 17:16:53 UTC (18 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-03

Change to browse by:

cs
cs.IT
math
math.IT
stat
stat.ML

References & Citations

export BibTeX citation

Statistics > Machine Learning

Title:Combating Adversarial Attacks Using Sparse Representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Combating Adversarial Attacks Using Sparse Representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators