Diminishing the Effect of Adversarial Perturbations via Refining Feature Representation

Asadi, Nader; Sarfi, AmirMohammad; Tahsini, Sahba; Eftekhari, Mahdi

Computer Science > Computer Vision and Pattern Recognition

arXiv:1907.01023v1 (cs)

[Submitted on 1 Jul 2019 (this version), latest version 1 Oct 2019 (v2)]

Title:Diminishing the Effect of Adversarial Perturbations via Refining Feature Representation

Authors:Nader Asadi, AmirMohammad Sarfi, Sahba Tahsini, Mahdi Eftekhari

View PDF

Abstract:Deep neural networks are highly vulnerable to adversarial examples, which imposes severe security issues for these state-of-the-art models. Many defense methods have been proposed to mitigate this problem. However, a lot of them depend on modification or additional training of the target model. In this work, we analytically investigate each layer representation of non-perturbed and perturbed images and show the effect of perturbations on each of these representations. Accordingly, a method based on whitening coloring transform is proposed in order to diminish the misrepresentation of any desirable layer caused by adversaries. Our method can be applied to any layer of any arbitrary model without the need of any modification or additional training. Due to the fact that full whitening of the layer representation is not easily differentiable, our proposed method is superbly robust against white-box attacks. Furthermore, we demonstrate the strength of our method against some state-of-the-art black-box attacks such as Carlini-Wagner L2 attack and we show that our method is able to defend against some non-constrained attacks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1907.01023 [cs.CV]
	(or arXiv:1907.01023v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1907.01023

Submission history

From: Nader Asadi [view email]
[v1] Mon, 1 Jul 2019 19:21:22 UTC (647 KB)
[v2] Tue, 1 Oct 2019 17:43:49 UTC (879 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-07

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Nader Asadi
AmirMohammad Sarfi
Sahba Tahsini
Mahdi Eftekhari

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Diminishing the Effect of Adversarial Perturbations via Refining Feature Representation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Diminishing the Effect of Adversarial Perturbations via Refining Feature Representation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators