Denoised Smoothing: A Provable Defense for Pretrained Classifiers

Salman, Hadi; Sun, Mingjie; Yang, Greg; Kapoor, Ashish; Kolter, J. Zico

Computer Science > Machine Learning

arXiv:2003.01908 (cs)

[Submitted on 4 Mar 2020 (v1), last revised 21 Sep 2020 (this version, v2)]

Title:Denoised Smoothing: A Provable Defense for Pretrained Classifiers

Authors:Hadi Salman, Mingjie Sun, Greg Yang, Ashish Kapoor, J. Zico Kolter

View PDF

Abstract:We present a method for provably defending any pretrained image classifier against $\ell_p$ adversarial attacks. This method, for instance, allows public vision API providers and users to seamlessly convert pretrained non-robust classification services into provably robust ones. By prepending a custom-trained denoiser to any off-the-shelf image classifier and using randomized smoothing, we effectively create a new classifier that is guaranteed to be $\ell_p$-robust to adversarial examples, without modifying the pretrained classifier. Our approach applies to both the white-box and the black-box settings of the pretrained classifier. We refer to this defense as denoised smoothing, and we demonstrate its effectiveness through extensive experimentation on ImageNet and CIFAR-10. Finally, we use our approach to provably defend the Azure, Google, AWS, and ClarifAI image classification APIs. Our code replicating all the experiments in the paper can be found at: this https URL.

Comments:	10 pages main text; 29 pages total
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2003.01908 [cs.LG]
	(or arXiv:2003.01908v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.01908

Submission history

From: Hadi Salman [view email]
[v1] Wed, 4 Mar 2020 06:15:55 UTC (3,352 KB)
[v2] Mon, 21 Sep 2020 02:20:16 UTC (2,441 KB)

Computer Science > Machine Learning

Title:Denoised Smoothing: A Provable Defense for Pretrained Classifiers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Denoised Smoothing: A Provable Defense for Pretrained Classifiers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators