On Certifying Robustness against Backdoor Attacks via Randomized Smoothing

Wang, Binghui; Cao, Xiaoyu; jia, Jinyuan; Gong, Neil Zhenqiang

Computer Science > Cryptography and Security

arXiv:2002.11750 (cs)

[Submitted on 26 Feb 2020 (v1), last revised 20 Jul 2020 (this version, v4)]

Title:On Certifying Robustness against Backdoor Attacks via Randomized Smoothing

Authors:Binghui Wang, Xiaoyu Cao, Jinyuan jia, Neil Zhenqiang Gong

View PDF

Abstract:Backdoor attack is a severe security threat to deep neural networks (DNNs). We envision that, like adversarial examples, there will be a cat-and-mouse game for backdoor attacks, i.e., new empirical defenses are developed to defend against backdoor attacks but they are soon broken by strong adaptive backdoor attacks. To prevent such cat-and-mouse game, we take the first step towards certified defenses against backdoor attacks. Specifically, in this work, we study the feasibility and effectiveness of certifying robustness against backdoor attacks using a recent technique called randomized smoothing. Randomized smoothing was originally developed to certify robustness against adversarial examples. We generalize randomized smoothing to defend against backdoor attacks. Our results show the theoretical feasibility of using randomized smoothing to certify robustness against backdoor attacks. However, we also find that existing randomized smoothing methods have limited effectiveness at defending against backdoor attacks, which highlight the needs of new theory and methods to certify robustness against backdoor attacks.

Comments:	CVPR 2020 Workshop on Adversarial Machine Learning in Computer Vision, 2020. DeepMind Best Extended Abstract
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2002.11750 [cs.CR]
	(or arXiv:2002.11750v4 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2002.11750

Submission history

From: Binghui Wang [view email]
[v1] Wed, 26 Feb 2020 19:15:46 UTC (133 KB)
[v2] Sat, 11 Apr 2020 18:53:12 UTC (153 KB)
[v3] Tue, 14 Apr 2020 13:02:37 UTC (153 KB)
[v4] Mon, 20 Jul 2020 16:15:42 UTC (154 KB)

Computer Science > Cryptography and Security

Title:On Certifying Robustness against Backdoor Attacks via Randomized Smoothing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:On Certifying Robustness against Backdoor Attacks via Randomized Smoothing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators