Rademacher Complexity for Adversarially Robust Generalization

Yin, Dong; Ramchandran, Kannan; Bartlett, Peter

Computer Science > Machine Learning

arXiv:1810.11914v2 (cs)

[Submitted on 29 Oct 2018 (v1), revised 7 Nov 2018 (this version, v2), latest version 29 Jul 2020 (v4)]

Title:Rademacher Complexity for Adversarially Robust Generalization

Authors:Dong Yin, Kannan Ramchandran, Peter Bartlett

View PDF

Abstract:Many machine learning models are vulnerable to adversarial attacks. It has been observed that adding adversarial perturbations that are imperceptible to humans can make machine learning models produce wrong predictions with high confidence. Although there has been a lot of recent effort dedicated to learning models that are adversarially robust, this remains an open problem. In particular, it has been empirically observed that although using adversarial training can effectively reduce the adversarial classification error on the training dataset, the learned model cannot generalize well to the test data. Moreover, we lack a theoretical understanding of the generalization property of machine learning models in the adversarial setting.
In this paper, we study the adversarially robust generalization problem through the lens of Rademacher complexity. We focus on $\ell_\infty$ adversarial attacks and study both linear classifiers and feedforward neural networks. For binary linear classifiers, we prove tight bounds for the adversarial Rademacher complexity, and show that in the adversarial setting, the Rademacher complexity is never smaller than that in the natural setting, and it has an unavoidable dimension dependence, unless the weight vector has bounded $\ell_1$ norm. The results also extend to multi-class linear classifiers. For (nonlinear) neural networks, we show that the dimension dependence also exists in the Rademacher complexity of the $\ell_\infty$ adversarial loss function class. We further consider a surrogate adversarial loss and prove margin bounds for this setting. Our results indicate that having $\ell_1$ norm constraints on the weight matrices might be a potential way to improve generalization in the adversarial setting.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1810.11914 [cs.LG]
	(or arXiv:1810.11914v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.11914

Submission history

From: Dong Yin [view email]
[v1] Mon, 29 Oct 2018 00:51:08 UTC (21 KB)
[v2] Wed, 7 Nov 2018 06:40:59 UTC (21 KB)
[v3] Fri, 25 Jan 2019 07:03:12 UTC (272 KB)
[v4] Wed, 29 Jul 2020 04:23:34 UTC (272 KB)

Computer Science > Machine Learning

Title:Rademacher Complexity for Adversarially Robust Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Rademacher Complexity for Adversarially Robust Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators