Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning

Nanda, Vedant; Dooley, Samuel; Singla, Sahil; Feizi, Soheil; Dickerson, John P.

Computer Science > Machine Learning

arXiv:2006.12621v2 (cs)

[Submitted on 17 Jun 2020 (v1), revised 2 Jul 2020 (this version, v2), latest version 21 Jan 2021 (v4)]

Title:Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning

Authors:Vedant Nanda, Samuel Dooley, Sahil Singla, Soheil Feizi, John P. Dickerson

View PDF

Abstract:Deep neural networks are being increasingly used in real world applications (e.g. surveillance, face recognition). This has resulted in concerns about the fairness of decisions made by these models. Various notions and measures of fairness have been proposed to ensure that a decision-making system does not disproportionately harm (or benefit) particular subgroups of population. In this paper, we argue that traditional notions of fairness that are only based on models' outputs are not sufficient when decision-making systems such as deep networks are vulnerable to adversarial attacks. We argue that in some cases, it may be easier for an attacker to target a particular subgroup, resulting in a form of \textit{robustness bias}. We propose a new notion of \textit{adversarial fairness} that requires all subgroups to be equally robust to adversarial perturbations. We show that state-of-the-art neural networks can exhibit robustness bias on real world datasets such as CIFAR10, CIFAR100, Adience, and UTKFace. We then formulate a measure of our proposed fairness notion and use it as a regularization term to decrease the robustness bias in the traditional empirical risk minimization objective. Through empirical evidence, we show that training with our proposed regularization term can partially mitigate adversarial unfairness while maintaining reasonable classification accuracy.

Comments:	18 pages, 17 figures; Under review
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY)
Cite as:	arXiv:2006.12621 [cs.LG]
	(or arXiv:2006.12621v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.12621

Submission history

From: Vedant Nanda [view email]
[v1] Wed, 17 Jun 2020 22:22:24 UTC (6,445 KB)
[v2] Thu, 2 Jul 2020 07:42:59 UTC (2,999 KB)
[v3] Tue, 13 Oct 2020 00:56:26 UTC (7,653 KB)
[v4] Thu, 21 Jan 2021 13:18:04 UTC (4,043 KB)

Computer Science > Machine Learning

Title:Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators