Reliability Testing for Natural Language Processing Systems

Tan, Samson; Joty, Shafiq; Baxter, Kathy; Taeihagh, Araz; Bennett, Gregory A.; Kan, Min-Yen

arXiv:2105.02590 (cs)

[Submitted on 6 May 2021 (v1), last revised 1 Jun 2021 (this version, v3)]

Abstract: Questions of fairness, robustness, and transparency are paramount to address before deploying NLP systems. Central to these concerns is the question of reliability: Can NLP systems reliably treat different demographics fairly and function correctly in diverse and noisy environments? To address this, we argue for the need for reliability testing and contextualize it among existing work on improving accountability. We show how adversarial attacks can be reframed for this goal, via a framework for developing reliability tests. We argue that reliability testing -- with an emphasis on interdisciplinary collaboration -- will enable rigorous and targeted testing, and aid in the enactment and enforcement of industry standards.

Comments:	Accepted to ACL-IJCNLP 2021 (main conference). Camera-ready version
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2105.02590 [cs.LG]
	(or arXiv:2105.02590v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.02590

Submission history

From: Samson Tan
[v1] Thu, 6 May 2021 11:24:58 UTC (331 KB)
[v2] Thu, 13 May 2021 04:17:44 UTC (330 KB)
[v3] Tue, 1 Jun 2021 03:55:40 UTC (304 KB)
