STRATA: Building Robustness with a Simple Method for Generating Black-box Adversarial Attacks for Models of Code

Springer, Jacob M.; Reinstadler, Bryn Marie; O'Reilly, Una-May

Computer Science > Machine Learning

arXiv:2009.13562v1 (cs)

[Submitted on 28 Sep 2020 (this version), latest version 19 Aug 2021 (v2)]

Title:STRATA: Building Robustness with a Simple Method for Generating Black-box Adversarial Attacks for Models of Code

Authors:Jacob M. Springer, Bryn Marie Reinstadler, Una-May O'Reilly

View PDF

Abstract:Adversarial examples are imperceptible perturbations in the input to a neural model that result in misclassification. Generating adversarial examples for source code poses an additional challenge compared to the domains of images and natural language, because source code perturbations must adhere to strict semantic guidelines so the resulting programs retain the functional meaning of the code. We propose a simple and efficient black-box method for generating state-of-the-art adversarial examples on models of code. Our method generates untargeted and targeted attacks, and empirically outperforms competing gradient-based methods with less information and less computational effort. We also use adversarial training to construct a model robust to these attacks; our attack reduces the F1 score of code2seq by 42%. Adversarial training brings the F1 score on adversarial examples up to 99% of baseline.

Comments:	13 pages, 3 figures, 10 tables
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2009.13562 [cs.LG]
	(or arXiv:2009.13562v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2009.13562

Submission history

From: Jacob Springer [view email]
[v1] Mon, 28 Sep 2020 18:21:19 UTC (1,120 KB)
[v2] Thu, 19 Aug 2021 20:20:34 UTC (1,976 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-09

Change to browse by:

cs
cs.CR
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jacob M. Springer
Una-May O'Reilly

export BibTeX citation

Computer Science > Machine Learning

Title:STRATA: Building Robustness with a Simple Method for Generating Black-box Adversarial Attacks for Models of Code

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:STRATA: Building Robustness with a Simple Method for Generating Black-box Adversarial Attacks for Models of Code

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators