Skip to main content

Showing 1–6 of 6 results for author: Athalye, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2103.14749  [pdf, other

    stat.ML cs.AI cs.LG

    Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks

    Authors: Curtis G. Northcutt, Anish Athalye, Jonas Mueller

    Abstract: We identify label errors in the test sets of 10 of the most commonly-used computer vision, natural language, and audio datasets, and subsequently study the potential for these label errors to affect benchmark results. Errors in test sets are numerous and widespread: we estimate an average of at least 3.3% errors across the 10 datasets, where for example label errors comprise at least 6% of the Ima… ▽ More

    Submitted 7 November, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Demo available at https://labelerrors.com/ and source code available at https://github.com/cleanlab/label-errors

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  2. arXiv:1902.06705  [pdf, ps, other

    cs.LG cs.CR stat.ML

    On Evaluating Adversarial Robustness

    Authors: Nicholas Carlini, Anish Athalye, Nicolas Papernot, Wieland Brendel, Jonas Rauber, Dimitris Tsipras, Ian Goodfellow, Aleksander Madry, Alexey Kurakin

    Abstract: Correctly evaluating defenses against adversarial examples has proven to be extremely difficult. Despite the significant amount of recent work attempting to design defenses that withstand adaptive attacks, few have succeeded; most papers that propose defenses are quickly shown to be incorrect. We believe a large contributing factor is the difficulty of performing security evaluations. In this pa… ▽ More

    Submitted 20 February, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: Living document; source available at https://github.com/evaluating-adversarial-robustness/adv-eval-paper/

  3. arXiv:1807.10272  [pdf, other

    stat.ML cs.CR cs.CV cs.LG

    Evaluating and Understanding the Robustness of Adversarial Logit Pairing

    Authors: Logan Engstrom, Andrew Ilyas, Anish Athalye

    Abstract: We evaluate the robustness of Adversarial Logit Pairing, a recently proposed defense against adversarial examples. We find that a network trained with Adversarial Logit Pairing achieves 0.6% accuracy in the threat model in which the defense is considered. We provide a brief overview of the defense and the threat models/claims considered, as well as a discussion of the methodology and results of ou… ▽ More

    Submitted 23 November, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: NeurIPS SECML 2018. Source code at https://github.com/labsix/adversarial-logit-pairing-analysis

  4. arXiv:1804.08598  [pdf, other

    cs.CV cs.CR stat.ML

    Black-box Adversarial Attacks with Limited Queries and Information

    Authors: Andrew Ilyas, Logan Engstrom, Anish Athalye, Jessy Lin

    Abstract: Current neural network-based classifiers are susceptible to adversarial examples even in the black-box setting, where the attacker only has query access to the model. In practice, the threat model for real-world systems is often more restrictive than the typical black-box model where the adversary can observe the full output of the network on arbitrarily many chosen inputs. We define three realist… ▽ More

    Submitted 11 July, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: ICML 2018. This supercedes the previous paper "Query-efficient Black-box adversarial examples."

  5. arXiv:1804.03286  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    On the Robustness of the CVPR 2018 White-Box Adversarial Example Defenses

    Authors: Anish Athalye, Nicholas Carlini

    Abstract: Neural networks are known to be vulnerable to adversarial examples. In this note, we evaluate the two white-box defenses that appeared at CVPR 2018 and find they are ineffective: when applying existing techniques, we can reduce the accuracy of the defended models to 0%.

    Submitted 10 April, 2018; originally announced April 2018.

  6. arXiv:1712.07113  [pdf, other

    cs.CV cs.LG stat.ML

    Query-Efficient Black-box Adversarial Examples (superceded)

    Authors: Andrew Ilyas, Logan Engstrom, Anish Athalye, Jessy Lin

    Abstract: Note that this paper is superceded by "Black-Box Adversarial Attacks with Limited Queries and Information." Current neural network-based image classifiers are susceptible to adversarial examples, even in the black-box setting, where the attacker is limited to query access without access to gradients. Previous methods --- substitute networks and coordinate-based finite-difference methods --- are… ▽ More

    Submitted 6 April, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

    Comments: Superceded by "Black-Box Adversarial Attacks with Limited Queries and Information."