Skip to main content

Showing 1–1 of 1 results for author: Fernandes, S L

Searching in archive cs. Search in all archives.
.
  1. arXiv:1903.05821  [pdf, other

    cs.LG stat.ML

    Attribution-driven Causal Analysis for Detection of Adversarial Examples

    Authors: Susmit Jha, Sunny Raj, Steven Lawrence Fernandes, Sumit Kumar Jha, Somesh Jha, Gunjan Verma, Brian Jalaian, Ananthram Swami

    Abstract: Attribution methods have been developed to explain the decision of a machine learning model on a given input. We use the Integrated Gradient method for finding attributions to define the causal neighborhood of an input by incrementally masking high attribution features. We study the robustness of machine learning models on benign and adversarial inputs in this neighborhood. Our study indicates tha… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: 11 pages, 6 figures