Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks

Li, Yiran; Wang, Junpeng; Fujiwara, Takanori; Ma, Kwan-Liu

doi:10.1145/3587470

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.02814 (cs)

[Submitted on 6 Mar 2023]

Title:Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks

Authors:Yiran Li, Junpeng Wang, Takanori Fujiwara, Kwan-Liu Ma

View PDF

Abstract:Adversarial attacks on a convolutional neural network (CNN) -- injecting human-imperceptible perturbations into an input image -- could fool a high-performance CNN into making incorrect predictions. The success of adversarial attacks raises serious concerns about the robustness of CNNs, and prevents them from being used in safety-critical applications, such as medical diagnosis and autonomous driving. Our work introduces a visual analytics approach to understanding adversarial attacks by answering two questions: (1) which neurons are more vulnerable to attacks and (2) which image features do these vulnerable neurons capture during the prediction? For the first question, we introduce multiple perturbation-based measures to break down the attacking magnitude into individual CNN neurons and rank the neurons by their vulnerability levels. For the second, we identify image features (e.g., cat ears) that highly stimulate a user-selected neuron to augment and validate the neuron's responsibility. Furthermore, we support an interactive exploration of a large number of neurons by aiding with hierarchical clustering based on the neurons' roles in the prediction. To this end, a visual analytics system is designed to incorporate visual reasoning for interpreting adversarial attacks. We validate the effectiveness of our system through multiple case studies as well as feedback from domain experts.

Comments:	Accepted by the Special Issue on Human-Centered Explainable AI, ACM Transactions on Interactive Intelligent Systems
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2303.02814 [cs.CV]
	(or arXiv:2303.02814v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.02814
Related DOI:	https://doi.org/10.1145/3587470

Submission history

From: Yiran Li [view email]
[v1] Mon, 6 Mar 2023 01:01:56 UTC (6,757 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators