New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound

Gupta, Arushi; Saunshi, Nikunj; Yu, Dingli; Lyu, Kaifeng; Arora, Sanjeev

arXiv:2211.02912 (stat)

[Submitted on 5 Nov 2022]

Abstract: Saliency methods compute heat maps that highlight portions of an input that were most important for the label assigned to it by a deep net. Evaluations of saliency methods convert this heat map into a new masked input by retaining the $k$ highest-ranked pixels of the original input and replacing the rest with "uninformative" pixels, and checking if the net's output is mostly unchanged. This is usually seen as an explanation of the output, but the current paper highlights reasons why this inference of causality may be suspect. Inspired by logic concepts of completeness and soundness, it observes that the above type of evaluation focuses on completeness of the explanation, but ignores soundness. New evaluation metrics are introduced to capture both notions, while staying in an intrinsic framework, i.e., using the dataset and the net, but no separately trained nets, human evaluations, etc. A simple saliency method is described that matches or outperforms prior methods in the evaluations. Experiments also suggest new intrinsic justifications, based on soundness, for popular heuristic tricks such as TV regularization and upsampling.
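The masking-based evaluation described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation and does not reproduce its new soundness metrics, which the abstract does not specify. Several pieces are assumptions made for illustration: the helper names `mask_top_k` and `output_preserved`, the use of a constant `fill_value` as the "uninformative" pixel, label agreement as the reading of "mostly unchanged", and the toy linear classifier `toy_predict` standing in for a deep net.

```python
import numpy as np


def mask_top_k(image, saliency, k, fill_value=0.0):
    """Keep the k highest-saliency pixels of `image`; set the rest to `fill_value`.

    `fill_value` is an assumed stand-in for an "uninformative" pixel.
    """
    if image.shape != saliency.shape:
        raise ValueError("image and saliency must have the same shape")
    if not 0 <= k <= image.size:
        raise ValueError("k must be between 0 and the number of pixels")
    # Indices of the k largest saliency scores; a stable sort makes ties deterministic.
    keep = np.argsort(-saliency.ravel(), kind="stable")[:k]
    masked = np.full(image.size, fill_value, dtype=float)
    masked[keep] = image.ravel()[keep]
    return masked.reshape(image.shape)


def output_preserved(predict, image, saliency, k, fill_value=0.0):
    """Return True if the predicted label is the same on the original and masked input."""
    masked = mask_top_k(image, saliency, k, fill_value)
    return int(np.argmax(predict(image))) == int(np.argmax(predict(masked)))


# Hypothetical toy "net": a fixed linear classifier over a 2x2 image.
weights = np.array([[1.0, 0.0, 0.0, 2.0],
                    [0.0, 1.0, 1.0, 0.0]])


def toy_predict(x):
    return weights @ x.ravel()


image = np.ones((2, 2))
good_saliency = np.array([[0.2, 0.1], [0.05, 0.9]])  # ranks the decisive pixel first
bad_saliency = np.array([[0.0, 0.9], [0.8, 0.0]])    # ranks pixels favoring the other class

kept_good = output_preserved(toy_predict, image, good_saliency, k=1)
kept_bad = output_preserved(toy_predict, image, bad_saliency, k=2)
```

In this toy setup, `kept_good` is `True` because the single retained pixel is enough to reproduce the original label, while `kept_bad` is `False` because the retained pixels push the classifier to the other class. A check of this kind addresses only the completeness side that the abstract describes; it says nothing about soundness.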

Comments:	NeurIPS 2022 (Oral)
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2211.02912 [stat.ML]
	(or arXiv:2211.02912v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2211.02912

Submission history

From: Arushi Gupta
[v1] Sat, 5 Nov 2022 14:04:59 UTC (14,892 KB)
