Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Samek, Wojciech; Wiegand, Thomas; Müller, Klaus-Robert

arXiv:1708.08296 (cs)

[Submitted on 28 Aug 2017]

Abstract: With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this development can be found in domains such as image classification, sentiment analysis, speech understanding or strategic game playing. However, because of their nested non-linear structure, these highly successful machine learning and artificial intelligence models are usually applied in a black box manner, i.e., no information is provided about what exactly makes them arrive at their predictions. Since this lack of transparency can be a major drawback, e.g., in medical applications, the development of methods for visualizing, explaining and interpreting deep learning models has recently attracted increasing attention. This paper summarizes recent developments in this field and makes a plea for more interpretability in artificial intelligence. Furthermore, it presents two approaches to explaining predictions of deep learning models, one method which computes the sensitivity of the prediction with respect to changes in the input and one approach which meaningfully decomposes the decision in terms of the input variables. These methods are evaluated on three classification tasks.
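
To make the distinction between the two kinds of explanation concrete, the following is a minimal sketch on a toy linear model. It is not the authors' implementation: the weights, the input, and the function names (`score`, `sensitivity`, `decomposition`) are hypothetical, the gradient is approximated with finite differences, and the decomposition shown is the trivial one available for a bias-free linear model rather than the method the paper applies to deep networks.

```python
# Hypothetical toy model (an assumption, not taken from the paper):
# a bias-free linear score over three input variables.
WEIGHTS = [2.0, -1.0, 0.5]


def score(x):
    """Linear score f(x) = sum_i w_i * x_i."""
    return sum(w * xi for w, xi in zip(WEIGHTS, x))


def sensitivity(f, x, eps=1e-6):
    """Squared partial derivatives of f at x, via central finite differences.

    A large value means a small change in that input would change the
    prediction strongly.
    """
    result = []
    for i in range(len(x)):
        up = list(x)
        up[i] += eps
        down = list(x)
        down[i] -= eps
        grad_i = (f(up) - f(down)) / (2 * eps)
        result.append(grad_i ** 2)
    return result


def decomposition(x):
    """Per-input contributions w_i * x_i.

    For this linear model the contributions sum exactly to score(x).
    """
    return [w * xi for w, xi in zip(WEIGHTS, x)]


x = [1.0, 3.0, 0.0]
sens = sensitivity(score, x)
contrib = decomposition(x)

print("score:        ", score(x))
print("sensitivity:  ", [round(s, 4) for s in sens])
print("contributions:", contrib)
```

In this toy setting the third input contributes nothing to the score because its value is zero, yet its sensitivity is non-zero: sensitivity describes how the prediction would respond to changes in the input, while the decomposition accounts for the prediction actually made.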

Comments:	8 pages, 2 figures
Subjects:	Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1708.08296 [cs.AI]
	(or arXiv:1708.08296v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1708.08296

Submission history

From: Wojciech Samek
[v1] Mon, 28 Aug 2017 12:53:49 UTC (887 KB)
