TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Bewley, Tom; Lawry, Jonathan

Computer Science > Artificial Intelligence

arXiv:2009.04743 (cs)

[Submitted on 10 Sep 2020 (v1), last revised 21 Sep 2020 (this version, v2)]

Title:TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Authors:Tom Bewley, Jonathan Lawry

View PDF

Abstract:In explainable artificial intelligence, there is increasing interest in understanding the behaviour of autonomous agents to build trust and validate performance. Modern agent architectures, such as those trained by deep reinforcement learning, are currently so lacking in interpretable structure as to effectively be black boxes, but insights may still be gained from an external, behaviourist perspective. Inspired by conceptual spaces theory, we suggest that a versatile first step towards general understanding is to discretise the state space into convex regions, jointly capturing similarities over the agent's action, value function and temporal dynamics within a dataset of observations. We create such a representation using a novel variant of the CART decision tree algorithm, and demonstrate how it facilitates practical understanding of black box agents through prediction, visualisation and rule-based explanation.

Comments:	12 pages (incl. references and appendices), 15 figures. Pre-print, under review
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2009.04743 [cs.AI]
	(or arXiv:2009.04743v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2009.04743

Submission history

From: Tom Bewley [view email]
[v1] Thu, 10 Sep 2020 09:22:27 UTC (9,073 KB)
[v2] Mon, 21 Sep 2020 16:06:19 UTC (11,274 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jonathan Lawry

export BibTeX citation

Computer Science > Artificial Intelligence

Title:TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators