Approximate Exploration through State Abstraction

Taïga, Adrien Ali; Courville, Aaron; Bellemare, Marc G.

arXiv:1808.09819 (cs)

[Submitted on 29 Aug 2018 (v1), last revised 24 Jan 2019 (this version, v2)]

Abstract: Although exploration in reinforcement learning is well understood from a theoretical point of view, provably correct methods remain impractical. In this paper we study the interplay between exploration and approximation, which we call approximate exploration. Our main goal is to further our theoretical understanding of pseudo-count based exploration bonuses (Bellemare et al., 2016), a practical exploration scheme based on density modelling. As a warm-up, we quantify the performance of an exploration algorithm, MBIE-EB (Strehl and Littman, 2008), when explicitly combined with state aggregation. This allows us to confirm that, as might be expected, approximation allows the agent to trade off between learning speed and quality of the learned policy. Next, we show how a given density model can be related to an abstraction and that the corresponding pseudo-count bonus can act as a substitute in MBIE-EB combined with this abstraction, but may lead to either under- or over-exploration. Then, we show that a given density model also defines an implicit abstraction, and find a surprising mismatch between pseudo-counts derived either implicitly or explicitly. Finally, we derive a new pseudo-count bonus alleviating this issue.
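The two ingredients named in the abstract can be illustrated with a minimal sketch. It uses the pseudo-count definition of Bellemare et al. (2016), N = rho (1 - rho') / (rho' - rho), where rho is the density assigned to a state before observing it once more and rho' the density after, together with a bonus of the form beta / sqrt(count) in the style of MBIE-EB. Everything else is an assumption made for illustration: the aggregation function `aggregate`, its `width`, the constant `beta`, the list `visited`, and the choice of an empirical (frequency) density model over aggregated states. The bonus here is computed per state rather than per state-action pair, and the sketch does not reproduce the paper's analysis or its new bonus.

```python
import math
from collections import Counter
from fractions import Fraction


def pseudo_count(rho, rho_prime):
    """Pseudo-count rho * (1 - rho') / (rho' - rho) (Bellemare et al., 2016)."""
    if rho_prime <= rho:
        # The formula needs the density to increase after one more observation.
        raise ValueError("requires rho_prime > rho")
    return rho * (1 - rho_prime) / (rho_prime - rho)


def exploration_bonus(count, beta=1.0):
    """Bonus of the form beta / sqrt(count); beta is an illustrative constant."""
    return beta / math.sqrt(count)


def aggregate(state, width=2):
    """Hypothetical abstraction: bucket integer states into groups of `width`."""
    return state // width


def empirical_density(counts, total, key):
    """Empirical density of `key`, before and after one extra observation of it."""
    rho = Fraction(counts[key], total)
    rho_prime = Fraction(counts[key] + 1, total + 1)
    return rho, rho_prime


# Made-up visitation history over integer states.
visited = [0, 1, 1, 2, 5, 5, 5, 4]
abstract_counts = Counter(aggregate(s) for s in visited)

# Pseudo-count and bonus for the aggregate that contains state 1.
rho, rho_prime = empirical_density(abstract_counts, len(visited), aggregate(1))
n_hat = pseudo_count(rho, rho_prime)
bonus = exploration_bonus(float(n_hat))
```

With an empirical density model over aggregated states, the pseudo-count equals the visit count of the aggregate (here, states 0 and 1 share one count), so the bonus is the one a count-based scheme would assign to that aggregate. Other density models need not behave this way.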

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1808.09819 [cs.LG]
	(or arXiv:1808.09819v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1808.09819

Submission history

From: Adrien Ali Taiga
[v1] Wed, 29 Aug 2018 13:41:33 UTC (505 KB)
[v2] Thu, 24 Jan 2019 17:18:53 UTC (1,635 KB)
