Approximate Exploration through State Abstraction

Taïga, Adrien Ali; Courville, Aaron; Bellemare, Marc G.

Computer Science > Machine Learning

arXiv:1808.09819v1 (cs)

[Submitted on 29 Aug 2018 (this version), latest version 24 Jan 2019 (v2)]

Title:Approximate Exploration through State Abstraction

Authors:Adrien Ali Taïga, Aaron Courville, Marc G. Bellemare

View PDF

Abstract:Although exploration in reinforcement learning is well understood from a theoretical point of view, provably correct methods remain impractical. In this paper we study the interplay between exploration and approximation, what we call \emph{approximate exploration}. We first provide results when the approximation is explicit, quantifying the performance of an exploration algorithm, MBIE-EB \citep{strehl2008analysis}, when combined with state aggregation. In particular, we show that this allows the agent to trade off between learning speed and quality of the policy learned. We then turn to a successful exploration scheme in practical, pseudo-count based exploration bonuses \citep{bellemare2016unifying}. We show that choosing a density model implicitly defines an abstraction and that the pseudo-count bonus incentivizes the agent to explore using this abstraction. We find, however, that implicit exploration may result in a mismatch between the approximated value function and exploration bonus, leading to either under- or over-exploration.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1808.09819 [cs.LG]
	(or arXiv:1808.09819v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1808.09819

Submission history

From: Adrien Ali Taiga [view email]
[v1] Wed, 29 Aug 2018 13:41:33 UTC (505 KB)
[v2] Thu, 24 Jan 2019 17:18:53 UTC (1,635 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-08

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Adrien Ali Taïga
Aaron C. Courville
Marc G. Bellemare

export BibTeX citation

Computer Science > Machine Learning

Title:Approximate Exploration through State Abstraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Approximate Exploration through State Abstraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators