Directed Exploration in PAC Model-Free Reinforcement Learning

Oh, Min-hwan; Iyengar, Garud

Computer Science > Machine Learning

arXiv:1808.10552 (cs)

[Submitted on 31 Aug 2018]

Title:Directed Exploration in PAC Model-Free Reinforcement Learning

Authors:Min-hwan Oh, Garud Iyengar

View PDF

Abstract:We study an exploration method for model-free RL that generalizes the counter-based exploration bonus methods and takes into account long term exploratory value of actions rather than a single step look-ahead. We propose a model-free RL method that modifies Delayed Q-learning and utilizes the long-term exploration bonus with provable efficiency. We show that our proposed method finds a near-optimal policy in polynomial time (PAC-MDP), and also provide experimental evidence that our proposed algorithm is an efficient exploration method.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1808.10552 [cs.LG]
	(or arXiv:1808.10552v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1808.10552

Submission history

From: Min-Hwan Oh [view email]
[v1] Fri, 31 Aug 2018 00:00:22 UTC (494 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-08

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Min-hwan Oh
Garud Iyengar

export BibTeX citation

Computer Science > Machine Learning

Title:Directed Exploration in PAC Model-Free Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Directed Exploration in PAC Model-Free Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators