Beyond the One Step Greedy Approach in Reinforcement Learning

Efroni, Yonathan; Dalal, Gal; Scherrer, Bruno; Mannor, Shie

arXiv:1802.03654 (cs)

[Submitted on 10 Feb 2018 (v1), last revised 30 Jul 2018 (this version, v3)]

Abstract: The famous Policy Iteration algorithm alternates between policy improvement and policy evaluation. Implementations of this algorithm with several variants of the latter evaluation stage, e.g., $n$-step and trace-based returns, have been analyzed in previous works. However, the case of multiple-step lookahead policy improvement, despite the recent increase in empirical evidence of its strength, has to our knowledge not been carefully analyzed yet. In this work, we introduce the first such analysis. Namely, we formulate variants of multiple-step policy improvement, derive new algorithms using these definitions and prove their convergence. Moreover, we show that recent prominent Reinforcement Learning algorithms are, in fact, instances of our framework. We thus shed light on their empirical success and give a recipe for deriving new algorithms for future study.
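The alternation described in the abstract can be illustrated with a minimal sketch. It assumes a small tabular MDP with known dynamics and takes the $h$-step greedy policy with respect to a value $v$ to be the first action of an optimal $h$-horizon plan whose terminal value is $v$ (equivalently, the one-step greedy policy after $h-1$ Bellman optimality backups of $v$). All names here (`backup`, `h_greedy_policy`, `evaluate_policy`, `h_policy_iteration`) and the toy MDP are hypothetical and chosen for illustration only; this is not the authors' code.

```python
def backup(P, R, gamma, v, s, a):
    """One-step return of action a in state s, bootstrapping from v."""
    return R[s][a] + gamma * sum(p * v[t] for t, p in enumerate(P[s][a]))

def bellman_optimality(P, R, gamma, v):
    """Apply the Bellman optimality operator once to v."""
    return [max(backup(P, R, gamma, v, s, a) for a in range(len(R[s])))
            for s in range(len(R))]

def h_greedy_policy(P, R, gamma, v, h):
    """h-step greedy policy: h-1 optimality backups, then a one-step argmax."""
    for _ in range(h - 1):
        v = bellman_optimality(P, R, gamma, v)
    return [max(range(len(R[s])), key=lambda a: backup(P, R, gamma, v, s, a))
            for s in range(len(R))]

def evaluate_policy(P, R, gamma, pi, tol=1e-12):
    """Iterative policy evaluation until successive sweeps differ by < tol."""
    v = [0.0] * len(R)
    while True:
        new_v = [backup(P, R, gamma, v, s, pi[s]) for s in range(len(R))]
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v

def h_policy_iteration(P, R, gamma, h, max_iters=100):
    """Alternate evaluation and h-step greedy improvement until pi is stable."""
    pi = [0] * len(R)
    for _ in range(max_iters):
        v = evaluate_policy(P, R, gamma, pi)
        new_pi = h_greedy_policy(P, R, gamma, v, h)
        if new_pi == pi:
            break
        pi = new_pi
    return pi, evaluate_policy(P, R, gamma, pi)

# Hypothetical 3-state, 2-action chain: P[s][a] is a distribution over next
# states, R[s][a] the immediate reward. Only staying in state 2 pays off.
P = [
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
    [[0.0, 0.0, 1.0], [1.0, 0.0, 0.0]],
]
R = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0]]
GAMMA = 0.9

policy, values = h_policy_iteration(P, R, GAMMA, h=3)
```

Setting `h=1` recovers standard Policy Iteration with a one-step greedy improvement; larger `h` spends more computation per improvement step in exchange for a deeper lookahead.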

Comments:	ICML 2018
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1802.03654 [cs.AI]
	(or arXiv:1802.03654v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1802.03654

Submission history

From: Jonathan Efroni
[v1] Sat, 10 Feb 2018 22:22:03 UTC (68 KB)
[v2] Sat, 2 Jun 2018 17:19:20 UTC (72 KB)
[v3] Mon, 30 Jul 2018 18:02:11 UTC (2,085 KB)
