Scaling Up Robust MDPs by Reinforcement Learning

Tamar, Aviv; Xu, Huan; Mannor, Shie

arXiv:1306.6189v1 (cs)

[Submitted on 26 Jun 2013]

Abstract: We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handling uncertainty, can be solved using dynamic programming for small- to medium-sized problems. However, due to the "curse of dimensionality", MDPs that model real-life problems are typically prohibitively large for such approaches. In this work we employ a reinforcement learning approach to tackle this planning problem: we develop a robust approximate dynamic programming method, based on a projected fixed-point equation, to approximately solve large-scale robust MDPs. We show that the proposed method provably succeeds under certain technical conditions, and demonstrate its effectiveness through simulation of an option pricing problem. To the best of our knowledge, this is the first attempt to scale up the robust MDP paradigm.
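
To make the minimax dynamic programming baseline mentioned in the abstract concrete, here is a minimal sketch of exact robust value iteration on a toy problem. It is not the authors' approximate method. It assumes a discounted-reward MDP in which each state-action pair has a finite set of candidate transition distributions; the function names, data layout, and toy numbers are all hypothetical.

```python
from typing import List, Sequence

Distribution = Sequence[float]


def worst_case_expectation(candidates: Sequence[Distribution],
                           v: Sequence[float]) -> float:
    """Smallest expected next-state value over a finite set of candidates."""
    return min(sum(p * v_next for p, v_next in zip(dist, v))
               for dist in candidates)


def robust_value_iteration(reward, uncertainty, gamma,
                           tol=1e-10, max_iter=10_000) -> List[float]:
    """Iterate V(s) <- max_a [ r(s,a) + gamma * min_p sum_s' p(s') V(s') ].

    reward[s][a] is a float; uncertainty[s][a] is a list of candidate
    next-state distributions (an assumed, simplified uncertainty model).
    """
    n_states = len(reward)
    v = [0.0] * n_states
    for _ in range(max_iter):
        new_v = [
            max(
                reward[s][a]
                + gamma * worst_case_expectation(uncertainty[s][a], v)
                for a in range(len(reward[s]))
            )
            for s in range(n_states)
        ]
        # Stop once successive iterates agree to within the tolerance.
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


# Hypothetical toy problem: state 0 pays reward 1 per step, state 1 is an
# absorbing state with reward 0. Under uncertainty, the single action in
# state 0 may either stay in state 0 or jump to state 1.
reward = [[1.0], [0.0]]
uncertain = [[[[1.0, 0.0], [0.0, 1.0]]], [[[0.0, 1.0]]]]
nominal = [[[[1.0, 0.0]]], [[[0.0, 1.0]]]]

robust_v = robust_value_iteration(reward, uncertain, gamma=0.9)
nominal_v = robust_value_iteration(reward, nominal, gamma=0.9)
print("robust:", robust_v)
print("nominal:", nominal_v)
```

In this toy case the worst-case model sends state 0 straight to the absorbing state, so its robust value is 1, whereas the nominal model gives about 1 / (1 - 0.9) = 10. Each sweep visits every state, action, and candidate distribution, which is the cost that becomes prohibitive for the large state spaces the abstract is concerned with.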

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1306.6189 [cs.LG]
	(or arXiv:1306.6189v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1306.6189

Submission history

From: Aviv Tamar
[v1] Wed, 26 Jun 2013 09:52:51 UTC (37 KB)
