Robust and Adaptive Planning under Model Uncertainty

Sharma, Apoorva; Harrison, James; Tsao, Matthew; Pavone, Marco

arXiv:1901.02577 (cs)

[Submitted on 9 Jan 2019]

Abstract: Planning under model uncertainty is a fundamental problem across many applications of decision making and learning. In this paper, we propose the Robust Adaptive Monte Carlo Planning (RAMCP) algorithm, which allows computation of risk-sensitive Bayes-adaptive policies that optimally trade off exploration, exploitation, and robustness. RAMCP formulates the risk-sensitive planning problem as a two-player zero-sum game, in which an adversary perturbs the agent's belief over the models. We introduce two versions of the RAMCP algorithm. The first, RAMCP-F, converges to an optimal risk-sensitive policy without having to rebuild the search tree as the underlying belief over models is perturbed. The second version, RAMCP-I, improves computational efficiency at the cost of losing theoretical guarantees, but is shown to yield empirical results comparable to RAMCP-F. RAMCP is demonstrated on an n-pull multi-armed bandit problem, as well as a patient treatment scenario.
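
The two-player framing in the abstract can be illustrated with a small sketch. It is not the RAMCP algorithm: there is no tree search and no Bayes-adaptive belief update, only the inner game between a policy choice and a belief-perturbing adversary over a finite set of models. The abstract does not say which perturbations the adversary may apply, so the sketch assumes a CVaR-style envelope in which each model's weight can grow to at most belief[i] / alpha. The function names, the payoff numbers, and the parameter alpha are all hypothetical.

```python
def adversarial_belief(belief, values, alpha):
    """Worst-case reweighting of `belief` for a fixed policy.

    Assumption (not from the abstract): the adversary may raise each model's
    weight to at most belief[i] / alpha, with weights summing to 1. This is
    the dual form of CVaR at level alpha.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    # The adversary fills the lowest-value models first, up to their caps.
    order = sorted(range(len(values)), key=lambda i: values[i])
    perturbed = [0.0] * len(belief)
    remaining = 1.0
    for i in order:
        weight = min(belief[i] / alpha, remaining)
        perturbed[i] = weight
        remaining -= weight
        if remaining <= 0.0:
            break
    return perturbed


def robust_value(belief, values, alpha):
    """Expected value of a policy under the adversary's perturbed belief."""
    perturbed = adversarial_belief(belief, values, alpha)
    return sum(q * v for q, v in zip(perturbed, values))


def best_policy(belief, payoff, alpha):
    """Name of the policy that maximizes the worst-case value.

    `payoff` maps a policy name to its per-model expected returns.
    """
    return max(payoff, key=lambda name: robust_value(belief, payoff[name], alpha))


# Hypothetical numbers: three candidate models and two candidate policies.
belief = [0.5, 0.3, 0.2]
payoff = {
    "aggressive": [10.0, 4.0, 0.0],  # strong under model 0, poor under model 2
    "cautious": [5.0, 5.0, 4.0],     # similar return under every model
}

risk_neutral_choice = best_policy(belief, payoff, alpha=1.0)
risk_averse_choice = best_policy(belief, payoff, alpha=0.5)
```

With alpha = 1 the adversary cannot move the belief at all and the choice reduces to maximizing expected return; lowering alpha gives the adversary more room, and the preferred policy shifts toward the one whose return varies least across models.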

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1901.02577 [cs.AI]
	(or arXiv:1901.02577v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1901.02577

Submission history

From: Apoorva Sharma
[v1] Wed, 9 Jan 2019 01:32:43 UTC (998 KB)
