Calibrated Model-Based Deep Reinforcement Learning

Malik, Ali; Kuleshov, Volodymyr; Song, Jiaming; Nemer, Danny; Seymour, Harlan; Ermon, Stefano

arXiv:1906.08312 (cs)

[Submitted on 19 Jun 2019]

Abstract: Estimates of predictive uncertainty are important for accurate model-based planning and reinforcement learning. However, predictive uncertainties, especially ones derived from modern deep learning systems, can be inaccurate and impose a bottleneck on performance. This paper explores which uncertainties are needed for model-based reinforcement learning and argues that good uncertainties must be calibrated, i.e., their probabilities should match empirical frequencies of predicted events. We describe a simple way to augment any model-based reinforcement learning agent with a calibrated model and show that doing so consistently improves planning, sample complexity, and exploration. On the HalfCheetah MuJoCo task, our system achieves state-of-the-art performance using 50% fewer samples than the current leading approach. Our findings suggest that calibration can improve the performance of model-based reinforcement learning with minimal computational and implementation overhead.
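To make the abstract's notion of calibration concrete (predicted probabilities matching empirical frequencies), the following is a minimal sketch that checks calibration for a one-dimensional probabilistic forecast. It is an illustration only, not the paper's method: the Gaussian forecast, the synthetic data, and the helper names `gaussian_cdf`, `calibration_curve`, and `calibration_error` are all assumptions introduced here.

```python
import math
import random


def gaussian_cdf(y, mean, std):
    """CDF of a normal distribution N(mean, std^2) evaluated at y."""
    return 0.5 * (1.0 + math.erf((y - mean) / (std * math.sqrt(2.0))))


def calibration_curve(cdf_values, levels):
    """For each level p, the fraction of outcomes whose predicted CDF value is <= p."""
    n = len(cdf_values)
    return [sum(v <= p for v in cdf_values) / n for p in levels]


def calibration_error(cdf_values, levels):
    """Mean squared gap between nominal levels and empirical frequencies."""
    freqs = calibration_curve(cdf_values, levels)
    return sum((f - p) ** 2 for f, p in zip(freqs, levels)) / len(levels)


# Synthetic outcomes drawn from N(0, 1); hypothetical data for illustration.
rng = random.Random(0)
outcomes = [rng.gauss(0.0, 1.0) for _ in range(5000)]
levels = [i / 10 for i in range(1, 10)]

# A forecaster that reports the correct spread versus an overconfident one
# that reports a standard deviation that is too small.
calibrated = [gaussian_cdf(y, 0.0, 1.0) for y in outcomes]
overconfident = [gaussian_cdf(y, 0.0, 0.3) for y in outcomes]

err_calibrated = calibration_error(calibrated, levels)
err_overconfident = calibration_error(overconfident, levels)
print(err_calibrated, err_overconfident)
```

For the well-specified forecaster, the empirical frequency at each level p stays close to p, so the error is near zero. The overconfident forecaster places too little mass in the tails, so its empirical frequencies drift away from the nominal levels and the error grows.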

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1906.08312 [cs.LG]
	(or arXiv:1906.08312v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.08312
Journal reference:	Proceedings of the 36th International Conference on Machine Learning, PMLR 97:4314-4323, 2019

Submission history

From: Ali Malik
[v1] Wed, 19 Jun 2019 19:10:26 UTC (1,066 KB)
