Variational Inference MPC for Bayesian Model-based Reinforcement Learning

Okada, Masashi; Taniguchi, Tadahiro

arXiv:1907.04202 (cs)

[Submitted on 8 Jul 2019 (v1), last revised 7 Oct 2019 (this version, v2)]

Abstract: In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy for improving learning performance, making MBRL competitive with cutting-edge model-free methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading MBRL method that applies Bayesian inference to dynamics modeling and performs model predictive control (MPC) with stochastic optimization via the cross entropy method (CEM). In this paper, we propose a novel extension to uncertainty-aware MBRL. Our main contributions are twofold. First, we introduce a variational inference MPC, which reformulates various stochastic methods, including CEM, in a Bayesian fashion. Second, we propose a novel instance of the framework, called probabilistic action ensembles with trajectory sampling (PaETS). As a result, our Bayesian MBRL can capture multimodal uncertainties in both dynamics and optimal trajectories. In comparison to PETS, our method consistently improves asymptotic performance on several challenging locomotion tasks.
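
For readers unfamiliar with the CEM planner that the abstract names as the optimizer in PETS, the following is a minimal sketch of the cross entropy method applied to open-loop action sequences. It is not the paper's variational inference MPC or PaETS, and it does not use a learned probabilistic ensemble. The 1-D dynamics `x' = x + a`, the quadratic cost, the action bounds, and all function and parameter names (`rollout_cost`, `cem_plan`, `n_elite`, and so on) are assumptions made for illustration only.

```python
import random
import statistics


def rollout_cost(x0, actions, goal):
    """Total squared distance to goal under toy dynamics x' = x + a (an assumption)."""
    x, cost = x0, 0.0
    for a in actions:
        x = x + a
        cost += (x - goal) ** 2
    return cost


def cem_plan(x0, goal, horizon=5, n_samples=200, n_elite=20, n_iters=10, seed=0):
    """Cross entropy method over action sequences of length `horizon`.

    Maintains an independent Gaussian per time step, samples candidate
    sequences, and refits the mean and standard deviation to the
    lowest-cost (elite) candidates at each iteration.
    """
    rng = random.Random(seed)
    mean = [0.0] * horizon
    std = [1.0] * horizon
    for _ in range(n_iters):
        samples = []
        for _ in range(n_samples):
            # Sample one sequence and clip each action to the assumed bounds [-1, 1].
            seq = [max(-1.0, min(1.0, rng.gauss(m, s))) for m, s in zip(mean, std)]
            samples.append(seq)
        # Rank candidates by cost and keep the elite set.
        samples.sort(key=lambda seq: rollout_cost(x0, seq, goal))
        elite = samples[:n_elite]
        # Refit the sampling distribution to the elites; the small constant
        # keeps the standard deviation strictly positive.
        mean = [statistics.fmean([seq[t] for seq in elite]) for t in range(horizon)]
        std = [statistics.pstdev([seq[t] for seq in elite]) + 1e-6 for t in range(horizon)]
    return mean


plan = cem_plan(x0=0.0, goal=2.0)
print([round(a, 2) for a in plan])
```

In an MPC loop, only the first action of the returned plan would be executed before replanning from the next state. Note that this sketch fits a single unimodal Gaussian per time step, which is the kind of restriction the abstract's mention of multimodal uncertainty in optimal trajectories is contrasted against.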

Comments:	Accepted to CoRL2019. Camera-ready ver
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
Cite as:	arXiv:1907.04202 [cs.LG]
	(or arXiv:1907.04202v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1907.04202

Submission history

From: Masashi Okada Dr
[v1] Mon, 8 Jul 2019 01:54:08 UTC (4,047 KB)
[v2] Mon, 7 Oct 2019 01:03:08 UTC (2,153 KB)
