Safe Reinforcement Learning with Model Uncertainty Estimates

Lütjens, Björn; Everett, Michael; How, Jonathan P.

Computer Science > Robotics

arXiv:1810.08700 (cs)

[Submitted on 19 Oct 2018 (v1), last revised 1 Mar 2019 (this version, v2)]

Title:Safe Reinforcement Learning with Model Uncertainty Estimates

Authors:Björn Lütjens, Michael Everett, Jonathan P. How

View PDF

Abstract:Many current autonomous systems are being designed with a strong reliance on black box predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in predictions on unseen data and can give unpredictable results for far-from-distribution test data. The importance of predictions that are robust to this distributional shift is evident for safety-critical applications, such as collision avoidance around pedestrians. Measures of model uncertainty can be used to identify unseen data, but the state-of-the-art extraction methods such as Bayesian neural networks are mostly intractable to compute. This paper uses MC-Dropout and Bootstrapping to give computationally tractable and parallelizable uncertainty estimates. The methods are embedded in a Safe Reinforcement Learning framework to form uncertainty-aware navigation around pedestrians. The result is a collision avoidance policy that knows what it does not know and cautiously avoids pedestrians that exhibit unseen behavior. The policy is demonstrated in simulation to be more robust to novel observations and take safer actions than an uncertainty-unaware baseline.

Comments:	ICRA 2019; Presented at IROS 2018 Workshop on Machine Learning in Robot Motion Planning
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1810.08700 [cs.RO]
	(or arXiv:1810.08700v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1810.08700

Submission history

From: Björn Lütjens [view email]
[v1] Fri, 19 Oct 2018 22:04:59 UTC (2,976 KB)
[v2] Fri, 1 Mar 2019 05:03:11 UTC (2,496 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Björn Lütjens
Michael Everett
Jonathan P. How

export BibTeX citation

Computer Science > Robotics

Title:Safe Reinforcement Learning with Model Uncertainty Estimates

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Safe Reinforcement Learning with Model Uncertainty Estimates

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators