Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Zhang, Jesse; Cheung, Brian; Finn, Chelsea; Levine, Sergey; Jayaraman, Dinesh

arXiv:2008.06622 (cs)

[Submitted on 15 Aug 2020]


Abstract: Reinforcement learning (RL) in real-world safety-critical target settings like urban driving is hazardous, imperiling the RL agent, other agents, and the environment. To overcome this difficulty, we propose a "safety-critical adaptation" task setting: an agent first trains in non-safety-critical "source" environments such as in a simulator, before it adapts to the target environment where failures carry heavy costs. We propose a solution approach, CARL, that builds on the intuition that prior experience in diverse environments equips an agent to estimate risk, which in turn enables relative safety through risk-averse, cautious adaptation. CARL first employs model-based RL to train a probabilistic model to capture uncertainty about transition dynamics and catastrophic states across varied source environments. Then, when exploring a new safety-critical environment with unknown dynamics, the CARL agent plans to avoid actions that could lead to catastrophic states. In experiments on car driving, cartpole balancing, half-cheetah locomotion, and robotic object manipulation, CARL successfully acquires cautious exploration behaviors, yielding higher rewards with fewer failures than strong RL adaptation baselines. A project website accompanies the paper.
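
The planning idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the 1-D dynamics, the ensemble of gain values (standing in for a learned probabilistic model), the `CATASTROPHE_PENALTY` constant, and the worst-case aggregation rule are all assumptions chosen for illustration. The sketch only shows how scoring candidate plans by their worst outcome across plausible dynamics can yield more cautious choices than scoring by the average outcome.

```python
from statistics import mean

CATASTROPHE_PENALTY = 2.0  # hypothetical cost assigned to a catastrophic state


def make_model(gain):
    """Return a toy 1-D dynamics model: next_state = state + gain * action."""
    def step(state, action):
        return state + gain * action
    return step


def rollout_return(model, state, actions, limit):
    """Sum forward progress along a plan; penalize and stop on catastrophe."""
    total = 0.0
    for action in actions:
        next_state = model(state, action)
        if next_state > limit:  # catastrophic state: past the safe limit
            return total - CATASTROPHE_PENALTY
        total += next_state - state
        state = next_state
    return total


def plan(ensemble, state, candidates, limit, cautious=True):
    """Pick the plan with the best worst-case (cautious) or mean return."""
    aggregate = min if cautious else mean

    def score(actions):
        returns = [rollout_return(m, state, actions, limit) for m in ensemble]
        return aggregate(returns)

    return max(candidates, key=score)


# Assumed ensemble: each member is one plausible version of the unknown dynamics.
ensemble = [make_model(g) for g in (0.5, 1.0, 1.0, 1.5)]
candidates = [(1.0, 1.0), (1.0, 0.0), (0.0, 0.0)]

cautious_plan = plan(ensemble, 0.0, candidates, limit=2.5, cautious=True)
neutral_plan = plan(ensemble, 0.0, candidates, limit=2.5, cautious=False)
print("cautious:", cautious_plan, "risk-neutral:", neutral_plan)
```

In this toy setup, the risk-neutral planner accepts the aggressive plan because it looks best on average, while the cautious planner rejects it because one plausible dynamics model sends the state past the limit.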

Comments:	15 pages, 8 figures, ICML 2020. Website with code available.
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2008.06622 [cs.LG]
	(or arXiv:2008.06622v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2008.06622
Journal reference:	Proceedings of the 37th International Conference on Machine Learning, PMLR 119:11055-11065, 2020

Submission history

From: Jesse Zhang
[v1] Sat, 15 Aug 2020 01:40:59 UTC (17,166 KB)
