Including Uncertainty when Learning from Human Corrections

Losey, Dylan P.; O'Malley, Marcia K.

arXiv:1806.02454 (cs)

[Submitted on 6 Jun 2018 (v1), last revised 13 Sep 2018 (this version, v2)]

Abstract: It is difficult for humans to efficiently teach robots how to correctly perform a task. One intuitive solution is for the robot to iteratively learn the human's preferences from corrections, where the human improves the robot's current behavior at each iteration. When learning from corrections, we argue that while the robot should estimate the most likely human preferences, it should also know what it does not know, and integrate this uncertainty as it makes decisions. We advance the state-of-the-art by introducing a Kalman filter for learning from corrections: this approach obtains the uncertainty of the estimated human preferences. Next, we demonstrate how the estimate uncertainty can be leveraged for active learning and risk-sensitive deployment. Our results indicate that obtaining and leveraging uncertainty leads to faster learning from human corrections.
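To make the idea of a Kalman filter that tracks both an estimate and its uncertainty concrete, here is a minimal sketch of a generic Kalman measurement update. It is not taken from the paper: the observation model (each correction yields a scalar measurement `z = h . theta + noise`), the function name `kalman_update`, and all numeric values are assumptions chosen for illustration only.

```python
def kalman_update(theta, P, h, z, r):
    """One Kalman measurement update for a scalar observation.

    Assumed (hypothetical) model: z = h . theta + noise, noise variance r.
    theta: current preference estimate (list of floats)
    P:     symmetric covariance of that estimate (list of lists)
    """
    n = len(theta)
    # P @ h; because P is symmetric this also equals (h^T P)^T
    Ph = [sum(P[i][j] * h[j] for j in range(n)) for i in range(n)]
    # Innovation variance: h^T P h + r
    s = sum(h[i] * Ph[i] for i in range(n)) + r
    # Kalman gain: P h / s
    gain = [Ph[i] / s for i in range(n)]
    # Innovation: measured value minus predicted value
    innovation = z - sum(h[i] * theta[i] for i in range(n))
    theta_new = [theta[i] + gain[i] * innovation for i in range(n)]
    # Covariance update: P - K h^T P
    P_new = [[P[i][j] - gain[i] * Ph[j] for j in range(n)] for i in range(n)]
    return theta_new, P_new


# Illustrative values only: two preference weights, unit prior covariance,
# and one measurement that is informative about the first weight alone.
theta, P = kalman_update(
    theta=[0.0, 0.0],
    P=[[1.0, 0.0], [0.0, 1.0]],
    h=[1.0, 0.0],
    z=1.0,
    r=1.0,
)
```

In this toy run the variance of the first weight shrinks while the second stays at its prior value, which is the kind of per-parameter uncertainty that the abstract proposes to use for active learning and risk-sensitive deployment.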

Comments:	Accepted for publication at the 2nd Conference on Robot Learning (CoRL), October 2018
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:1806.02454 [cs.RO]
	(or arXiv:1806.02454v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1806.02454

Submission history

From: Dylan Losey
[v1] Wed, 6 Jun 2018 23:08:09 UTC (861 KB)
[v2] Thu, 13 Sep 2018 05:29:41 UTC (282 KB)
