QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

Kalashnikov, Dmitry; Irpan, Alex; Pastor, Peter; Ibarz, Julian; Herzog, Alexander; Jang, Eric; Quillen, Deirdre; Holly, Ethan; Kalakrishnan, Mrinal; Vanhoucke, Vincent; Levine, Sergey

Computer Science > Machine Learning

arXiv:1806.10293 (cs)

[Submitted on 27 Jun 2018 (v1), last revised 28 Nov 2018 (this version, v3)]

Abstract: In this paper, we study the problem of learning vision-based dynamic manipulation skills using a scalable reinforcement learning approach. We study this problem in the context of grasping, a longstanding challenge in robotic manipulation. In contrast to static learning behaviors that choose a grasp point and then execute the desired grasp, our method enables closed-loop vision-based control, whereby the robot continuously updates its grasp strategy based on the most recent observations to optimize long-horizon grasp success. To that end, we introduce QT-Opt, a scalable self-supervised vision-based reinforcement learning framework that can leverage over 580k real-world grasp attempts to train a deep neural network Q-function with over 1.2M parameters to perform closed-loop, real-world grasping that generalizes to 96% grasp success on unseen objects. Aside from attaining a very high success rate, our method exhibits behaviors that are quite distinct from more standard grasping systems: using only RGB vision-based perception from an over-the-shoulder camera, our method automatically learns regrasping strategies, probes objects to find the most effective grasps, learns to reposition objects and perform other non-prehensile pre-grasp manipulations, and responds dynamically to disturbances and perturbations.
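The closed-loop idea in the abstract (re-observe, re-score candidate actions with a Q-function, act, repeat) can be illustrated with a small toy sketch. Everything here is an assumption made for illustration: the abstract does not say how the action that maximizes the Q-function is found, so this sketch uses a sampling-based cross-entropy maximizer; `toy_q`, `cem_argmax`, `run_episode`, the 2-D "gripper" state, and the step size are hypothetical stand-ins for the learned network, camera images, and robot.

```python
import math
import random
from typing import Callable, List

Vector = List[float]
QFunction = Callable[[Vector, Vector], float]

STEP = 0.5  # hypothetical: how far one unit of action moves the gripper


def toy_q(state: Vector, action: Vector) -> float:
    """Hand-written stand-in for a learned Q-function (assumption).

    state = [gripper_x, gripper_y, target_x, target_y]; the score is the
    negative squared distance to the target after applying the action.
    """
    gx, gy, tx, ty = state
    nx, ny = gx + STEP * action[0], gy + STEP * action[1]
    return -((nx - tx) ** 2 + (ny - ty) ** 2)


def cem_argmax(q_fn: QFunction, state: Vector, action_dim: int,
               rng: random.Random, iterations: int = 3,
               samples: int = 64, elites: int = 6) -> Vector:
    """Approximate argmax_a Q(state, a) over actions in [-1, 1]^action_dim."""
    mean = [0.0] * action_dim
    std = [1.0] * action_dim
    best_action = list(mean)
    best_value = q_fn(state, best_action)
    for _ in range(iterations):
        # Sample candidate actions around the current mean and clip them.
        candidates = [
            [max(-1.0, min(1.0, rng.gauss(m, s))) for m, s in zip(mean, std)]
            for _ in range(samples)
        ]
        scored = sorted(((q_fn(state, a), a) for a in candidates),
                        key=lambda pair: pair[0], reverse=True)
        if scored[0][0] > best_value:
            best_value, best_action = scored[0]
        # Refit the sampling distribution to the highest-scoring candidates.
        top = [a for _, a in scored[:elites]]
        mean = [sum(a[i] for a in top) / elites for i in range(action_dim)]
        std = [
            max(1e-3, math.sqrt(sum((a[i] - mean[i]) ** 2 for a in top) / elites))
            for i in range(action_dim)
        ]
    return best_action


def run_episode(steps: int = 12, seed: int = 0) -> float:
    """Closed loop: re-observe and re-plan at every step; return final distance."""
    rng = random.Random(seed)
    gripper = [0.0, 0.0]
    target = [2.0, -1.5]
    for t in range(steps):
        if t == 5:
            target = [1.0, 1.0]  # disturbance: the object is moved mid-episode
        state = gripper + target  # latest observation
        action = cem_argmax(toy_q, state, action_dim=2, rng=rng)
        gripper = [g + STEP * a for g, a in zip(gripper, action)]
    return math.dist(gripper, target)


if __name__ == "__main__":
    print(f"final distance to target: {run_episode():.3f}")
```

Because the action is recomputed from the latest observation at every step, the toy controller recovers when the target is moved at step 5; an open-loop plan computed once at the start would keep heading for the original target.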

Comments:	CoRL 2018 camera ready. 23 pages, 14 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:1806.10293 [cs.LG]
	(or arXiv:1806.10293v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1806.10293

Submission history

From: Alexander Irpan
[v1] Wed, 27 Jun 2018 04:34:30 UTC (5,156 KB)
[v2] Mon, 2 Jul 2018 19:08:00 UTC (5,156 KB)
[v3] Wed, 28 Nov 2018 02:40:54 UTC (4,660 KB)
