Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search

Zhang, Tianhao; Kahn, Gregory; Levine, Sergey; Abbeel, Pieter

Computer Science > Machine Learning

arXiv:1509.06791 (cs)

[Submitted on 22 Sep 2015 (v1), last revised 16 Feb 2016 (this version, v2)]

Abstract: Model predictive control (MPC) is an effective method for controlling robotic systems, particularly autonomous aerial vehicles such as quadcopters. However, application of MPC can be computationally demanding, and typically requires estimating the state of the system, which can be challenging in complex, unstructured environments. Reinforcement learning can in principle forgo the need for explicit state estimation and acquire a policy that directly maps sensor readings to actions, but is difficult to apply to unstable systems that are liable to fail catastrophically during training before an effective policy has been found. We propose to combine MPC with reinforcement learning in the framework of guided policy search, where MPC is used to generate data at training time, under full state observations provided by an instrumented training environment. This data is used to train a deep neural network policy, which is allowed to access only the raw observations from the vehicle's onboard sensors. After training, the neural network policy can successfully control the robot without knowledge of the full state, and at a fraction of the computational cost of MPC. We evaluate our method by learning obstacle avoidance policies for a simulated quadrotor, using simulated onboard sensors and no explicit state estimation at test time.
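
The data flow described in the abstract (a full-state teacher generates actions at training time, and a policy restricted to sensor readings is fit to them) can be illustrated with a toy sketch. This is not the authors' implementation and omits the guided policy search procedure itself. Everything here is an assumption made for illustration: the 1-D double integrator, the fixed linear feedback controller standing in for MPC, the hypothetical sensor matrix `C`, and the least-squares linear policy standing in for a deep neural network.

```python
import numpy as np

# Hypothetical toy system: a 1-D double integrator, state = [position, velocity].
DT = 0.1
A = np.array([[1.0, DT], [0.0, 1.0]])
B = np.array([0.0, DT])

# Hypothetical sensor model: the policy sees three linear readings, not the state.
C = np.array([[1.0, 0.0], [0.5, 1.0], [-1.0, 2.0]])


def teacher_action(state):
    """Full-state feedback controller standing in for MPC (an assumption)."""
    gain = np.array([2.0, 3.0])
    return -gain @ state


def observe(state):
    """Map the full state to the readings available to the learned policy."""
    return C @ state


def collect_data(num_rollouts=20, horizon=50, seed=0):
    """Roll out the teacher with full state access; record (observation, action)."""
    rng = np.random.default_rng(seed)
    observations, actions = [], []
    for _ in range(num_rollouts):
        state = rng.uniform(-1.0, 1.0, size=2)
        for _ in range(horizon):
            u = teacher_action(state)
            observations.append(observe(state))
            actions.append(u)
            state = A @ state + B * u
    return np.array(observations), np.array(actions)


def fit_policy(observations, actions):
    """Supervised fit of a linear policy (stand-in for a neural network)."""
    weights, *_ = np.linalg.lstsq(observations, actions, rcond=None)
    return weights


def run_policy(weights, state, horizon=200):
    """Control the system using observations only; return the final state."""
    for _ in range(horizon):
        u = observe(state) @ weights
        state = A @ state + B * u
    return state


if __name__ == "__main__":
    obs, acts = collect_data()
    w = fit_policy(obs, acts)
    print("final state under learned policy:", run_policy(w, np.array([1.0, 0.0])))
```

In this sketch the teacher never runs at test time: `run_policy` calls only `observe` and the fitted weights, mirroring the abstract's point that the trained policy operates without knowledge of the full state.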

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1509.06791 [cs.LG]
	(or arXiv:1509.06791v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1509.06791

Submission history

From: Gregory Kahn
[v1] Tue, 22 Sep 2015 21:27:27 UTC (1,109 KB)
[v2] Tue, 16 Feb 2016 06:49:26 UTC (1,132 KB)
