Deep Intrinsically Motivated Continuous Actor-Critic for Efficient Robotic Visuomotor Skill Learning

Hafez, Muhammad Burhan; Weber, Cornelius; Kerzel, Matthias; Wermter, Stefan

doi:10.1515/pjbr-2019-0005

arXiv:1810.11388 (cs)

[Submitted on 26 Oct 2018 (v1), last revised 18 Feb 2019 (this version, v2)]

Abstract: In this paper, we present a new intrinsically motivated actor-critic algorithm for learning continuous motor skills directly from raw visual input. Our neural architecture is composed of a critic and an actor network. Both networks receive the hidden representation of a deep convolutional autoencoder which is trained to reconstruct the visual input, while the centre-most hidden representation is also optimized to estimate the state value. Separately, an ensemble of predictive world models generates, based on its learning progress, an intrinsic reward signal which is combined with the extrinsic reward to guide the exploration of the actor-critic learner. Our approach is more data-efficient and inherently more stable than the existing actor-critic methods for continuous control from pixel data. We evaluate our algorithm for the task of learning robotic reaching and grasping skills on a realistic physics simulator and on a humanoid robot. The results show that the control policies learned with our approach can achieve better performance than the compared state-of-the-art and baseline algorithms in both dense-reward and challenging sparse-reward settings.
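
The abstract does not give the formula for the intrinsic reward, so the following is only a minimal sketch of one common way to turn a world model's learning progress into a reward bonus. It assumes that progress is measured as the drop in mean prediction error between an older and a more recent window, and that the bonus is added to the extrinsic reward with a weight. The names `LearningProgressTracker`, `combined_reward`, `window` and `beta` are hypothetical and not taken from the paper. For an ensemble, one such tracker could be kept per model.

```python
from collections import deque


class LearningProgressTracker:
    """Tracks prediction errors of one world model (illustrative assumption)."""

    def __init__(self, window=4):
        # Hypothetical window size: number of errors in each of the two halves.
        self.window = window
        self.errors = deque(maxlen=2 * window)

    def update(self, prediction_error):
        """Record a new prediction error and return the current learning progress."""
        self.errors.append(float(prediction_error))
        if len(self.errors) < 2 * self.window:
            return 0.0  # not enough history yet
        errs = list(self.errors)
        older = sum(errs[:self.window]) / self.window
        recent = sum(errs[self.window:]) / self.window
        # Progress is positive when the recent mean error is lower than the older one.
        return older - recent


def combined_reward(extrinsic, progress, beta=0.5):
    """Add a scaled, non-negative intrinsic bonus to the extrinsic reward.

    beta is an assumed weighting coefficient, not a value from the paper.
    """
    intrinsic = max(0.0, progress)
    return extrinsic + beta * intrinsic


# Usage: the model's error falls over four steps, so progress becomes positive.
tracker = LearningProgressTracker(window=2)
progress = 0.0
for err in [1.0, 0.5, 0.25, 0.25]:
    progress = tracker.update(err)
total = combined_reward(1.0, progress)  # progress == 0.5, total == 1.25
```

Clipping the bonus at zero is a design choice of this sketch: it rewards the agent only where the model is still improving and gives no penalty where the error temporarily rises.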

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:1810.11388 [cs.LG]
	(or arXiv:1810.11388v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.11388
Journal reference:	Paladyn, Journal of Behavioral Robotics, Volume 10, Issue 1, Pages 14-29, 2019
Related DOI:	https://doi.org/10.1515/pjbr-2019-0005

Submission history

From: Muhammad Burhan Hafez
[v1] Fri, 26 Oct 2018 15:32:32 UTC (1,138 KB)
[v2] Mon, 18 Feb 2019 10:54:46 UTC (2,295 KB)
