Learning Humanoid Robot Running Skills through Proximal Policy Optimization

Melo, Luckeciano C.; Maximo, Marcos R. O. A.

Computer Science > Robotics

arXiv:1910.10620 (cs)

[Submitted on 22 Oct 2019]

Title:Learning Humanoid Robot Running Skills through Proximal Policy Optimization

Authors:Luckeciano C. Melo, Marcos R. O. A. Maximo

View PDF

Abstract:In the current level of evolution of Soccer 3D, motion control is a key factor in team's performance. Recent works takes advantages of model-free approaches based on Machine Learning to exploit robot dynamics in order to obtain faster locomotion skills, achieving running policies and, therefore, opening a new research direction in the Soccer 3D environment.
In this work, we present a methodology based on Deep Reinforcement Learning that learns running skills without any prior knowledge, using a neural network whose inputs are related to robot's dynamics. Our results outperformed the previous state-of-the-art sprint velocity reported in Soccer 3D literature by a significant margin. It also demonstrated improvement in sample efficiency, being able to learn how to run in just few hours.
We reported our results analyzing the training procedure and also evaluating the policies in terms of speed, reliability and human similarity. Finally, we presented key factors that lead us to improve previous results and shared some ideas for future work.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1910.10620 [cs.RO]
	(or arXiv:1910.10620v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1910.10620

Submission history

From: Luckeciano Melo [view email]
[v1] Tue, 22 Oct 2019 13:08:11 UTC (8,395 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2019-10

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Luckeciano Carvalho Melo

export BibTeX citation

Computer Science > Robotics

Title:Learning Humanoid Robot Running Skills through Proximal Policy Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning Humanoid Robot Running Skills through Proximal Policy Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators