Smooth and Efficient Policy Exploration for Robot Trajectory Learning

Li, Shidi; Chew, Chee-Meng; Subramaniam, Velusamy

Computer Science > Robotics

arXiv:1804.04903 (cs)

This paper has been withdrawn by Shidi Li

[Submitted on 13 Apr 2018 (v1), last revised 10 Aug 2018 (this version, v3)]

Title:Smooth and Efficient Policy Exploration for Robot Trajectory Learning

Authors:Shidi Li, Chee-Meng Chew, Velusamy Subramaniam

No PDF available, click to view other formats

Abstract:Many policy search algorithms have been proposed for robot learning and proved to be practical in real robot applications. However, there are still hyperparameters in the algorithms, such as the exploration rate, which requires manual tuning. The existing methods to design the exploration rate manually or automatically may not be general enough or hard to apply in the real robot. In this paper, we propose a learning model to update the exploration rate adaptively. The overall algorithm is a combination of methods proposed by other researchers. Smooth trajectories for the robot can be produced by the algorithm and the updated exploration rate maximizes the lower bound of the expected return. Our method is tested in the ball-in-cup problem. The results show that our method can receive the same learning outcome as the previous methods but with fewer iterations.

Comments:	Disapproval of funding organization
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:1804.04903 [cs.RO]
	(or arXiv:1804.04903v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1804.04903

Submission history

From: Shidi Li [view email]
[v1] Fri, 13 Apr 2018 11:59:28 UTC (1,357 KB)
[v2] Wed, 20 Jun 2018 05:37:30 UTC (1,336 KB)
[v3] Fri, 10 Aug 2018 02:57:34 UTC (1 KB) (withdrawn)

Computer Science > Robotics

Title:Smooth and Efficient Policy Exploration for Robot Trajectory Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Smooth and Efficient Policy Exploration for Robot Trajectory Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators