PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Charlesworth, Henry; Montana, Giovanni

Computer Science > Machine Learning

arXiv:2006.00900 (cs)

[Submitted on 1 Jun 2020]

Title:PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Authors:Henry Charlesworth, Giovanni Montana

View PDF

Abstract:Learning with sparse rewards remains a significant challenge in reinforcement learning (RL), especially when the aim is to train a policy capable of achieving multiple different goals. To date, the most successful approaches for dealing with multi-goal, sparse reward environments have been model-free RL algorithms. In this work we propose PlanGAN, a model-based algorithm specifically designed for solving multi-goal tasks in environments with sparse rewards. Our method builds on the fact that any trajectory of experience collected by an agent contains useful information about how to achieve the goals observed during that trajectory. We use this to train an ensemble of conditional generative models (GANs) to generate plausible trajectories that lead the agent from its current state towards a specified goal. We then combine these imagined trajectories into a novel planning algorithm in order to achieve the desired goal as efficiently as possible. The performance of PlanGAN has been tested on a number of robotic navigation/manipulation tasks in comparison with a range of model-free reinforcement learning baselines, including Hindsight Experience Replay. Our studies indicate that PlanGAN can achieve comparable performance whilst being around 4-8 times more sample efficient.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2006.00900 [cs.LG]
	(or arXiv:2006.00900v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.00900

Submission history

From: Henry Charlesworth [view email]
[v1] Mon, 1 Jun 2020 12:53:09 UTC (2,459 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-06

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Henry Charlesworth
Giovanni Montana

export BibTeX citation

Computer Science > Machine Learning

Title:PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators