Hindsight Experience Replay Accelerates Proximal Policy Optimization

Crowder, Douglas C.; McKenzie, Darrien M.; Trappett, Matthew L.; Chance, Frances S.

Computer Science > Machine Learning

arXiv:2410.22524 (cs)

[Submitted on 29 Oct 2024]

Title:Hindsight Experience Replay Accelerates Proximal Policy Optimization

Authors:Douglas C. Crowder, Darrien M. McKenzie, Matthew L. Trappett, Frances S. Chance

View PDF HTML (experimental)

Abstract:Hindsight experience replay (HER) accelerates off-policy reinforcement learning algorithms for environments that emit sparse rewards by modifying the goal of the episode post-hoc to be some state achieved during the episode. Because post-hoc modification of the observed goal violates the assumptions of on-policy algorithms, HER is not typically applied to on-policy algorithms. Here, we show that HER can dramatically accelerate proximal policy optimization (PPO), an on-policy reinforcement learning algorithm, when tested on a custom predator-prey environment.

Comments:	12 pages. 10 Figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2410.22524 [cs.LG]
	(or arXiv:2410.22524v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.22524

Submission history

From: Douglas Crowder [view email]
[v1] Tue, 29 Oct 2024 20:37:23 UTC (7,255 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-10

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Hindsight Experience Replay Accelerates Proximal Policy Optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hindsight Experience Replay Accelerates Proximal Policy Optimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators