A Deeper Look at Experience Replay

Zhang, Shangtong; Sutton, Richard S.


arXiv:1712.01275v2 (cs)

[Submitted on 4 Dec 2017 (v1), revised 7 Mar 2018 (this version, v2), latest version 30 Apr 2018 (v3)]

Title: A Deeper Look at Experience Replay

Authors: Shangtong Zhang, Richard S. Sutton


Abstract: Experience replay is now widely used in deep reinforcement learning (RL) algorithms; in this paper we show that it is not as good as commonly believed. Specifically, experience replay can significantly hurt the learning process if the size of the replay buffer is not well tuned. Although experience replay is a necessary component of modern deep RL algorithms for stabilizing the network, the idea itself has drawbacks that deserve attention. The size of the replay buffer is an important hyperparameter that can significantly influence performance, and its importance has long been underestimated in the community. We conduct a systematic empirical study of experience replay under various function representations and show that a large replay buffer can significantly hurt performance. Moreover, we propose a simple O(1) method to remedy the negative influence of a large replay buffer. We demonstrate its utility in both a simple grid world and challenging domains such as Atari games, and we visualize how a large replay buffer hurts the learning process.
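To make the role of the buffer-size hyperparameter concrete, the following is a minimal sketch of a fixed-capacity replay buffer with uniform sampling. The abstract does not describe the proposed O(1) method, so the `include_latest` option below is only an assumption for illustration: it shows one constant-time modification of this kind, in which the most recent transition is always appended to the sampled batch. The class name, method names, and parameters are hypothetical and are not taken from the paper.

```python
import random


class ReplayBuffer:
    """Fixed-capacity circular buffer of transitions (illustrative sketch only)."""

    def __init__(self, capacity, seed=None):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity      # the buffer-size hyperparameter
        self.storage = []             # holds at most `capacity` transitions
        self.next_idx = 0             # slot that the next write will use
        self.latest_idx = None        # slot of the most recent transition
        self.rng = random.Random(seed)

    def __len__(self):
        return len(self.storage)

    def add(self, transition):
        """Store a transition in O(1), overwriting the oldest one when full."""
        if len(self.storage) < self.capacity:
            self.storage.append(transition)
        else:
            self.storage[self.next_idx] = transition
        self.latest_idx = self.next_idx
        self.next_idx = (self.next_idx + 1) % self.capacity

    def sample(self, batch_size, include_latest=False):
        """Return `batch_size` transitions drawn uniformly with replacement.

        With include_latest=True (an assumed remedy, not confirmed by the
        abstract), one slot of the batch is reserved for the newest
        transition, which costs O(1) extra work.
        """
        if not self.storage:
            raise ValueError("cannot sample from an empty buffer")
        n_random = batch_size - 1 if include_latest else batch_size
        batch = [self.rng.choice(self.storage) for _ in range(n_random)]
        if include_latest:
            batch.append(self.storage[self.latest_idx])
        return batch
```

With a large `capacity` and plain uniform sampling, a newly added transition is unlikely to appear in any given batch; reserving one batch slot for it guarantees that it is used immediately, regardless of the buffer size.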

Comments:	NIPS 2017 Deep Reinforcement Learning Symposium
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1712.01275 [cs.LG]
	(or arXiv:1712.01275v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1712.01275

Submission history

From: Shangtong Zhang
[v1] Mon, 4 Dec 2017 06:03:26 UTC (1,048 KB)
[v2] Wed, 7 Mar 2018 16:35:12 UTC (1,811 KB)
[v3] Mon, 30 Apr 2018 04:24:26 UTC (1,037 KB)
