Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data

Mocanu, Decebal Constantin; Vega, Maria Torres; Eaton, Eric; Stone, Peter; Liotta, Antonio

Computer Science > Machine Learning

arXiv:1610.05555 (cs)

[Submitted on 18 Oct 2016]

Title:Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data

Authors:Decebal Constantin Mocanu, Maria Torres Vega, Eric Eaton, Peter Stone, Antonio Liotta

View PDF

Abstract:Conceived in the early 1990s, Experience Replay (ER) has been shown to be a successful mechanism to allow online learning algorithms to reuse past experiences. Traditionally, ER can be applied to all machine learning paradigms (i.e., unsupervised, supervised, and reinforcement learning). Recently, ER has contributed to improving the performance of deep reinforcement learning. Yet, its application to many practical settings is still limited by the memory requirements of ER, necessary to explicitly store previous observations. To remedy this issue, we explore a novel approach, Online Contrastive Divergence with Generative Replay (OCD_GR), which uses the generative capability of Restricted Boltzmann Machines (RBMs) instead of recorded past experiences. The RBM is trained online, and does not require the system to store any of the observed data points. We compare OCD_GR to ER on 9 real-world datasets, considering a worst-case scenario (data points arriving in sorted order) as well as a more realistic one (sequential random-order data points). Our results show that in 64.28% of the cases OCD_GR outperforms ER and in the remaining 35.72% it has an almost equal performance, while having a considerably reduced space complexity (i.e., memory usage) at a comparable time complexity.

Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1610.05555 [cs.LG]
	(or arXiv:1610.05555v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1610.05555

Submission history

From: Decebal Constantin Mocanu [view email]
[v1] Tue, 18 Oct 2016 12:06:14 UTC (746 KB)

Computer Science > Machine Learning

Title:Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators