A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

Zhang, Amy; Ballas, Nicolas; Pineau, Joelle

Computer Science > Machine Learning

arXiv:1806.07937 (cs)

[Submitted on 20 Jun 2018 (v1), last revised 25 Jun 2018 (this version, v2)]

Title:A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

Authors:Amy Zhang, Nicolas Ballas, Joelle Pineau

View PDF

Abstract:The risks and perils of overfitting in machine learning are well known. However most of the treatment of this, including diagnostic tools and remedies, was developed for the supervised learning case. In this work, we aim to offer new perspectives on the characterization and prevention of overfitting in deep Reinforcement Learning (RL) methods, with a particular focus on continuous domains. We examine several aspects, such as how to define and diagnose overfitting in MDPs, and how to reduce risks by injecting sufficient training diversity. This work complements recent findings on the brittleness of deep RL methods and offers practical observations for RL researchers and practitioners.

Comments:	20 pages, 16 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1806.07937 [cs.LG]
	(or arXiv:1806.07937v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1806.07937

Submission history

From: Amy Zhang [view email]
[v1] Wed, 20 Jun 2018 19:27:59 UTC (14,607 KB)
[v2] Mon, 25 Jun 2018 17:09:04 UTC (5,388 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-06

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Amy Zhang
Nicolas Ballas
Joelle Pineau

export BibTeX citation

Computer Science > Machine Learning

Title:A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators