Causally Correct Partial Models for Reinforcement Learning

Rezende, Danilo J.; Danihelka, Ivo; Papamakarios, George; Ke, Nan Rosemary; Jiang, Ray; Weber, Theophane; Gregor, Karol; Merzic, Hamza; Viola, Fabio; Wang, Jane; Mitrovic, Jovana; Besse, Frederic; Antonoglou, Ioannis; Buesing, Lars

Computer Science > Machine Learning

arXiv:2002.02836 (cs)

[Submitted on 7 Feb 2020]

Title:Causally Correct Partial Models for Reinforcement Learning

Authors:Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

View PDF

Abstract:In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this paper, we show that partial models can be causally incorrect: they are confounded by the observations they don't model, and can therefore lead to incorrect planning. To address this, we introduce a general family of partial models that are provably causally correct, yet remain fast because they do not need to fully model future observations.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2002.02836 [cs.LG]
	(or arXiv:2002.02836v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.02836

Submission history

From: Ivo Danihelka [view email]
[v1] Fri, 7 Feb 2020 15:18:15 UTC (9,258 KB)

Computer Science > Machine Learning

Title:Causally Correct Partial Models for Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Causally Correct Partial Models for Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators