Optimal Reinforcement Learning for Gaussian Systems

Hennig, Philipp

arXiv:1106.0800v1 (stat)

[Submitted on 4 Jun 2011 (this version), latest version 14 Oct 2011 (v3)]

Abstract: The exploration-exploitation tradeoff is among the central challenges of reinforcement learning. A hypothetical exact Bayesian learner would provide the optimal solution, but is intractable in general. I show, however, that in the specific case of Gaussian process inference it is possible to make analytic statements about optimal learning of both rewards and transition dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics. The solution is described by an infinite-dimensional differential equation. For a first impression of how this result may be useful, I also provide an approximate reduction to a finite-dimensional problem, with a numerical solution.

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1106.0800 [stat.ML] (or arXiv:1106.0800v1 [stat.ML] for this version)
DOI: https://doi.org/10.48550/arXiv.1106.0800

Submission history

From: Philipp Hennig
[v1] Sat, 4 Jun 2011 08:14:59 UTC (2,456 KB)
[v2] Wed, 7 Sep 2011 16:11:15 UTC (37 KB)
[v3] Fri, 14 Oct 2011 15:01:11 UTC (39 KB)

Ancillary files: movie.avi
