Using Echo State Networks to Approximate Value Functions for Control

Hart, Allen G.; Olding, Kevin R.; Cox, A. M. G.; Isupova, Olga; Dawes, J. H. P.

arXiv:2102.06258 (math)

[Submitted on 11 Feb 2021 (v1), last revised 25 Jun 2021 (this version, v2)]

Abstract: An Echo State Network (ESN) is a type of single-layer recurrent neural network with randomly chosen internal weights and a trainable output layer. We prove under mild conditions that a sufficiently large Echo State Network can approximate the value function of a broad class of stochastic and deterministic control problems. Such control problems are generally non-Markovian.
We describe how the ESN can form the basis for novel and computationally efficient reinforcement learning algorithms in a non-Markovian framework. We demonstrate this theory with two examples. In the first, we use an ESN to solve a deterministic, partially observed control problem: a simple game we call 'Bee World'. In the second, we consider a stochastic control problem inspired by a market-making problem in mathematical finance. In both cases we can compare the dynamics of the algorithms with analytic solutions to show that, even after only a single reinforcement policy iteration, the algorithms arrive at a good policy.
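
The architecture described in the abstract (fixed random internal weights, trainable linear output layer) can be illustrated with a minimal sketch. This is not the authors' algorithm: the tanh activation, the spectral-radius rescaling, the ridge-regression readout, and the toy reward signal (a sine wave whose discounted return is the regression target) are all assumptions chosen for illustration, and the function names `make_reservoir`, `run_reservoir`, `discounted_returns`, and `fit_readout` are hypothetical.

```python
import numpy as np

def make_reservoir(n_inputs, n_reservoir, spectral_radius=0.9, seed=0):
    """Draw fixed random weights; these are never trained."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(n_reservoir, n_reservoir))
    # Rescale so the largest eigenvalue magnitude equals spectral_radius
    # (a common heuristic, assumed here, not taken from the paper).
    W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))
    W_in = rng.uniform(-1.0, 1.0, size=(n_reservoir, n_inputs))
    b = rng.uniform(-1.0, 1.0, size=n_reservoir)
    return W, W_in, b

def run_reservoir(W, W_in, b, inputs):
    """Drive the reservoir with an input sequence and record every state."""
    x = np.zeros(W.shape[0])
    states = np.empty((len(inputs), W.shape[0]))
    for t, u in enumerate(inputs):
        x = np.tanh(W @ x + W_in @ u + b)
        states[t] = x
    return states

def discounted_returns(rewards, gamma):
    """Backward recursion v_t = r_t + gamma * v_{t+1}, with v = 0 past the end."""
    v = np.zeros_like(rewards)
    running = 0.0
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        v[t] = running
    return v

def fit_readout(states, targets, ridge=1e-4):
    """Train only the linear output layer, by ridge regression."""
    n = states.shape[1]
    A = states.T @ states + ridge * np.eye(n)
    return np.linalg.solve(A, states.T @ targets)

# Toy data (assumption): the observation is a sine wave and doubles as the reward.
T, washout, tail = 1000, 100, 100
u = np.sin(0.2 * np.arange(T)).reshape(-1, 1)
returns = discounted_returns(u[:, 0], gamma=0.9)

W, W_in, b = make_reservoir(n_inputs=1, n_reservoir=100)
states = run_reservoir(W, W_in, b, u)

# Skip the initial transient and the tail, where the finite-horizon
# return is a poor proxy for the infinite-horizon one.
keep = slice(washout, T - tail)
w_out = fit_readout(states[keep], returns[keep])
mse = np.mean((states[keep] @ w_out - returns[keep]) ** 2)
print("training MSE of the value estimate:", mse)
```

Only `w_out` is learned; the reservoir weights stay fixed after they are drawn, so training reduces to a single linear solve. Because the reservoir state depends on the whole input history rather than only the latest observation, a readout of this kind can in principle represent value functions that are not functions of the current observation alone.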

Subjects:	Dynamical Systems (math.DS); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2102.06258 [math.DS]
	(or arXiv:2102.06258v2 [math.DS] for this version)
	https://doi.org/10.48550/arXiv.2102.06258

Submission history

From: Allen Hart
[v1] Thu, 11 Feb 2021 20:33:20 UTC (1,739 KB)
[v2] Fri, 25 Jun 2021 12:09:10 UTC (1,757 KB)
