Approximative Policy Iteration for Exit Time Feedback Control Problems driven by Stochastic Differential Equations using Tensor Train format

Fackeldey, Konstantin; Oster, Mathias; Sallandt, Leon; Schneider, Reinhold

Mathematics > Optimization and Control

arXiv:2010.04465 (math)

[Submitted on 9 Oct 2020]


Abstract: We consider a stochastic optimal exit time feedback control problem. The Bellman equation is solved approximately via the Policy Iteration algorithm on a polynomial ansatz space, which reduces it to a sequence of linear equations. As multivariate polynomials of high degree are needed, the corresponding equations suffer from the curse of dimensionality even in moderate dimensions. We employ tensor-train methods to mitigate this problem. The approximation step within the Policy Iteration is carried out via a least-squares ansatz, and the integration is carried out via Monte Carlo methods. Numerical evidence is given for the (multi-dimensional) double well potential and a three-hole potential.
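To illustrate why the abstract turns to the tensor-train format, the following minimal sketch compares the number of coefficients of a full tensor-product polynomial basis with the number of entries in a tensor-train representation. It is not taken from the paper: the function names, the choice of `n` basis functions per variable, and the single uniform rank `r` are assumptions made for illustration only.

```python
def full_tensor_params(n: int, d: int) -> int:
    """Coefficient count for a tensor-product basis with n functions
    per variable in d variables (grows exponentially in d)."""
    return n ** d


def tensor_train_params(n: int, d: int, r: int) -> int:
    """Entry count of d tensor-train cores of shape (r_{k-1}, n, r_k),
    with boundary ranks r_0 = r_d = 1 and all inner ranks set to r.
    The uniform rank r is an illustrative assumption."""
    ranks = [1] + [r] * (d - 1) + [1]
    return sum(ranks[k] * n * ranks[k + 1] for k in range(d))


if __name__ == "__main__":
    n, r = 5, 4  # hypothetical basis size per variable and TT rank
    for d in (2, 6, 10):
        print(d, full_tensor_params(n, d), tensor_train_params(n, d, r))
```

For a fixed rank the tensor-train count grows linearly in the dimension `d`, whereas the full coefficient tensor grows like `n**d`. Whether a small rank suffices for a given value function depends on the problem and is not guaranteed by this count.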

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2010.04465 [math.OC]
	(or arXiv:2010.04465v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2010.04465

Submission history

From: Leon Sallandt
[v1] Fri, 9 Oct 2020 09:46:07 UTC (822 KB)
