Pontryagin's Minimum Principle and Forward-Backward Sweep Method for the System of HJB-FP Equations in Memory-Limited Partially Observable Stochastic Control

Tottori, Takehiro; Kobayashi, Tetsuya J.

doi:10.3390/e25020208

Mathematics > Optimization and Control

arXiv:2210.13040 (math)

[Submitted on 24 Oct 2022 (v1), last revised 8 Nov 2022 (this version, v3)]

Title:Pontryagin's Minimum Principle and Forward-Backward Sweep Method for the System of HJB-FP Equations in Memory-Limited Partially Observable Stochastic Control

Authors:Takehiro Tottori, Tetsuya J. Kobayashi

View PDF

Abstract:Memory-limited partially observable stochastic control (ML-POSC) is the stochastic optimal control problem under incomplete information and memory limitation. In order to obtain the optimal control function of ML-POSC, a system of the forward Fokker-Planck (FP) equation and the backward Hamilton-Jacobi-Bellman (HJB) equation needs to be solved. In this work, we firstly show that the system of HJB-FP equations can be interpreted via the Pontryagin's minimum principle on the probability density function space. Based on this interpretation, we then propose the forward-backward sweep method (FBSM) to ML-POSC, which has been used in the Pontryagin's minimum principle. FBSM is an algorithm to compute the forward FP equation and the backward HJB equation alternately. Although the convergence of FBSM is generally not guaranteed, it is guaranteed in ML-POSC because the coupling of HJB-FP equations is limited to the optimal control function in ML-POSC.

Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2210.13040 [math.OC]
	(or arXiv:2210.13040v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2210.13040
Related DOI:	https://doi.org/10.3390/e25020208

Submission history

From: Takehiro Tottori [view email]
[v1] Mon, 24 Oct 2022 08:50:50 UTC (15,556 KB)
[v2] Sat, 5 Nov 2022 15:22:32 UTC (8,068 KB)
[v3] Tue, 8 Nov 2022 05:39:55 UTC (8,665 KB)

Mathematics > Optimization and Control

Title:Pontryagin's Minimum Principle and Forward-Backward Sweep Method for the System of HJB-FP Equations in Memory-Limited Partially Observable Stochastic Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Pontryagin's Minimum Principle and Forward-Backward Sweep Method for the System of HJB-FP Equations in Memory-Limited Partially Observable Stochastic Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators