What is the Lagrangian for Nonlinear Filtering?

Kim, Jin W.; Mehta, Prashant G.; Meyn, Sean P.

Mathematics > Optimization and Control

arXiv:1903.11195 (math)

[Submitted on 27 Mar 2019 (v1), last revised 24 Oct 2019 (this version, v3)]

Title:What is the Lagrangian for Nonlinear Filtering?

Authors:Jin W. Kim, Prashant G. Mehta, Sean P. Meyn

View PDF

Abstract:Duality between estimation and optimal control is a problem of rich historical significance. The first duality principle appears in the seminal paper of Kalman-Bucy, where the problem of minimum variance estimation is shown to be dual to a linear quadratic (LQ) optimal control problem. Duality offers a constructive proof technique to derive the Kalman filter equation from the optimal control solution. This paper generalizes the classical duality result of Kalman-Bucy to the nonlinear filter: The state evolves as a continuous-time Markov process and the observation is a nonlinear function of state corrupted by an additive Gaussian noise. A dual process is introduced as a backward stochastic differential equation (BSDE). The process is used to transform the problem of minimum variance estimation into an optimal control problem. Its solution is obtained from an application of the maximum principle, and subsequently used to derive the equation of the nonlinear filter. The classical duality result of Kalman-Bucy is shown to be a special case.

Comments:	8 pages, 58th IEEE Conference on Decision and Control (Dec. 2019)
Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:1903.11195 [math.OC]
	(or arXiv:1903.11195v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1903.11195

Submission history

From: Jin Won Kim [view email]
[v1] Wed, 27 Mar 2019 00:06:29 UTC (80 KB)
[v2] Thu, 4 Apr 2019 00:11:31 UTC (84 KB)
[v3] Thu, 24 Oct 2019 18:00:57 UTC (84 KB)

Mathematics > Optimization and Control

Title:What is the Lagrangian for Nonlinear Filtering?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:What is the Lagrangian for Nonlinear Filtering?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators