Learning Optimal Control Policies for Stochastic Systems with a Relaxed Bellman Operator

Martinelli, Andrea; Lygeros, John


arXiv:2003.08721v1 (eess)

[Submitted on 19 Mar 2020 (this version), latest version 30 Nov 2020 (v2)]

Title: Learning Optimal Control Policies for Stochastic Systems with a Relaxed Bellman Operator

Authors: Andrea Martinelli, John Lygeros


Abstract: We introduce a relaxed version of the Bellman operator for q-functions and prove that it is still a monotone contraction mapping with a unique fixed point. In the spirit of the linear programming approach to approximate dynamic programming, we exploit the new operator to build a simplified linear program (LP) for q-functions. In the case of discrete-time stochastic linear systems with infinite state and action spaces, the solution of the LP preserves the minimizers of the optimal q-function. Therefore, even though the solution of the LP does not coincide with the optimal q-function, the policy we retrieve is the optimal one. The LP has fewer decision variables than existing programs, and we show how it can be employed together with reinforcement learning approaches when the dynamics are unknown.
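The abstract does not spell out the relaxed operator, so the following is only an illustrative sketch under stated assumptions. It assumes the relaxation exchanges the expectation and the minimization in the standard Bellman operator for q-functions, and it uses a small random finite MDP as a stand-in rather than the stochastic linear systems the abstract refers to. All names (`make_mdp`, `bellman_q`, `relaxed_bellman_q`, `fixed_point`) are hypothetical. The sketch iterates both operators to their fixed points, which is possible because each is a contraction in the sup norm for a discount factor below one.

```python
import random


def make_mdp(n_states, n_actions, seed=0):
    """Random finite MDP: stage costs cost[x][u], transition rows P[x][u][x_next]."""
    rng = random.Random(seed)
    cost = [[rng.random() for _ in range(n_actions)] for _ in range(n_states)]
    P = []
    for _ in range(n_states):
        rows = []
        for _ in range(n_actions):
            w = [rng.random() for _ in range(n_states)]
            total = sum(w)
            rows.append([wi / total for wi in w])  # normalize to a distribution
        P.append(rows)
    return cost, P


def bellman_q(q, cost, P, gamma):
    """Standard operator: cost(x,u) + gamma * E[ min_u' q(x_next, u') ]."""
    v = [min(row) for row in q]  # v(x_next) = min over u' of q(x_next, u')
    return [[cost[x][u] + gamma * sum(p * vi for p, vi in zip(P[x][u], v))
             for u in range(len(cost[x]))] for x in range(len(cost))]


def relaxed_bellman_q(q, cost, P, gamma):
    """ASSUMED relaxation: cost(x,u) + gamma * min_u' E[ q(x_next, u') ]."""
    n_actions = len(q[0])
    return [[cost[x][u] + gamma * min(
                sum(p * q[xn][un] for xn, p in enumerate(P[x][u]))
                for un in range(n_actions))
             for u in range(len(cost[x]))] for x in range(len(cost))]


def sup_dist(a, b):
    """Sup-norm distance between two q-tables of the same shape."""
    return max(abs(ai - bi) for ra, rb in zip(a, b) for ai, bi in zip(ra, rb))


def fixed_point(op, cost, P, gamma, tol=1e-10, max_iter=10_000):
    """Iterate q <- op(q) from zero until successive iterates differ by < tol."""
    q = [[0.0] * len(cost[0]) for _ in cost]
    for _ in range(max_iter):
        q_next = op(q, cost, P, gamma)
        if sup_dist(q, q_next) < tol:
            return q_next
        q = q_next
    return q


cost, P = make_mdp(n_states=4, n_actions=3)
gamma = 0.9
q_std = fixed_point(bellman_q, cost, P, gamma)
q_rel = fixed_point(relaxed_bellman_q, cost, P, gamma)
print("sup-norm gap between the two fixed points:", sup_dist(q_std, q_rel))
```

Under this assumed form, the relaxed operator dominates the standard one pointwise (a minimum of expectations is never below the expectation of a minimum), so its fixed point is an upper bound on the optimal q-function. The claim that the minimizers are preserved is stated in the abstract only for stochastic linear systems, so the greedy policies of `q_std` and `q_rel` need not agree in this finite toy example.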

Subjects:	Systems and Control (eess.SY); Optimization and Control (math.OC)
Cite as:	arXiv:2003.08721 [eess.SY]
	(or arXiv:2003.08721v1 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2003.08721

Submission history

From: Andrea Martinelli
[v1] Thu, 19 Mar 2020 13:01:00 UTC (122 KB)
[v2] Mon, 30 Nov 2020 12:19:50 UTC (709 KB)
