Off-Policy Risk-Sensitive Reinforcement Learning Based Constrained Robust Optimal Control

Li, Cong; Liu, Qingchen; Zhou, Zhehua; Buss, Martin; Liu, Fangzhou

Electrical Engineering and Systems Science > Systems and Control

arXiv:2006.05681 (eess)

[Submitted on 10 Jun 2020 (v1), last revised 20 Apr 2022 (this version, v6)]

Abstract: This paper proposes an off-policy risk-sensitive reinforcement learning based control framework for stabilizing a continuous-time nonlinear system subject to additive disturbances, input saturation, and state constraints. By introducing pseudo controls and risk-sensitive input and state penalty terms, the constrained robust stabilization problem of the original system is converted into an equivalent optimal control problem of an auxiliary system. For the transformed optimal control problem, we adopt adaptive dynamic programming (ADP), implemented as a single critic structure, to approximate the value function that solves the Hamilton-Jacobi-Bellman (HJB) equation. This yields an approximate optimal control policy that satisfies both input and state constraints under disturbances. Replaying experience data in the off-policy weight update law of the critic artificial neural network guarantees weight convergence. Moreover, to obtain experience data that provides the sufficient excitation required for weight convergence, online and offline algorithms are developed as principled ways to record informative experience data. The equivalence proof demonstrates that the optimal control strategy of the auxiliary system robustly stabilizes the original system without violating input and state constraints. Proofs of system stability and weight convergence are provided. Simulation results validate the proposed control framework.
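
The role of informative experience data in weight convergence can be illustrated with a small, generic sketch. It is not the paper's update law: the quadratic basis, the normalized gradient step, the step size, the recorded points, and the names `basis`, `replay_step`, and `train` are all assumptions made for illustration. A linear-in-parameters critic is fitted by repeatedly replaying a fixed buffer of recorded samples. When the recorded regressors span the weight space, the weights converge; when they do not, some weights are never updated.

```python
def basis(x1, x2):
    # Hypothetical quadratic critic basis (an assumption; the abstract does not give one).
    return [x1 * x1, x1 * x2, x2 * x2]


def replay_step(w, buffer, lr):
    """One normalized gradient step on the squared residual summed over the buffer."""
    grad = [0.0] * len(w)
    for phi, target in buffer:
        err = sum(wi * pi for wi, pi in zip(w, phi)) - target
        scale = 1.0 + sum(p * p for p in phi)  # normalization keeps the step bounded
        for i, p in enumerate(phi):
            grad[i] += err * p / scale
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def train(points, w_true, steps=200, lr=0.5):
    """Record (regressor, target) pairs once, then replay them at every step."""
    buffer = []
    for x1, x2 in points:
        phi = basis(x1, x2)
        buffer.append((phi, sum(a * b for a, b in zip(w_true, phi))))
    w = [0.0] * len(w_true)
    for _ in range(steps):
        w = replay_step(w, buffer, lr)
    return w


W_TRUE = [1.0, -0.5, 2.0]  # illustrative "ideal" weights, chosen arbitrarily

# Recorded regressors span R^3: the replayed update recovers all three weights.
w_rich = train([(1, 0), (0, 1), (1, 1), (1, -1)], W_TRUE)

# Recorded regressors never excite the cross term x1*x2: its weight stays at 0.
w_poor = train([(1, 0), (0, 1)], W_TRUE)
```

In this toy setting the second buffer fits its own samples perfectly yet leaves the cross-term weight at its initial value, which is why the choice of recorded data matters for convergence.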

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2006.05681 [eess.SY]
	(or arXiv:2006.05681v6 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2006.05681

Submission history

From: Cong Li
[v1] Wed, 10 Jun 2020 06:42:36 UTC (1,899 KB)
[v2] Fri, 14 Aug 2020 08:31:59 UTC (3,826 KB)
[v3] Wed, 30 Dec 2020 19:40:30 UTC (3,978 KB)
[v4] Wed, 2 Jun 2021 14:15:07 UTC (4,379 KB)
[v5] Tue, 8 Jun 2021 09:27:24 UTC (4,379 KB)
[v6] Wed, 20 Apr 2022 06:48:19 UTC (4,379 KB)
