Ternary Policy Iteration Algorithm for Nonlinear Robust Control

Li, Jie; Li, Shengbo Eben; Guan, Yang; Duan, Jingliang; Li, Wenyu; Yin, Yuming

arXiv:2007.06810 (eess)

[Submitted on 14 Jul 2020]

Abstract: Uncertainties in plant dynamics remain a challenge for nonlinear control. This paper develops a ternary policy iteration (TPI) algorithm for solving nonlinear robust control problems with bounded uncertainties. The controller and the system uncertainty are treated as opposing players, and the robust control problem is formulated as a two-player zero-sum differential game, from which the corresponding Hamilton-Jacobi-Isaacs (HJI) equation is derived. Three loss functions and three update phases are designed to match, respectively, the identity equation, the minimization, and the maximization in the HJI equation. Each loss function is defined as the expectation of the approximate Hamiltonian over a generated set of states, which avoids having to process every state in the state space at once. The parameters of the value function and of the policies are updated directly by minimizing these loss functions with gradient descent. Moreover, the parameters of the control policy can be initialized to zero. The effectiveness of the proposed TPI algorithm is demonstrated in two simulation studies: the algorithm converges to the optimal solution for a linear plant and shows high resistance to disturbances for a nonlinear plant.
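The three-phase update described in the abstract can be illustrated on a toy problem. The following is a minimal sketch under stated assumptions, not the authors' implementation: the scalar linear plant, the cost weights, the quadratic value function V(x) = p*x^2, the linear policies u = th_u*x and w = th_w*x, and the specific loss forms (squared Hamiltonian for the value phase, Hamiltonian for the control phase, negated Hamiltonian for the disturbance phase) are all chosen here for illustration. The abstract states only that the three losses are expectations built from the approximate Hamiltonian over a generated state set.

```python
import random

# Hypothetical scalar plant dx/dt = A*x + B*u + K*w with running cost
# Q*x^2 + R*u^2 - GAMMA^2*w^2. All constants are illustrative assumptions.
A, B, K = -1.0, 1.0, 1.0
Q, R, GAMMA = 1.0, 1.0, 2.0


def bracket(p, th_u, th_w):
    """Return h such that the approximate Hamiltonian is H(x) = h * x^2.

    Assumes V(x) = p*x^2, u = th_u*x, w = th_w*x (illustrative choices).
    """
    a_cl = A + B * th_u + K * th_w  # closed-loop drift coefficient
    return Q + R * th_u**2 - GAMMA**2 * th_w**2 + 2.0 * p * a_cl


def tpi_sketch(iterations=3000, batch=64, lr=0.05, seed=0):
    """Run the three gradient-descent phases on freshly sampled state batches."""
    rng = random.Random(seed)
    p, th_u, th_w = 0.0, 0.0, 0.0  # all parameters start at zero
    for _ in range(iterations):
        # Generated state set: a small batch instead of the whole state space.
        xs = [rng.uniform(-1.0, 1.0) for _ in range(batch)]
        m2 = sum(x**2 for x in xs) / batch  # batch estimate of E[x^2]
        m4 = sum(x**4 for x in xs) / batch  # batch estimate of E[x^4]

        # Phase 1 (identity equation H = 0): descend L_V = E[H^2] in p.
        a_cl = A + B * th_u + K * th_w
        h = bracket(p, th_u, th_w)
        p -= lr * (2.0 * h * 2.0 * a_cl * m4)

        # Phase 2 (minimization): descend L_u = E[H] in th_u.
        th_u -= lr * (2.0 * R * th_u + 2.0 * p * B) * m2

        # Phase 3 (maximization): descend L_w = -E[H] in th_w.
        th_w += lr * (-2.0 * GAMMA**2 * th_w + 2.0 * p * K) * m2
    return p, th_u, th_w


if __name__ == "__main__":
    p, th_u, th_w = tpi_sketch()
    print(p, th_u, th_w, bracket(p, th_u, th_w))
```

For this toy plant the result can be checked against the closed-form root of the scalar game Riccati equation Q + 2*A*p - p^2*(B^2/R - K^2/GAMMA^2) = 0, with th_u = -p*B/R and th_w = p*K/GAMMA^2 at the saddle point. Because the open-loop plant assumed here is stable, starting every parameter from zero keeps the value update well defined in this sketch.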

Subjects: Systems and Control (eess.SY); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
Cite as: arXiv:2007.06810 [eess.SY] (or arXiv:2007.06810v1 [eess.SY] for this version)
DOI: https://doi.org/10.48550/arXiv.2007.06810

Submission history

From: Jie Li
[v1] Tue, 14 Jul 2020 04:31:28 UTC (2,014 KB)
