Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem

Lawrence, Nathan P.; Stewart, Gregory E.; Loewen, Philip D.; Forbes, Michael G.; Backstrom, Johan U.; Gopaluni, R. Bhushan

doi:10.1016/j.ifacol.2020.12.129

arXiv:2005.04539 (math)

[Submitted on 10 May 2020]

Abstract: Deep reinforcement learning (DRL) has seen several successful applications to process control. Common methods rely on a deep neural network structure to model the controller or process. With increasingly complicated control structures, the closed-loop stability of such methods becomes less clear. In this work, we focus on the interpretability of DRL control methods. In particular, we view linear fixed-structure controllers as shallow neural networks embedded in the actor-critic framework. PID controllers guide our development due to their simplicity and acceptance in industrial practice. We then consider input saturation, leading to a simple nonlinear control structure. To operate effectively within the actuator limits, we then incorporate a tuning parameter for anti-windup compensation. Finally, the simplicity of the controller allows for straightforward initialization. This makes our method inherently stabilizing, both during and after training, and amenable to known operational PID gains.
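The control structure described in the abstract (a linear PID law, followed by input saturation, with one extra tuning parameter for anti-windup) can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `PIDAntiWindup`, the parameter name `rho`, the back-calculation form of the anti-windup term, the forward-Euler discretization, and the helper `run_constant_error` are all assumptions chosen for illustration. The gains `kp`, `ki`, `kd` play the role of the weights of a single linear layer, and the saturation acts as its only nonlinearity.

```python
from dataclasses import dataclass


def saturate(u, u_min, u_max):
    """Clip the control signal to the actuator limits."""
    return max(u_min, min(u_max, u))


@dataclass
class PIDAntiWindup:
    # Tunable parameters (the "weights" an RL agent could adjust).
    kp: float
    ki: float
    kd: float
    rho: float  # assumed back-calculation anti-windup gain (hypothetical name)
    # Fixed actuator limits and sample time.
    u_min: float
    u_max: float
    dt: float
    # Internal controller state.
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, error):
        """Return the saturated control action for the current tracking error."""
        derivative = (error - self.prev_error) / self.dt
        # Linear layer: weighted sum of the error, its integral, its derivative.
        u_hat = self.kp * error + self.ki * self.integral + self.kd * derivative
        # Saturation: the single nonlinearity in the structure.
        u = saturate(u_hat, self.u_min, self.u_max)
        # Integrator update; (u - u_hat) is nonzero only while saturated,
        # so rho bleeds off the integral when the actuator is at its limit.
        self.integral += self.dt * (error + self.rho * (u - u_hat))
        self.prev_error = error
        return u


def run_constant_error(rho, steps=50):
    """Hypothetical helper: hold the error at 1.0 so the actuator stays saturated."""
    ctrl = PIDAntiWindup(kp=1.0, ki=1.0, kd=0.0, rho=rho,
                         u_min=-0.5, u_max=0.5, dt=0.1)
    outputs = [ctrl.step(1.0) for _ in range(steps)]
    return ctrl, outputs


if __name__ == "__main__":
    no_aw, _ = run_constant_error(rho=0.0)
    with_aw, _ = run_constant_error(rho=1.0)
    print("integral without anti-windup:", no_aw.integral)
    print("integral with anti-windup:   ", with_aw.integral)
```

With `rho = 0` the integral state keeps growing for as long as the actuator is saturated, whereas a positive `rho` keeps it bounded. In this sketch `rho` is just one more scalar parameter alongside the three PID gains, and initializing all four from known operating values is straightforward.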

Comments:	IFAC World Congress 2020
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2005.04539 [math.OC]
	(or arXiv:2005.04539v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2005.04539
Related DOI:	https://doi.org/10.1016/j.ifacol.2020.12.129

Submission history

From: Nathan P. Lawrence [view email]
[v1] Sun, 10 May 2020 01:05:26 UTC (1,078 KB)
