-
Model reduction, machine learning based global optimisation for large-scale steady state nonlinear systems
Authors:
Min Tao,
Panagiotis Petsagkourakis,
Jie Li,
Constantinos Theodoropoulos
Abstract:
Many engineering processes can be accurately modelled using partial differential equations (PDEs), but high dimensionality and non-convexity of the resulting systems pose limitations on their efficient optimisation. In this work, a model reduction, machine-learning methodology combining principal component analysis (PCA) and artificial neural networks (ANNs) is employed to construct a reduced surr…
▽ More
Many engineering processes can be accurately modelled using partial differential equations (PDEs), but high dimensionality and non-convexity of the resulting systems pose limitations on their efficient optimisation. In this work, a model reduction, machine-learning methodology combining principal component analysis (PCA) and artificial neural networks (ANNs) is employed to construct a reduced surrogate model, which can then be utilised by advanced deterministic global optimisation algorithms to compute global optimal solutions with theoretical guarantees. However, such optimisation would still be time-consuming due to the high non-convexity of the activation functions inside the reduced ANN structures. To develop a computationally-efficient optimisation framework, we propose two alternative strategies: The first one is a piecewise-affine reformulation of the nonlinear ANN activation functions, while the second one is based on deep rectifier neural networks with ReLU activation function. The performance of the proposed framework is demonstrated through two illustrative case studies.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Tube-based Distributionally Robust Model Predictive Control for Nonlinear Process Systems via Linearization
Authors:
Zhengang Zhong,
Ehecatl Antonio del Rio-Chanona,
Panagiotis Petsagkourakis
Abstract:
Model predictive control (MPC) is an effective approach to control multivariable dynamic systems with constraints. Most real dynamic models are however affected by plant-model mismatch and process uncertainties, which can lead to closed-loop performance deterioration and constraint violations. Methods such as stochastic MPC (SMPC) have been proposed to alleviate these problems; however, the result…
▽ More
Model predictive control (MPC) is an effective approach to control multivariable dynamic systems with constraints. Most real dynamic models are however affected by plant-model mismatch and process uncertainties, which can lead to closed-loop performance deterioration and constraint violations. Methods such as stochastic MPC (SMPC) have been proposed to alleviate these problems; however, the resulting closed-loop state trajectory might still significantly violate the prescribed constraints if the real system deviates from the assumed disturbance distributions made during the controller design. In this work we propose a novel data-driven distributionally robust MPC scheme for nonlinear systems. Unlike SMPC, which requires the exact knowledge of the disturbance distribution, our scheme decides the control action with respect to the worst distribution from a distribution ambiguity set. This ambiguity set is defined as a Wasserstein ball centered at the empirical distribution. Due to the potential model errors that cause off-sets, the scheme is also extended by leveraging an offset-free method. The favorable results of this control scheme are demonstrated and empirically verified with a nonlinear mass spring system and a nonlinear CSTR case study.
△ Less
Submitted 26 November, 2022;
originally announced November 2022.
-
Neural ODEs as Feedback Policies for Nonlinear Optimal Control
Authors:
Ilya Orson Sandoval,
Panagiotis Petsagkourakis,
Ehecatl Antonio del Rio-Chanona
Abstract:
Neural ordinary differential equations (Neural ODEs) define continuous time dynamical systems with neural networks. The interest in their application for modelling has sparked recently, spanning hybrid system identification problems and time series analysis. In this work we propose the use of a neural control policy capable of satisfying state and control constraints to solve nonlinear optimal con…
▽ More
Neural ordinary differential equations (Neural ODEs) define continuous time dynamical systems with neural networks. The interest in their application for modelling has sparked recently, spanning hybrid system identification problems and time series analysis. In this work we propose the use of a neural control policy capable of satisfying state and control constraints to solve nonlinear optimal control problems. The control policy optimization is posed as a Neural ODE problem to efficiently exploit the availability of a dynamical system model. We showcase the efficacy of this type of deterministic neural policies in two constrained systems: the controlled Van der Pol system and a bioreactor control problem. This approach represents a practical approximation to the intractable closed-loop solution of nonlinear control problems.
△ Less
Submitted 12 November, 2022; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Safe Real-Time Optimization using Multi-Fidelity Gaussian Processes
Authors:
Panagiotis Petsagkourakis,
Benoit Chachuat,
Ehecatl Antonio del Rio-Chanona
Abstract:
This paper proposes a new class of real-time optimization schemes to overcome system-model mismatch of uncertain processes. This work's novelty lies in integrating derivative-free optimization schemes and multi-fidelity Gaussian processes within a Bayesian optimization framework. The proposed scheme uses two Gaussian processes for the stochastic system, one emulates the (known) process model, and…
▽ More
This paper proposes a new class of real-time optimization schemes to overcome system-model mismatch of uncertain processes. This work's novelty lies in integrating derivative-free optimization schemes and multi-fidelity Gaussian processes within a Bayesian optimization framework. The proposed scheme uses two Gaussian processes for the stochastic system, one emulates the (known) process model, and another, the true system through measurements. In this way, low fidelity samples can be obtained via a model, while high fidelity samples are obtained through measurements of the system. This framework captures the system's behavior in a non-parametric fashion while driving exploration through acquisition functions. The benefit of using a Gaussian process to represent the system is the ability to perform uncertainty quantification in real-time and allow for chance constraints to be satisfied with high confidence. This results in a practical approach that is illustrated in numerical case studies, including a semi-batch photobioreactor optimization problem.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
Data-driven distributionally robust MPC using the Wasserstein metric
Authors:
Zhengang Zhong,
Ehecatl Antonio del Rio-Chanona,
Panagiotis Petsagkourakis
Abstract:
A data-driven MPC scheme is proposed to safely control constrained stochastic linear systems using distributionally robust optimization. Distributionally robust constraints based on the Wasserstein metric are imposed to bound the state constraint violations in the presence of process disturbance. A feedback control law is solved to guarantee that the predicted states comply with constraints. The s…
▽ More
A data-driven MPC scheme is proposed to safely control constrained stochastic linear systems using distributionally robust optimization. Distributionally robust constraints based on the Wasserstein metric are imposed to bound the state constraint violations in the presence of process disturbance. A feedback control law is solved to guarantee that the predicted states comply with constraints. The stochastic constraints are satisfied with regard to the worst-case distribution within the Wasserstein ball centered at their discrete empirical probability distribution. The resulting distributionally robust MPC framework is computationally tractable and efficient, as well as recursively feasible. The innovation of this approach is that all the information about the uncertainty can be determined empirically from the data. The effectiveness of the proposed scheme is demonstrated through numerical case studies.
△ Less
Submitted 18 May, 2021;
originally announced May 2021.
-
Safe model-based design of experiments using Gaussian processes
Authors:
Panagiotis Petsagkourakis,
Federico Galvanin
Abstract:
Construction of kinetic models has become an indispensable step in the development and scale up of processes in the industry. Model-based design of experiments (MBDoE) has been widely used for the purpose of improving parameter precision in nonlinear dynamic systems. This process needs to account for both parametric and structural uncertainty, as the feasibility constraints imposed on the system m…
▽ More
Construction of kinetic models has become an indispensable step in the development and scale up of processes in the industry. Model-based design of experiments (MBDoE) has been widely used for the purpose of improving parameter precision in nonlinear dynamic systems. This process needs to account for both parametric and structural uncertainty, as the feasibility constraints imposed on the system may well turn out to be violated leading to unsafe experimental conditions when an optimally designed experiment is performed. In this work, a Gaussian process is utilized in a two-fold manner: 1) to quantify the uncertainty realization of the physical system and calculate the plant-model mismatch, 2) to compute the optimal experimental design while accounting for the parametric uncertainty. This method provides a guarantee for the probabilistic satisfaction of the constraints in the context of model-based design of experiments. The method is assisted with the use of adaptive trust-regions in order to facilitate a satisfactory local approximation. The proposed method is able to allow the design of optimal experiments starting from limited preliminary knowledge of the parameter set, leading to a safe exploration of the parameter space. The performance of this method is demonstrated through illustrative case studies regarding the parameter identification of the kinetic model in flow reactors.
△ Less
Submitted 14 April, 2021; v1 submitted 19 November, 2020;
originally announced November 2020.
-
Constrained Model-Free Reinforcement Learning for Process Optimization
Authors:
Elton Pan,
Panagiotis Petsagkourakis,
Max Mowbray,
Dongda Zhang,
Antonio del Rio-Chanona
Abstract:
Reinforcement learning (RL) is a control approach that can handle nonlinear stochastic optimal control problems. However, despite the promise exhibited, RL has yet to see marked translation to industrial practice primarily due to its inability to satisfy state constraints. In this work we aim to address this challenge. We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantee…
▽ More
Reinforcement learning (RL) is a control approach that can handle nonlinear stochastic optimal control problems. However, despite the promise exhibited, RL has yet to see marked translation to industrial practice primarily due to its inability to satisfy state constraints. In this work we aim to address this challenge. We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with a high probability, which is crucial for safety critical tasks. To achieve this, constraint tightening (backoffs) are introduced and adjusted using Broyden's method, hence making them self-tuned. This results in a general methodology that can be imbued into approximate dynamic programming-based algorithms to ensure constraint satisfaction with high probability. Finally, we present case studies that analyze the performance of the proposed approach and compare this algorithm with model predictive control (MPC). The favorable performance of this algorithm signifies a step toward the incorporation of RL into real world optimization and control of engineering systems, where constraints are essential in ensuring safety.
△ Less
Submitted 14 April, 2021; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Real-Time Optimization Meets Bayesian Optimization and Derivative-Free Optimization: A Tale of Modifier Adaptation
Authors:
Ehecatl Antonio del Rio-Chanona,
Panagiotis Petsagkourakis,
Eric Bradford,
Jose Eduardo Alves Graciano,
Benoit Chachuat
Abstract:
This paper investigates a new class of modifier-adaptation schemes to overcome plant-model mismatch in real-time optimization of uncertain processes. The main contribution lies in the integration of concepts from the areas of Bayesian optimization and derivative-free optimization. The proposed schemes embed a physical model and rely on trust-region ideas to minimize risk during the exploration, wh…
▽ More
This paper investigates a new class of modifier-adaptation schemes to overcome plant-model mismatch in real-time optimization of uncertain processes. The main contribution lies in the integration of concepts from the areas of Bayesian optimization and derivative-free optimization. The proposed schemes embed a physical model and rely on trust-region ideas to minimize risk during the exploration, while employing Gaussian process regression to capture the plant-model mismatch in a non-parametric way and drive the exploration by means of acquisition functions. The benefits of using an acquisition function, knowing the process noise level, or specifying a nominal process model are illustrated on numerical case studies, including a semi-batch photobioreactor optimization problem.
△ Less
Submitted 1 February, 2021; v1 submitted 18 September, 2020;
originally announced September 2020.
-
Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
Authors:
Panagiotis Petsagkourakis,
Ilya Orson Sandoval,
Eric Bradford,
Dongda Zhang,
Ehecatl Antonio del Río Chanona
Abstract:
Dynamic real-time optimization (DRTO) is a challenging task due to the fact that optimal operating conditions must be computed in real time. The main bottleneck in the industrial application of DRTO is the presence of uncertainty. Many stochastic systems present the following obstacles: 1) plant-model mismatch, 2) process disturbances, 3) risks in violation of process constraints. To accommodate t…
▽ More
Dynamic real-time optimization (DRTO) is a challenging task due to the fact that optimal operating conditions must be computed in real time. The main bottleneck in the industrial application of DRTO is the presence of uncertainty. Many stochastic systems present the following obstacles: 1) plant-model mismatch, 2) process disturbances, 3) risks in violation of process constraints. To accommodate these difficulties, we present a constrained reinforcement learning (RL) based approach. RL naturally handles the process uncertainty by computing an optimal feedback policy. However, no state constraints can be introduced intuitively. To address this problem, we present a chance-constrained RL methodology. We use chance constraints to guarantee the probabilistic satisfaction of process constraints, which is accomplished by introducing backoffs, such that the optimal policy and backoffs are computed simultaneously. Backoffs are adjusted using the empirical cumulative distribution function to guarantee the satisfaction of a joint chance constraint. The advantage and performance of this strategy are illustrated through a stochastic dynamic bioprocess optimization problem, to produce sustainable high-value bioproducts.
△ Less
Submitted 4 June, 2020;
originally announced June 2020.
-
Reinforcement Learning for Batch Bioprocess Optimization
Authors:
Panagiotis Petsagkourakis,
Ilya Orson Sandoval,
Eric Bradford,
Dongda Zhang,
Ehecatl Antonio del Rio Chanona
Abstract:
Bioprocesses have received a lot of attention to produce clean and sustainable alternatives to fossil-based materials. However, they are generally difficult to optimize due to their unsteady-state operation modes and stochastic behaviours. Furthermore, biological systems are highly complex, therefore plant-model mismatch is often present. To address the aforementioned challenges we propose a Reinf…
▽ More
Bioprocesses have received a lot of attention to produce clean and sustainable alternatives to fossil-based materials. However, they are generally difficult to optimize due to their unsteady-state operation modes and stochastic behaviours. Furthermore, biological systems are highly complex, therefore plant-model mismatch is often present. To address the aforementioned challenges we propose a Reinforcement learning based optimization strategy for batch processes.
In this work, we applied the Policy Gradient method from batch-to-batch to update a control policy parametrized by a recurrent neural network. We assume that a preliminary process model is available, which is exploited to obtain a preliminary optimal control policy. Subsequently, this policy is updatedbased on measurements from thetrueplant. The capabilities of our proposed approach were tested on three case studies (one of which is nonsmooth) using a more complex process model for thetruesystemembedded with adequate process disturbance. Lastly, we discussed the advantages and disadvantages of this strategy compared against current existing approaches such as nonlinear model predictive control.
△ Less
Submitted 27 September, 2019; v1 submitted 15 April, 2019;
originally announced April 2019.