-
A Scenario-based Model Predictive Control Scheme for Pandemic Response through Non-pharmaceutical Interventions
Authors:
Domagoj Herceg,
Marco DellOro,
Riccardo Bertollo,
Fuminari Miura,
Paul de Klaver,
Valentina Breschi,
Dinesh Krishnamoorthy,
Mauro Salazar
Abstract:
This paper presents a scenario-based model predictive control (MPC) scheme designed to control an evolving pandemic via non-pharmaceutical intervention (NPIs). The proposed approach combines predictions of possible pandemic evolution to decide on a level of severity of NPIs to be implemented over multiple weeks to maintain hospital pressure below a prescribed threshold, while minimizing their impa…
▽ More
This paper presents a scenario-based model predictive control (MPC) scheme designed to control an evolving pandemic via non-pharmaceutical intervention (NPIs). The proposed approach combines predictions of possible pandemic evolution to decide on a level of severity of NPIs to be implemented over multiple weeks to maintain hospital pressure below a prescribed threshold, while minimizing their impact on society. Specifically, we first introduce a compartmental model which divides the population into Susceptible, Infected, Detected, Threatened, Healed, and Expired (SIDTHE) subpopulations and describe its positive invariant set. This model is expressive enough to explicitly capture the fraction of hospitalized individuals while preserving parameter identifiability w.r.t. publicly available datasets. Second, we devise a scenario-based MPC scheme with recourse actions that captures potential uncertainty of the model parameters. e.g., due to population behavior or seasonality. Our results show that the scenario-based nature of the proposed controller manages to adequately respond to all scenarios, keeping the hospital pressure at bay also in very challenging situations when conventional MPC methods fail.
△ Less
Submitted 22 June, 2025;
originally announced June 2025.
-
A Comparative Study of Distributed Feedback Optimizing Control Architectures
Authors:
Risvan Dirza,
Hari Prasad Varadarajan,
Vegard Aas,
Sigurd Skogestad,
Dinesh Krishnamoorthy
Abstract:
This paper considers the problem of steady-state real-time optimization (RTO) of interconnected systems with a common constraint that couples several units, for example, a shared resource. Such problems are often studied under the context of distributed optimization, where decisions are made locally in each subsystem, and are coordinated to optimize the overall performance. Here, we use distribute…
▽ More
This paper considers the problem of steady-state real-time optimization (RTO) of interconnected systems with a common constraint that couples several units, for example, a shared resource. Such problems are often studied under the context of distributed optimization, where decisions are made locally in each subsystem, and are coordinated to optimize the overall performance. Here, we use distributed feedback-optimizing control framework, where the local systems and the coordinator problems are converted into feedback control problems. This is a powerful scheme that allows us to design feedback control loops, and estimate parameters locally, as well as provide local fast response, allowing different closed-loop time constants for each local subsystem. This paper provides a comparative study of different distributed feedback optimizing control architectures using two case studies. The first case study considers the problem of demand response in a residential energy hub powered by a common renewable energy source, and compares the different feedback optimizing control approaches using simulations. The second case study experimentally validates and compares the different approaches using a lab-scale experimental rig that emulates a subsea oil production network, where the common resource is the gas lift that must be optimally allocated among the wells. %The pros and cons of the different approaches are discussed.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
ECCBO: An Inherently Safe Bayesian Optimization with Embedded Constraint Control for Real-Time Optimization
Authors:
Dinesh Krishnamoorthy
Abstract:
This paper introduces a model-free real-time optimization (RTO) framework based on unconstrained Bayesian optimization with embedded constraint control. The main contribution lies in demonstrating how this approach simplifies the black-box optimization problem while ensuring "always-feasible" setpoints, addressing a critical challenge in real-time optimization with unknown cost and constraints. No…
▽ More
This paper introduces a model-free real-time optimization (RTO) framework based on unconstrained Bayesian optimization with embedded constraint control. The main contribution lies in demonstrating how this approach simplifies the black-box optimization problem while ensuring "always-feasible" setpoints, addressing a critical challenge in real-time optimization with unknown cost and constraints. Noting that controlling the constraint does not require detailed process models, the key idea of this paper is to control the constraints to "some" setpoint using simple feedback controllers. Bayesian optimization then computes the optimum setpoint for the constraint controllers. By searching over the setpoints for the constraint controllers, as opposed to searching directly over the RTO degrees of freedom, this paper achieves an inherently safe and practical model-free RTO scheme. In particular, this paper shows that the proposed approach can achieve zero cumulative constraint violation without relying on assumptions about the Gaussian process model used in Bayesian optimization. The effectiveness of the proposed approach is demonstrated on a benchmark Williams-Otto reactor example.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Learning the cost-to-go for mixed-integer nonlinear model predictive control
Authors:
Christopher A. Orrico,
W. P. M. H. Heemels,
Dinesh Krishnamoorthy
Abstract:
Application of nonlinear model predictive control (NMPC) to problems with hybrid dynamical systems, disjoint constraints, or discrete controls often results in mixed-integer formulations with both continuous and discrete decision variables. However, solving mixed-integer nonlinear programming problems (MINLP) in real-time is challenging, which can be a limiting factor in many applications. To addr…
▽ More
Application of nonlinear model predictive control (NMPC) to problems with hybrid dynamical systems, disjoint constraints, or discrete controls often results in mixed-integer formulations with both continuous and discrete decision variables. However, solving mixed-integer nonlinear programming problems (MINLP) in real-time is challenging, which can be a limiting factor in many applications. To address the computational complexity of solving mixed integer nonlinear model predictive control problem in real-time, this paper proposes an approximate mixed integer NMPC formulation based on value function approximation. Leveraging Bellman's principle of optimality, the key idea here is to divide the prediction horizon into two parts, where the optimal value function of the latter part of the prediction horizon is approximated offline using expert demonstrations. Doing so allows us to solve the MINMPC problem with a considerably shorter prediction horizon online, thereby reducing the online computation cost. The paper uses an inverted pendulum example with discrete controls to illustrate this approach.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
On Building Myopic MPC Policies using Supervised Learning
Authors:
Christopher A. Orrico,
Bokan Yang,
Dinesh Krishnamoorthy
Abstract:
The application of supervised learning techniques in combination with model predictive control (MPC) has recently generated significant interest, particularly in the area of approximate explicit MPC, where function approximators like deep neural networks are used to learn the MPC policy via optimal state-action pairs generated offline. While the aim of approximate explicit MPC is to closely replic…
▽ More
The application of supervised learning techniques in combination with model predictive control (MPC) has recently generated significant interest, particularly in the area of approximate explicit MPC, where function approximators like deep neural networks are used to learn the MPC policy via optimal state-action pairs generated offline. While the aim of approximate explicit MPC is to closely replicate the MPC policy, substituting online optimization with a trained neural network, the performance guarantees that come with solving the online optimization problem are typically lost. This paper considers an alternative strategy, where supervised learning is used to learn the optimal value function offline instead of learning the optimal policy. This can then be used as the cost-to-go function in a myopic MPC with a very short prediction horizon, such that the online computation burden reduces significantly without affecting the controller performance. This approach differs from existing work on value function approximations in the sense that it learns the cost-to-go function by using offline-collected state-value pairs, rather than closed-loop performance data. The cost of generating the state-value pairs used for training is addressed using a sensitivity-based data augmentation scheme.
△ Less
Submitted 9 August, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
A Bayesian optimization framework for the automatic tuning of MPC-based shared controllers
Authors:
Anne van der Horst,
Bas Meere,
Dinesh Krishnamoorthy,
Saray Bakker,
Bram van de Vrande,
Henry Stoutjesdijk,
Marco Alonso,
Elena Torta
Abstract:
This paper presents a Bayesian optimization framework for the automatic tuning of shared controllers which are defined as a Model Predictive Control (MPC) problem. The proposed framework includes the design of performance metrics as well as the representation of user inputs for simulation-based optimization. The framework is applied to the optimization of a shared controller for an Image Guided Th…
▽ More
This paper presents a Bayesian optimization framework for the automatic tuning of shared controllers which are defined as a Model Predictive Control (MPC) problem. The proposed framework includes the design of performance metrics as well as the representation of user inputs for simulation-based optimization. The framework is applied to the optimization of a shared controller for an Image Guided Therapy robot. VR-based user experiments confirm the increase in performance of the automatically tuned MPC shared controller with respect to a hand-tuned baseline version as well as its generalization ability.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Mixed-Integer MPC Strategies for Fueling and Density Control in Fusion Tokamaks
Authors:
Christopher A. Orrico,
Matthijs van Berkel,
Thomas O. S. J. Bosman,
W. P. M. H. Heemels,
Dinesh Krishnamoorthy
Abstract:
Model predictive control (MPC) is promising for fueling and core density feedback control in nuclear fusion tokamaks, where the primary actuators, frozen hydrogen fuel pellets fired into the plasma, are discrete. Previous density feedback control approaches have only approximated pellet injection as a continuous input due to the complexity that it introduces. In this letter, we model plasma densit…
▽ More
Model predictive control (MPC) is promising for fueling and core density feedback control in nuclear fusion tokamaks, where the primary actuators, frozen hydrogen fuel pellets fired into the plasma, are discrete. Previous density feedback control approaches have only approximated pellet injection as a continuous input due to the complexity that it introduces. In this letter, we model plasma density and pellet injection as a hybrid system and propose two MPC strategies for density control: mixed-integer (MI) MPC using a conventional mixed-integer programming (MIP) solver and MPC utilizing our novel modification of the penalty term homotopy (PTH) algorithm. By relaxing the integer requirements, the PTH algorithm transforms the MIP problem into a series of continuous optimization problems, reducing computational complexity. Our novel modification to the PTH algorithm ensures that it can handle path constraints, making it viable for constrained hybrid MPC in general. Both strategies perform well with regards to reference tracking without violating path constraints and satisfy the computation time limit for real-time control of the pellet injection system. However, the computation time of the PTH-based MPC strategy consistently outpaces the conventional MI-MPC strategy.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Stability Properties of the Adaptive Horizon Multi-Stage MPC
Authors:
Zawadi Mdoe,
Dinesh Krishnamoorthy,
Johannes Jäschke
Abstract:
This paper presents an adaptive horizon multi-stage model-predictive control (MPC) algorithm. It establishes appropriate criteria for recursive feasibility and robust stability using the theory of input-to-state practical stability (ISpS). The proposed algorithm employs parametric nonlinear programming (NLP) sensitivity and terminal ingredients to determine the minimum stabilizing prediction horiz…
▽ More
This paper presents an adaptive horizon multi-stage model-predictive control (MPC) algorithm. It establishes appropriate criteria for recursive feasibility and robust stability using the theory of input-to-state practical stability (ISpS). The proposed algorithm employs parametric nonlinear programming (NLP) sensitivity and terminal ingredients to determine the minimum stabilizing prediction horizon for all the scenarios considered in the subsequent iterations of the multi-stage MPC. This technique notably decreases the computational cost in nonlinear model-predictive control systems with uncertainty, as they involve solving large and complex optimization problems. The efficacy of the controller is illustrated using three numerical examples that illustrate a reduction in computational delay in multi-stage MPC.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
ALADIN-based Distributed Model Predictive Control with dynamic partitioning: An application to Solar Parabolic Trough Plants
Authors:
P. Chanfreut,
J. M. Maestre,
D. Krishnamoorthy,
E. F. Camacho
Abstract:
This article presents a distributed model predictive controller with time-varying partitioning based on the augmented Lagrangian alternating direction inexact Newton method (ALADIN). In particular, we address the problem of controlling the temperature of a heat transfer fluid (HTF) in a set of loops of solar parabolic collectors by adjusting its flow rate. The control problem involves a nonlinear…
▽ More
This article presents a distributed model predictive controller with time-varying partitioning based on the augmented Lagrangian alternating direction inexact Newton method (ALADIN). In particular, we address the problem of controlling the temperature of a heat transfer fluid (HTF) in a set of loops of solar parabolic collectors by adjusting its flow rate. The control problem involves a nonlinear prediction model, decoupled inequality constraints, and coupled affine constraints on the system inputs. The application of ALADIN to address such a problem is combined with a dynamic clustering-based partitioning approach that aims at reducing, with minimum performance losses, the number of variables to be coordinated. Numerical results on a 10-loop plant are presented.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
Multi-agent Black-box Optimization using a Bayesian Approach to Alternating Direction Method of Multipliers
Authors:
Dinesh Krishnamoorthy,
Joel A. Paulson
Abstract:
Bayesian optimization (BO) is a powerful black-box optimization framework that looks to efficiently learn the global optimum of an unknown system by systematically trading-off between exploration and exploitation. However, the use of BO as a tool for coordinated decision-making in multi-agent systems with unknown structure has not been widely studied. This paper investigates a black-box optimizati…
▽ More
Bayesian optimization (BO) is a powerful black-box optimization framework that looks to efficiently learn the global optimum of an unknown system by systematically trading-off between exploration and exploitation. However, the use of BO as a tool for coordinated decision-making in multi-agent systems with unknown structure has not been widely studied. This paper investigates a black-box optimization problem over a multi-agent network coupled via shared variables or constraints, where each subproblem is formulated as a BO that uses only its local data. The proposed multi-agent BO (MABO) framework adds a penalty term to traditional BO acquisition functions to account for coupling between the subsystems without data sharing. We derive a suitable form for this penalty term using alternating directions method of multipliers (ADMM), which enables the local decision-making problems to be solved in parallel (and potentially asynchronously). The effectiveness of the proposed MABO method is demonstrated on an intelligent transport system for fuel efficient vehicle platooning.
△ Less
Submitted 25 March, 2023;
originally announced March 2023.
-
An Improved Data Augmentation Scheme for Model Predictive Control Policy Approximation
Authors:
Dinesh Krishnamoorthy
Abstract:
This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible state space. Yet, the key challenge of efficiently generating the training samples has not been studied widely. Recently, a sensitivity-based data augmentation…
▽ More
This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible state space. Yet, the key challenge of efficiently generating the training samples has not been studied widely. Recently, a sensitivity-based data augmentation framework for MPC policy approximation was proposed, where the parametric sensitivities are exploited to cheaply generate several additional samples from a single offline MPC computation. The error due to augmenting the training data set with inexact samples was shown to increase with the size of the neighborhood around each sample used for data augmentation. Building upon this work, this letter paper presents an improved data augmentation scheme based on predictor-corrector steps that enforces a user-defined level of accuracy, and shows that the error bound of the augmented samples are independent of the size of the neighborhood used for data augmentation.
△ Less
Submitted 29 May, 2023; v1 submitted 9 March, 2023;
originally announced March 2023.
-
Personalized Dose Guidance using Safe Bayesian Optimization
Authors:
Dinesh Krishnamoorthy,
Francis J. Doyle III
Abstract:
This work considers the problem of personalized dose guidance using Bayesian optimization that learns the optimum drug dose tailored to each individual, thus improving therapeutic outcomes. Safe learning using interior point method ensures patient safety with high probability. This is demonstrated using the problem of learning the optimum bolus insulin dose in patients with type 1 diabetes to coun…
▽ More
This work considers the problem of personalized dose guidance using Bayesian optimization that learns the optimum drug dose tailored to each individual, thus improving therapeutic outcomes. Safe learning using interior point method ensures patient safety with high probability. This is demonstrated using the problem of learning the optimum bolus insulin dose in patients with type 1 diabetes to counteract the effect of meal consumption. Starting from no a priori information about the patients, our dose guidance algorithm is able to improve the therapeutic outcome (measured in terms of % time-in-range) without jeopardizing patient safety. Other potential healthcare applications are also discussed.
△ Less
Submitted 30 October, 2022;
originally announced October 2022.
-
Safe Bayesian Optimization using Interior-Point Methods -- Applied to Personalized Insulin Dose Guidance
Authors:
Dinesh Krishnamoorthy,
Francis J. Doyle III
Abstract:
This paper considers the problem of Bayesian optimization for systems with safety-critical constraints, where both the objective function and the constraints are unknown, but can be observed by querying the system. In safety-critical applications, querying the system at an infeasible point can have catastrophic consequences. Such systems require a safe learning framework, such that the performance…
▽ More
This paper considers the problem of Bayesian optimization for systems with safety-critical constraints, where both the objective function and the constraints are unknown, but can be observed by querying the system. In safety-critical applications, querying the system at an infeasible point can have catastrophic consequences. Such systems require a safe learning framework, such that the performance objective can be optimized while satisfying the safety-critical constraints with high probability. In this paper we propose a safe Bayesian optimization framework that ensures that the points queried are always in the interior of the partially revealed safe region, thereby guaranteeing constraint satisfaction with high probability. The proposed interior-point Bayesian optimization framework can be used with any acquisition function, making it broadly applicable. The performance of the proposed method is demonstrated using a personalized insulin dosing application for patients with type 1 diabetes.
△ Less
Submitted 4 May, 2022;
originally announced May 2022.
-
On the Design and Analysis of Multivariable Extremum Seeking Control using Fast Fourier Transform
Authors:
Dinesh Krishnamoorthy
Abstract:
This paper proposes a multivariable extremum seeking scheme using Fast Fourier Transform (FFT) for a network of subsystems working towards optimizing the sum of their local objectives, where the overall objective is the only available measurement. Here, the different inputs are perturbed with different dither frequencies, and the power spectrum of the overall output signal obtained using FFT is us…
▽ More
This paper proposes a multivariable extremum seeking scheme using Fast Fourier Transform (FFT) for a network of subsystems working towards optimizing the sum of their local objectives, where the overall objective is the only available measurement. Here, the different inputs are perturbed with different dither frequencies, and the power spectrum of the overall output signal obtained using FFT is used to estimate the steady-state cost gradient w.r.t. each input. The inputs for the subsystems are then updated using integral control in order to drive the respective gradients to zero. This paper provides analytical rules for designing the FFT-based gradient estimation algorithm and analyzes the stability properties of the resulting extremum seeking scheme for the static map setting. The effectiveness of the proposed FFT-based multivariable extremum seeking scheme is demonstrated using two examples, namely, wind farm power optimization problem, and a heat exchanger network for industrial waste-to-heat recovery.
△ Less
Submitted 10 May, 2021; v1 submitted 29 April, 2021;
originally announced April 2021.
-
A Sensitivity-based Data Augmentation Framework for Model Predictive Control Policy Approximation
Authors:
Dinesh Krishnamoorthy
Abstract:
Approximating model predictive control (MPC) policy using expert-based supervised learning techniques requires labeled training data sets sampled from the MPC policy. This is typically obtained by sampling the feasible state-space and evaluating the control law by solving the numerical optimization problem offline for each sample. Although the resulting approximate policy can be cheaply evaluated…
▽ More
Approximating model predictive control (MPC) policy using expert-based supervised learning techniques requires labeled training data sets sampled from the MPC policy. This is typically obtained by sampling the feasible state-space and evaluating the control law by solving the numerical optimization problem offline for each sample. Although the resulting approximate policy can be cheaply evaluated online, generating large training samples to learn the MPC policy can be time consuming and prohibitively expensive. This is one of the fundamental bottlenecks that limit the design and implementation of MPC policy approximation. This technical note aims to address this challenge, and proposes a novel sensitivity-based data augmentation scheme for direct policy approximation. The proposed approach is based on exploiting the parametric sensitivities to cheaply generate additional training samples in the neighborhood of the existing samples.
△ Less
Submitted 14 March, 2022; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Sensitivity Assisted Alternating Directions Method of Multipliers for Distributed Optimization and Statistical Learning
Authors:
Dinesh Krishnamoorthy,
Vyacheslav Kungurtsev
Abstract:
This paper considers the problem of distributed model fitting using the alternating directions method of multipliers (ADMM). ADMM splits the learning problem into several smaller subproblems, usually by partitioning the data samples. The different subproblems can be solved in parallel by a set of worker computing nodes coordinated by a master node, and the subproblems are repeatedly solved until c…
▽ More
This paper considers the problem of distributed model fitting using the alternating directions method of multipliers (ADMM). ADMM splits the learning problem into several smaller subproblems, usually by partitioning the data samples. The different subproblems can be solved in parallel by a set of worker computing nodes coordinated by a master node, and the subproblems are repeatedly solved until convergence. At each iteration, the worker nodes must solve a convex optimization problem whose difficulty increases with the size of the problem. In this paper, we propose a sensitivity-assisted ADMM algorithm that leverages the parametric sensitivities such that the subproblems solutions can be approximated using a tangential predictor, thus easing the computational burden to computing one linear solve. We study the convergence properties of the proposed sensitivity-assisted ADMM algorithm. The numerical performance of the algorithm is illustrated on a nonlinear parameter estimation problem, and a multilayer perceptron learning problem.
△ Less
Submitted 2 March, 2022; v1 submitted 12 September, 2020;
originally announced September 2020.