Search | arXiv e-print repository

Experience in Engineering Complex Systems: Active Preference Learning with Multiple Outcomes and Certainty Levels

Authors: Le Anh Dao, Loris Roveda, Marco Maccarini, Matteo Lavit Nicora, Marta Mondellini, Matteo Meregalli Falerni, Palaniappan Veerappan, Lorenzo Mantovani, Dario Piga, Simone Formentin, Matteo Malosio

Abstract: Black-box optimization refers to the optimization problem whose objective function and/or constraint sets are either unknown, inaccessible, or non-existent. In many applications, especially with the involvement of humans, the only way to access the optimization problem is through performing physical experiments with the available outcomes being the preference of one candidate with respect to one o… ▽ More Black-box optimization refers to the optimization problem whose objective function and/or constraint sets are either unknown, inaccessible, or non-existent. In many applications, especially with the involvement of humans, the only way to access the optimization problem is through performing physical experiments with the available outcomes being the preference of one candidate with respect to one or many others. Accordingly, the algorithm so-called Active Preference Learning has been developed to exploit this specific information in constructing a surrogate function based on standard radial basis functions, and then forming an easy-to-solve acquisition function which repetitively suggests new decision vectors to search for the optimal solution. Based on this idea, our approach aims to extend the algorithm in such a way that can exploit further information effectively, which can be obtained in reality such as: 5-point Likert type scale for the outcomes of the preference query (i.e., the preference can be described in not only "this is better than that" but also "this is much better than that" level), or multiple outcomes for a single preference query with possible additive information on how certain the outcomes are. The validation of the proposed algorithm is done through some standard benchmark functions, showing a promising improvement with respect to the state-of-the-art algorithm. △ Less

Submitted 27 February, 2023; originally announced February 2023.

arXiv:2106.05639 [pdf, other]

C-GLISp: Preference-Based Global Optimization under Unknown Constraints with Applications to Controller Calibration

Authors: Mengjia Zhu, Dario Piga, Alberto Bemporad

Abstract: Preference-based global optimization algorithms minimize an unknown objective function only based on whether the function is better, worse, or similar for given pairs of candidate optimization vectors. Such optimization problems arise in many real-life examples, such as finding the optimal calibration of the parameters of a control law. The calibrator can judge whether a particular combination of… ▽ More Preference-based global optimization algorithms minimize an unknown objective function only based on whether the function is better, worse, or similar for given pairs of candidate optimization vectors. Such optimization problems arise in many real-life examples, such as finding the optimal calibration of the parameters of a control law. The calibrator can judge whether a particular combination of parameters leads to a better, worse, or similar closed-loop performance. Often, the search for the optimal parameters is also subject to unknown constraints. For example, the vector of calibration parameters must not lead to closed-loop instability. This paper extends an active preference learning algorithm introduced recently by the authors to handle unknown constraints. The proposed method, called C-GLISp, looks for an optimizer of the problem only based on preferences expressed on pairs of candidate vectors, and on whether a given vector is reported feasible and/or satisfactory. C-GLISp learns a surrogate of the underlying objective function based on the expressed preferences, and a surrogate of the probability that a sample is feasible and/or satisfactory based on whether each of the tested vectors was judged as such. The surrogate functions are used iteratively to propose a new candidate vector to test and judge. Numerical benchmarks and a semi-automated control calibration task demonstrate the effectiveness of C-GLISp, showing that it can reach near-optimal solutions within a small number of iterations. △ Less

Submitted 18 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

Comments: A MATLAB and a Python implementation of C-GLISp is available at http://cse.lab.imtlucca.it/~bemporad/glis

arXiv:2012.06846 [pdf, other]

A unified framework for closed-form nonparametric regression, classification, preference and mixed problems with Skew Gaussian Processes

Authors: Alessio Benavoli, Dario Azzimonti, Dario Piga

Abstract: Skew-Gaussian processes (SkewGPs) extend the multivariate Unified Skew-Normal distributions over finite dimensional vectors to distribution over functions. SkewGPs are more general and flexible than Gaussian processes, as SkewGPs may also represent asymmetric distributions. In a recent contribution we showed that SkewGP and probit likelihood are conjugate, which allows us to compute the exact post… ▽ More Skew-Gaussian processes (SkewGPs) extend the multivariate Unified Skew-Normal distributions over finite dimensional vectors to distribution over functions. SkewGPs are more general and flexible than Gaussian processes, as SkewGPs may also represent asymmetric distributions. In a recent contribution we showed that SkewGP and probit likelihood are conjugate, which allows us to compute the exact posterior for non-parametric binary classification and preference learning. In this paper, we generalize previous results and we prove that SkewGP is conjugate with both the normal and affine probit likelihood, and more in general, with their product. This allows us to (i) handle classification, preference, numeric and ordinal regression, and mixed problems in a unified framework; (ii) derive closed-form expression for the corresponding posterior distributions. We show empirically that the proposed framework based on SkewGP provides better performance than Gaussian processes in active learning and Bayesian (constrained) optimization. These two tasks are fundamental for design of experiments and in Data Science. △ Less

Submitted 27 January, 2021; v1 submitted 12 December, 2020; originally announced December 2020.

MSC Class: stat.ML; cs.LG

arXiv:2009.09778 [pdf, ps, other]

Computation of Parameter Dependent Robust Invariant Sets for LPV Models with Guaranteed Performance

Authors: Ankit Gupta, Manas Mejari, Paolo Falcone, Dario Piga

Abstract: This paper presents an iterative algorithm to compute a Robust Control Invariant (RCI) set, along with an invariance-inducing control law, for Linear Parameter-Varying (LPV) systems. As the real-time measurements of the scheduling parameters are typically available, in the presented formulation, we allow the RCI set description along with the invariance-inducing controller to be scheduling paramet… ▽ More This paper presents an iterative algorithm to compute a Robust Control Invariant (RCI) set, along with an invariance-inducing control law, for Linear Parameter-Varying (LPV) systems. As the real-time measurements of the scheduling parameters are typically available, in the presented formulation, we allow the RCI set description along with the invariance-inducing controller to be scheduling parameter dependent. The considered formulation thus leads to parameter-dependent conditions for the set invariance, which are replaced by sufficient Linear Matrix Inequality (LMI) conditions via Polya's relaxation. These LMI conditions are then combined with a novel volume maximization approach in a Semidefinite Programming (SDP) problem, which aims at computing the desirably large RCI set. In addition to ensuring invariance, it is also possible to guarantee performance within the RCI set by imposing a chosen quadratic performance level as an additional constraint in the SDP problem. The reported numerical example shows that the presented iterative algorithm can generate invariant sets which are larger than the maximal RCI sets computed without exploiting scheduling parameter information. △ Less

Submitted 30 November, 2022; v1 submitted 21 September, 2020; originally announced September 2020.

Comments: 15 pages, 6 figures, preprint submitted to Automatica

arXiv:2008.06677 [pdf, other]

Preferential Bayesian optimisation with Skew Gaussian Processes

Authors: Alessio Benavoli, Dario Azzimonti, Dario Piga

Abstract: Preferential Bayesian optimisation (PBO) deals with optimisation problems where the objective function can only be accessed via preference judgments, such as "this is better than that" between two candidate solutions (like in A/B tests or recommender systems). The state-of-the-art approach to PBO uses a Gaussian process to model the preference function and a Bernoulli likelihood to model the obser… ▽ More Preferential Bayesian optimisation (PBO) deals with optimisation problems where the objective function can only be accessed via preference judgments, such as "this is better than that" between two candidate solutions (like in A/B tests or recommender systems). The state-of-the-art approach to PBO uses a Gaussian process to model the preference function and a Bernoulli likelihood to model the observed pairwise comparisons. Laplace's method is then employed to compute posterior inferences and, in particular, to build an appropriate acquisition function. In this paper, we prove that the true posterior distribution of the preference function is a Skew Gaussian Process (SkewGP), with highly skewed pairwise marginals and, thus, show that Laplace's method usually provides a very poor approximation. We then derive an efficient method to compute the exact SkewGP posterior and use it as surrogate model for PBO employing standard acquisition functions (Upper Credible Bound, etc.). We illustrate the benefits of our exact PBO-SkewGP in a variety of experiments, by showing that it consistently outperforms PBO based on Laplace's approximation both in terms of convergence speed and computational time. We also show that our framework can be extended to deal with mixed preferential-categorical BO, where binary judgments (valid or non-valid) together with preference judgments are available. △ Less

Submitted 1 April, 2021; v1 submitted 15 August, 2020; originally announced August 2020.

Comments: arXiv admin note: text overlap with arXiv:2012.06846

arXiv:2003.11294 [pdf, other]

Preference-based MPC calibration

Authors: Mengjia Zhu, Alberto Bemporad, Dario Piga

Abstract: Automating the calibration of the parameters of a control policy by means of global optimization requires quantifying a closed-loop performance function. As this can be impractical in many situations, in this paper we suggest a semi-automated calibration approach that requires instead a human calibrator to express a preference on whether a certain control policy is "better" than another one, there… ▽ More Automating the calibration of the parameters of a control policy by means of global optimization requires quantifying a closed-loop performance function. As this can be impractical in many situations, in this paper we suggest a semi-automated calibration approach that requires instead a human calibrator to express a preference on whether a certain control policy is "better" than another one, therefore eliminating the need of an explicit performance index. In particular, we focus our attention on semi-automated calibration of Model Predictive Controllers (MPCs), for which we attempt computing the set of best calibration parameters by employing the recently-developed active preference-based optimization algorithm GLISp. Based on the preferences expressed by the human operator, GLISp learns a surrogate of the underlying closed-loop performance index that the calibrator (unconsciously) uses and proposes, iteratively, a new set of calibration parameters to him or her for testing and for comparison against previous experimental results. The resulting semi-automated calibration procedure is tested on two case studies, showing the capabilities of the approach in achieving near-optimal performance within a limited number of experiments. △ Less

Submitted 26 May, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

Comments: 8 pages, 4 figures, to be published in European Control Conference, 2021

arXiv:1911.13021 [pdf, other]

Efficient Calibration of Embedded MPC

Authors: Marco Forgione, Dario Piga, Alberto Bemporad

Abstract: Model Predictive Control (MPC) is a powerful and flexible design tool of high-performance controllers for physical systems in the presence of input and output constraints. A challenge for the practitioner applying MPC is the need of tuning a large number of parameters such as prediction and control horizons, weight matrices of the MPC cost function, and observer gains, according to different trade… ▽ More Model Predictive Control (MPC) is a powerful and flexible design tool of high-performance controllers for physical systems in the presence of input and output constraints. A challenge for the practitioner applying MPC is the need of tuning a large number of parameters such as prediction and control horizons, weight matrices of the MPC cost function, and observer gains, according to different trade-offs. The MPC design task is even more involved when the control law has to be deployed to an embedded hardware unit endowed with limited computational resources. In this case, real-time system requirements limit the complexity of the applicable MPC configuration, engendering additional design tradeoffs and requiring to tune further parameters, such as the sampling time and the tolerances used in the on-line numerical solver. To take into account closed-loop performance and real-time requirements, in this paper we tackle the embedded MPC design problem using a global, data-driven, optimization approach We showcase the potential of this approach by tuning an MPC controller on two hardware platforms characterized by largely different computational capabilities. △ Less

Submitted 17 January, 2021; v1 submitted 29 November, 2019; originally announced November 2019.

Comments: Source code generating the results of the paper available at https://github.com/forgi86/efficient-calibration-embedded-MPC

arXiv:1909.13049 [pdf, other]

Active preference learning based on radial basis functions

Authors: Alberto Bemporad, Dario Piga

Abstract: This paper proposes a method for solving optimization problems in which the decision-maker cannot evaluate the objective function, but rather can only express a preference such as "this is better than that" between two candidate decision vectors. The algorithm described in this paper aims at reaching the global optimizer by iteratively proposing the decision maker a new comparison to make, based o… ▽ More This paper proposes a method for solving optimization problems in which the decision-maker cannot evaluate the objective function, but rather can only express a preference such as "this is better than that" between two candidate decision vectors. The algorithm described in this paper aims at reaching the global optimizer by iteratively proposing the decision maker a new comparison to make, based on actively learning a surrogate of the latent (unknown and perhaps unquantifiable) objective function from past sampled decision vectors and pairwise preferences. The surrogate is fit by means of radial basis functions, under the constraint of satisfying, if possible, the preferences expressed by the decision maker on existing samples. The surrogate is used to propose a new sample of the decision vector for comparison with the current best candidate based on two possible criteria: minimize a combination of the surrogate and an inverse weighting distance function to balance between exploitation of the surrogate and exploration of the decision space, or maximize a function related to the probability that the new candidate will be preferred. Compared to active preference learning based on Bayesian optimization, we show that our approach is superior in that, within the same number of comparisons, it approaches the global optimum more closely and is computationally lighter. MATLAB and a Python implementations of the algorithms described in the paper are available at http://cse.lab.imtlucca.it/~bemporad/idwgopt. △ Less

Submitted 28 September, 2019; originally announced September 2019.

Comments: 33 pages, 10 figures

arXiv:1904.10839 [pdf, other]

doi 10.1109/LCSYS.2019.2913347

Performance-oriented model learning for data-driven MPC design

Authors: Dario Piga, Marco Forgione, Simone Formentin, Alberto Bemporad

Abstract: Model Predictive Control (MPC) is an enabling technology in applications requiring controlling physical processes in an optimized way under constraints on inputs and outputs. However, in MPC closed-loop performance is pushed to the limits only if the plant under control is accurately modeled; otherwise, robust architectures need to be employed, at the price of reduced performance due to worst-case… ▽ More Model Predictive Control (MPC) is an enabling technology in applications requiring controlling physical processes in an optimized way under constraints on inputs and outputs. However, in MPC closed-loop performance is pushed to the limits only if the plant under control is accurately modeled; otherwise, robust architectures need to be employed, at the price of reduced performance due to worst-case conservative assumptions. In this paper, instead of adapting the controller to handle uncertainty, we adapt the learning procedure so that the prediction model is selected to provide the best closed-loop performance. More specifically, we apply for the first time the above "identification for control" rationale to hierarchical MPC using data-driven methods and Bayesian optimization. △ Less

Submitted 23 April, 2019; originally announced April 2019.

Comments: Accepted for publication in the IEEE Control Systems Letters (L-CSS)

Journal ref: IEEE Control Systems Letters, pp. 577-582, 2019

arXiv:1711.09220 [pdf, other]

Fitting Jump Models

Authors: A. Bemporad, V. Breschi, D. Piga, S. Boyd

Abstract: We describe a new framework for fitting jump models to a sequence of data. The key idea is to alternate between minimizing a loss function to fit multiple model parameters, and minimizing a discrete loss function to determine which set of model parameters is active at each data point. The framework is quite general and encompasses popular classes of models, such as hidden Markov models and piecewi… ▽ More We describe a new framework for fitting jump models to a sequence of data. The key idea is to alternate between minimizing a loss function to fit multiple model parameters, and minimizing a discrete loss function to determine which set of model parameters is active at each data point. The framework is quite general and encompasses popular classes of models, such as hidden Markov models and piecewise affine models. The shape of the chosen loss functions to minimize determine the shape of the resulting jump model. △ Less

Submitted 21 May, 2018; v1 submitted 25 November, 2017; originally announced November 2017.

Comments: Accepted for publication in Automatica

arXiv:1705.02663 [pdf, other]

SOS for bounded rationality

Authors: Alessio Benavoli, Alessandro Facchini, Dario Piga, Marco Zaffalon

Abstract: In the gambling foundation of probability theory, rationality requires that a subject should always (never) find desirable all nonnegative (negative) gambles, because no matter the result of the experiment the subject never (always) decreases her money. Evaluating the nonnegativity of a gamble in infinite spaces is a difficult task. In fact, even if we restrict the gambles to be polynomials in R^n… ▽ More In the gambling foundation of probability theory, rationality requires that a subject should always (never) find desirable all nonnegative (negative) gambles, because no matter the result of the experiment the subject never (always) decreases her money. Evaluating the nonnegativity of a gamble in infinite spaces is a difficult task. In fact, even if we restrict the gambles to be polynomials in R^n , the problem of determining nonnegativity is NP-hard. The aim of this paper is to develop a computable theory of desirable gambles. Instead of requiring the subject to accept all nonnegative gambles, we only require her to accept gambles for which she can efficiently determine the nonnegativity (in particular SOS polynomials). We refer to this new criterion as bounded rationality. △ Less

Submitted 20 November, 2018; v1 submitted 7 May, 2017; originally announced May 2017.

arXiv:1609.04447 [pdf, ps, other]

doi 10.1109/TCST.2017.2702118

Direct data-driven control of constrained linear parameter-varying systems: A hierarchical approach

Authors: Dario Piga, Simone Formentin, Alberto Bemporad

Abstract: In many nonlinear control problems, the plant can be accurately described by a linear model whose operating point depends on some measurable variables, called scheduling signals. When such a linear parameter-varying (LPV) model of the open-loop plant needs to be derived from a set of data, several issues arise in terms of parameterization, estimation, and validation of the model before designing t… ▽ More In many nonlinear control problems, the plant can be accurately described by a linear model whose operating point depends on some measurable variables, called scheduling signals. When such a linear parameter-varying (LPV) model of the open-loop plant needs to be derived from a set of data, several issues arise in terms of parameterization, estimation, and validation of the model before designing the controller. Moreover, the way modeling errors affect the closed-loop performance is still largely unknown in the LPV context. In this paper, a direct data-driven control method is proposed to design LPV controllers directly from data without deriving a model of the plant. The main idea of the approach is to use a hierarchical control architecture, where the inner controller is designed to match a simple and a-priori specified closed-loop behavior. Then, an outer model predictive controller is synthesized to handle input/output constraints and to enhance the performance of the inner loop. The effectiveness of the approach is illustrated by means of a simulation and an experimental example. Practical implementation issues are also discussed. △ Less

Submitted 17 June, 2018; v1 submitted 14 September, 2016; originally announced September 2016.

Comments: Preliminary version of the paper "Direct data-driven control of constrained systems" published in the IEEE Transactions on Control Systems Technology

Journal ref: IEEE Transactions on Control Systems Technology (Volume: 26, Issue: 4, pg. 1422-1429, 2018)

arXiv:1604.02031 [pdf, other]

doi 10.1109/TAC.2017.2699281

A unified framework for deterministic and probabilistic D-stability analysis of uncertain polynomial matrices

Authors: Dario Piga, Alessio Benavoli

Abstract: Many problems in systems and control theory can be formulated in terms of robust D-stability analysis, which aims at verifying if all the eigenvalues of an uncertain matrix lie in a given region D of the complex plane. Robust D-stability analysis is an NP-hard problem and many polynomial-time algorithms providing either sufficient or necessary conditions for an uncertain matrix to be robustly D-st… ▽ More Many problems in systems and control theory can be formulated in terms of robust D-stability analysis, which aims at verifying if all the eigenvalues of an uncertain matrix lie in a given region D of the complex plane. Robust D-stability analysis is an NP-hard problem and many polynomial-time algorithms providing either sufficient or necessary conditions for an uncertain matrix to be robustly D-stable have been developed in the past decades. Despite the vast literature on the subject, most of the contributions consider specific families of uncertain matrices, mainly with interval or polytopic uncertainty. In this work, we present a novel approach providing sufficient conditions to verify if a family of matrices, whose entries depend polynomially on some uncertain parameters, is robustly D-stable. The only assumption on the stability region D is that its complement is a semialgebraic set described by polynomial constraints, which comprises the main important cases in stability analysis. Furthermore, the D-stability analysis problem is formulated in a probabilistic framework. In this context, the uncertain parameters characterizing the considered family of matrices are described by a set of non a priori specified probability measures. Only the support and some of the moments (e.g., expected values) are assumed to be known and, among all possible probability measures, we seek the one which provides the minimum probability of D-stability. The robust and the probabilistic D-stability analysis problems are formulated in a unified framework, and relaxations based on the theory of moments are used to solve the D-stability analysis problem through convex optimization. Application to robustness and probabilistic analysis of dynamical systems is discussed. △ Less

Submitted 17 June, 2018; v1 submitted 7 April, 2016; originally announced April 2016.

Comments: Extended version of the paper published in the IEEE Transactions on Automatic Control

Journal ref: IEEE Transactions on Automatic Control (Vol. 62, Issue 10, 2017)

arXiv:1505.01034 [pdf, ps, other]

A probabilistic interpretation of set-membership filtering: application to polynomial systems through polytopic bounding

Authors: Alessio Benavoli, Dario Piga

Abstract: Set-membership estimation is usually formulated in the context of set-valued calculus and no probabilistic calculations are necessary. In this paper, we show that set-membership estimation can be equivalently formulated in the probabilistic setting by employing sets of probability measures. Inference in set-membership estimation is thus carried out by computing expectations with respect to the upd… ▽ More Set-membership estimation is usually formulated in the context of set-valued calculus and no probabilistic calculations are necessary. In this paper, we show that set-membership estimation can be equivalently formulated in the probabilistic setting by employing sets of probability measures. Inference in set-membership estimation is thus carried out by computing expectations with respect to the updated set of probability measures P as in the probabilistic case. In particular, it is shown that inference can be performed by solving a particular semi-infinite linear programming problem, which is a special case of the truncated moment problem in which only the zero-th order moment is known (i.e., the support). By writing the dual of the above semi-infinite linear programming problem, it is shown that, if the nonlinearities in the measurement and process equations are polynomial and if the bounding sets for initial state, process and measurement noises are described by polynomial inequalities, then an approximation of this semi-infinite linear programming problem can efficiently be obtained by using the theory of sum-of-squares polynomial optimization. We then derive a smart greedy procedure to compute a polytopic outer-approximation of the true membership-set, by computing the minimum-volume polytope that outer-bounds the set that includes all the means computed with respect to P. △ Less

Submitted 12 April, 2016; v1 submitted 5 May, 2015; originally announced May 2015.

arXiv:1408.0532 [pdf, ps, other]

doi 10.1109/TAC.2014.2351695

A unified framework for solving a general class of conditional and robust set-membership estimation problems

Authors: Vito Cerone, Jean-Bernard Lasserre, Dario Piga, Diego Regruto

Abstract: In this paper we present a unified framework for solving a general class of problems arising in the context of set-membership estimation/identification theory. More precisely, the paper aims at providing an original approach for the computation of optimal conditional and robust projection estimates in a nonlinear estimation setting where the operator relating the data and the parameter to be estim… ▽ More In this paper we present a unified framework for solving a general class of problems arising in the context of set-membership estimation/identification theory. More precisely, the paper aims at providing an original approach for the computation of optimal conditional and robust projection estimates in a nonlinear estimation setting where the operator relating the data and the parameter to be estimated is assumed to be a generic multivariate polynomial function and the uncertainties affecting the data are assumed to belong to semialgebraic sets. By noticing that the computation of both the conditional and the robust projection optimal estimators requires the solution to min-max optimization problems that share the same structure, we propose a unified two-stage approach based on semidefinite-relaxation techniques for solving such estimation problems. The key idea of the proposed procedure is to recognize that the optimal functional of the inner optimization problems can be approximated to any desired precision by a multivariate polynomial function by suitably exploiting recently proposed results in the field of parametric optimization. Two simulation examples are reported to show the effectiveness of the proposed approach. △ Less

Submitted 3 August, 2014; originally announced August 2014.

Comments: Accpeted for publication in the IEEE Transactions on Automatic Control (2014)

Showing 1–15 of 15 results for author: Piga, D