-
Networked Control and Mean Field Problems Under Diagonal Dominance: Decentralized and Social Optimality
Authors:
Vivek Khatana,
Duo Wang,
Petros Voulgaris,
Nicola Elia,
Naira Hovakimyan
Abstract:
In this article, we employ an input-output approach to expand the study of cooperative multi-agent control and optimization problems characterized by mean-field interactions that admit decentralized and selfish solutions. The setting involves $n$ independent agents that interact solely through a shared cost function, which penalizes deviations of each agent from the group's average collective beha…
▽ More
In this article, we employ an input-output approach to expand the study of cooperative multi-agent control and optimization problems characterized by mean-field interactions that admit decentralized and selfish solutions. The setting involves $n$ independent agents that interact solely through a shared cost function, which penalizes deviations of each agent from the group's average collective behavior. Building on our earlier results established for homogeneous agents, we extend the framework to nonidentical agents and show that, under a diagonal dominant interaction of the collective dynamics, with bounded local open-loop dynamics, the optimal controller for $H_\infty$ and $H_2$ norm minimization remains decentralized and selfish in the limit as the number of agents $n$ grows to infinity.
△ Less
Submitted 1 October, 2025;
originally announced October 2025.
-
$\mathcal{L}_1$-DRAC: Distributionally Robust Adaptive Control
Authors:
Aditya Gahlawat,
Sambhu H. Karumanchi,
Naira Hovakimyan
Abstract:
Data-driven machine learning methodologies have attracted considerable attention for the control and estimation of dynamical systems. However, such implementations suffer from a lack of predictability and robustness. Thus, adoption of data-driven tools has been minimal for safety-aware applications despite their impressive empirical results. While classical tools like robust adaptive control can e…
▽ More
Data-driven machine learning methodologies have attracted considerable attention for the control and estimation of dynamical systems. However, such implementations suffer from a lack of predictability and robustness. Thus, adoption of data-driven tools has been minimal for safety-aware applications despite their impressive empirical results. While classical tools like robust adaptive control can ensure predictable performance, their consolidation with data-driven methods remains a challenge and, when attempted, leads to conservative results. The difficulty of consolidation stems from the inherently different `spaces' that robust control and data-driven methods occupy. Data-driven methods suffer from the distribution-shift problem, which current robust adaptive controllers can only tackle if using over-simplified learning models and unverifiable assumptions. In this paper, we present $\mathcal{L}_1$ distributionally robust adaptive control ($\mathcal{L}_1$-DRAC): a control methodology for uncertain stochastic processes that guarantees robustness certificates in terms of uniform (finite-time) and maximal distributional deviation. We leverage the $\mathcal{L}_1$ adaptive control methodology to ensure the existence of Wasserstein ambiguity set around a nominal distribution, which is guaranteed to contain the true distribution. The uniform ambiguity set produces an ambiguity tube of distributions centered on the nominal temporally-varying nominal distribution. The designed controller generates the ambiguity tube in response to both epistemic (model uncertainties) and aleatoric (inherent randomness and disturbances) uncertainties.
△ Less
Submitted 4 September, 2025;
originally announced September 2025.
-
Hessian Riemannian Flow For Multi-Population Wardrop Equilibrium
Authors:
Tigran Bakaryan,
Christoph Aoun,
Ricardo de Lima Ribeiro,
Naira Hovakimyan,
Diogo Gomes
Abstract:
In this paper, we address the problem of optimizing flows on generalized graphs that feature multiple entry points and multiple populations, each with varying cost structures. We tackle this problem by considering the multi-population Wardrop equilibrium, defined through variational inequalities. We rigorously analyze the existence and uniqueness of the Wardrop equilibrium. Furthermore, we introdu…
▽ More
In this paper, we address the problem of optimizing flows on generalized graphs that feature multiple entry points and multiple populations, each with varying cost structures. We tackle this problem by considering the multi-population Wardrop equilibrium, defined through variational inequalities. We rigorously analyze the existence and uniqueness of the Wardrop equilibrium. Furthermore, we introduce an efficient numerical method to find the solution. In particular, we reformulate the equilibrium problem as a distributed optimization problem over subgraphs and introduce a novel Hessian Riemannian flow method, a Riemannian-manifold-projected Hessian flow, to efficiently compute a solution. Finally, we demonstrate the effectiveness of our approach through examples in urban traffic management, including routing for diverse vehicle types and strategies for minimizing emissions in congested environments.
△ Less
Submitted 22 April, 2025;
originally announced April 2025.
-
$\mathcal{L}_{1}$ Adaptive Optimizer for Online Time-Varying Convex Optimization
Authors:
Jinrae Kim,
Naira Hovakimyan
Abstract:
We propose an adaptive method for online time-varying (TV) convex optimization, termed $\mathcal{L}_{1}$ adaptive optimization ($\mathcal{L}_{1}$-AO). TV optimizers utilize a prediction model to exploit the temporal structure of TV problems, which can be inaccurate in the online implementation. Inspired by $\mathcal{L}_{1}$ adaptive control, the proposed method augments an adaptive update law to e…
▽ More
We propose an adaptive method for online time-varying (TV) convex optimization, termed $\mathcal{L}_{1}$ adaptive optimization ($\mathcal{L}_{1}$-AO). TV optimizers utilize a prediction model to exploit the temporal structure of TV problems, which can be inaccurate in the online implementation. Inspired by $\mathcal{L}_{1}$ adaptive control, the proposed method augments an adaptive update law to estimate and compensate for the uncertainty from the prediction inaccuracies. The proposed method provides performance bounds of the error in the optimization variables and cost function, allowing efficient and reliable optimization for TV problems. Numerical simulation results demonstrate the effectiveness of the proposed method for online TV convex optimization.
△ Less
Submitted 1 March, 2025; v1 submitted 24 September, 2024;
originally announced September 2024.
-
Multi-population opinion dynamics model
Authors:
Tigran Bakaryan,
Yuliang Gu,
Naira Hovakimyan,
Tarek Abdelzaher,
Christian Lebiere
Abstract:
We introduce multi-population opinion dynamics models linked to the bounded confidence model, aiming to explore how interactions between individuals contribute to the emergence of consensus, polarization, or fragmentation. Existing models either neglect agent similarities, sacrificing accuracy for scalability, or prioritize accuracy by introducing agent-wise connections, constraining scalability.…
▽ More
We introduce multi-population opinion dynamics models linked to the bounded confidence model, aiming to explore how interactions between individuals contribute to the emergence of consensus, polarization, or fragmentation. Existing models either neglect agent similarities, sacrificing accuracy for scalability, or prioritize accuracy by introducing agent-wise connections, constraining scalability. Our proposed model captures similarities between agents in scalable matter.
In our setting, agents similarities are defined by their group affiliations. Specifically, each sub-population is characterized by its distribution, and the closeness between two sub-populations is measured by the Wasserstein distance of their corresponding distributions. This leads to two mutually connected dynamics: micro, the individual-based dynamics, and the macro, the distribution-based one. The individual-wise interactions take into account the population-wise interactions (similarities), and the population-wise interactions are updated based on the individual-wise interactions.
We have proven the well-posedness of our models. Additionally, we conducted several simulations to mimic certain complex social phenomena.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
An Information-Theoretic Analysis of Discrete-Time Control and Filtering Limitations by the I-MMSE Relationships
Authors:
Neng Wan,
Dapeng Li,
Naira Hovakimyan,
Petros G. Voulgaris
Abstract:
Fundamental limitations or performance trade-offs/limits are important properties and constraints of both control and filtering systems. Among various trade-off metrics, total information rate that characterizes the sensitivity trade-offs and time-averaged performance of control and filtering systems was conventionally studied by using the differential entropy rate and Kolmogorov-Bode formula. In…
▽ More
Fundamental limitations or performance trade-offs/limits are important properties and constraints of both control and filtering systems. Among various trade-off metrics, total information rate that characterizes the sensitivity trade-offs and time-averaged performance of control and filtering systems was conventionally studied by using the differential entropy rate and Kolmogorov-Bode formula. In this paper, by extending the famous I-MMSE (mutual information -- minimum mean-square error) relationships to the discrete-time additive white Gaussian channels with and without feedback, a new paradigm is introduced to estimate and analyze total information rate as a control and filtering trade-off metric. Under this framework, we explore the trade-off properties of total information rate for a variety of the discrete-time control and filtering systems, e.g., LTI, LTV, and nonlinear, and propose an alternative approach to investigate total information rate via optimal estimation.
△ Less
Submitted 17 March, 2025; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Safety Embedded Stochastic Optimal Control of Networked Multi-Agent Systems via Barrier States
Authors:
Lin Song,
Pan Zhao,
Neng Wan,
Naira Hovakimyan
Abstract:
This paper presents a novel approach for achieving safe stochastic optimal control in networked multi-agent systems (MASs). The proposed method incorporates barrier states (BaSs) into the system dynamics to embed safety constraints. To accomplish this, the networked MAS is factorized into multiple subsystems, and each one is augmented with BaSs for the central agent. The optimal control law is obt…
▽ More
This paper presents a novel approach for achieving safe stochastic optimal control in networked multi-agent systems (MASs). The proposed method incorporates barrier states (BaSs) into the system dynamics to embed safety constraints. To accomplish this, the networked MAS is factorized into multiple subsystems, and each one is augmented with BaSs for the central agent. The optimal control law is obtained by solving the joint Hamilton-Jacobi-Bellman (HJB) equation on the augmented subsystem, which guarantees safety via the boundedness of the BaSs. The BaS-based optimal control technique yields safe control actions while maintaining optimality. The safe optimal control solution is approximated using path integrals. To validate the effectiveness of the proposed approach, numerical simulations are conducted on a cooperative UAV team in two different scenarios.
△ Less
Submitted 3 April, 2023; v1 submitted 7 October, 2022;
originally announced October 2022.
-
Fundamental Limitations of Control and Filtering in Continuous-Time Systems: An Information-Theoretic Analysis
Authors:
Neng Wan,
Dapeng Li,
Naira Hovakimyan
Abstract:
While information theory has been introduced to investigate and characterize the control and filtering limitations for a few decades, the existing information-theoretic methods are indirect and cumbersome for analyzing the fundamental limitations of continuous-time systems. To answer this challenge, we lift the information-theoretic analysis to continuous function spaces of infinite dimensions by…
▽ More
While information theory has been introduced to investigate and characterize the control and filtering limitations for a few decades, the existing information-theoretic methods are indirect and cumbersome for analyzing the fundamental limitations of continuous-time systems. To answer this challenge, we lift the information-theoretic analysis to continuous function spaces of infinite dimensions by using Duncan's theorem or the I-MMSE relationships. Continuous-time control and filtering systems are modeled as an additive Gaussian channel with or without feedback, and total information rate is identified as a control and filtering trade-off metric and directly computed from the estimation error of channel input. Inequality constraints for the trade-off metric are derived in a general setting and then applied to capture the fundamental limitations of various control and filtering systems subject to linear and nonlinear plants. For the linear systems, we show that total information rate has similar properties as some established trade-offs, e.g., Bode-type integrals and minimum estimation error. For the nonlinear systems, we provide a direct method to compute the total information rate and its lower bound by the Stratonovich-Kushner equation.
△ Less
Submitted 29 June, 2022; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Constrained Attack-Resilient Estimation of Stochastic Cyber-Physical Systems
Authors:
Wenbin Wan,
Hunmin Kim,
Naira Hovakimyan,
Petros Voulgaris
Abstract:
In this paper, a constrained attack-resilient estimation algorithm (CARE) is developed for stochastic cyber-physical systems. The proposed CARE can simultaneously estimate the compromised system states and attack signals. It has improved estimation accuracy and attack detection performance when physical constraints and operational limitations are available. In particular, CARE is designed for simu…
▽ More
In this paper, a constrained attack-resilient estimation algorithm (CARE) is developed for stochastic cyber-physical systems. The proposed CARE can simultaneously estimate the compromised system states and attack signals. It has improved estimation accuracy and attack detection performance when physical constraints and operational limitations are available. In particular, CARE is designed for simultaneous input and state estimation that provides minimum-variance unbiased estimates, and these estimates are projected onto the constrained space restricted by inequality constraints subsequently. We prove that the estimation errors and their covariances from CARE are less than those from unconstrained algorithms and confirm that this property can further reduce the false negative rate in attack detection. We show that estimation errors of CARE are practically exponentially stable in mean square. Finally, an illustrative example of attacks on a vehicle is given to demonstrate the improved estimation accuracy and detection performance compared to an existing unconstrained algorithm.
△ Less
Submitted 10 October, 2022; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Generalization of Safe Optimal Control Actions on Networked Multi-Agent Systems
Authors:
Lin Song,
Neng Wan,
Aditya Gahlawat,
Chuyuan Tao,
Naira Hovakimyan,
Evangelos A. Theodorou
Abstract:
We propose a unified framework to fast generate a safe optimal control action for a new task from existing controllers on Multi-Agent Systems (MASs). The control action composition is achieved by taking a weighted mixture of the existing controllers according to the contribution of each component task. Instead of sophisticatedly tuning the cost parameters and other hyper-parameters for safe and re…
▽ More
We propose a unified framework to fast generate a safe optimal control action for a new task from existing controllers on Multi-Agent Systems (MASs). The control action composition is achieved by taking a weighted mixture of the existing controllers according to the contribution of each component task. Instead of sophisticatedly tuning the cost parameters and other hyper-parameters for safe and reliable behavior in the optimal control framework, the safety of each single task solution is guaranteed using the control barrier functions (CBFs) for high-degree stochastic systems, which constrains the system state within a known safe operation region where it originates from. Linearity of CBF constraints in control enables the control action composition. The discussed framework can immediately provide reliable solutions to new tasks by taking a weighted mixture of solved component-task actions and filtering on some CBF constraints, instead of performing an extensive sampling to achieve a new controller. Our results are verified and demonstrated on both a single UAV and two cooperative UAV teams in an environment with obstacles.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
$\mathcal{L}_1$ Adaptive Control with Switched Reference Models: Application to Learn-to-Fly
Authors:
Steven Snyder,
Pan Zhao,
Naira Hovakimyan
Abstract:
Learn-to-Fly (L2F) is a new framework that aims to replace the traditional iterative development paradigm for aerial vehicles with a combination of real-time aerodynamic modeling, guidance, and learning control. To ensure safe learning of the vehicle dynamics on the fly, this paper presents an $\mathcal{L}_1$ adaptive control ($\mathcal{L}_1$AC) based scheme, which actively estimates and compensat…
▽ More
Learn-to-Fly (L2F) is a new framework that aims to replace the traditional iterative development paradigm for aerial vehicles with a combination of real-time aerodynamic modeling, guidance, and learning control. To ensure safe learning of the vehicle dynamics on the fly, this paper presents an $\mathcal{L}_1$ adaptive control ($\mathcal{L}_1$AC) based scheme, which actively estimates and compensates for the discrepancy between the intermediately learned dynamics and the actual dynamics. First, to incorporate the periodic update of the learned model within the L2F framework, this paper extends the $\mathcal{L}_1$AC architecture to handle a switched reference system subject to unknown time-varying parameters and disturbances. The paper also includes an analysis of both transient and steady-state performance of the $\mathcal{L}_1$AC architecture in the presence of non-zero initialization error for the state predictor. Second, the paper presents how the proposed $\mathcal{L}_1$AC scheme is integrated into the L2F framework, including its interaction with the baseline controller and the real-time modeling module. Finally, flight tests on an unmanned aerial vehicle (UAV) validate the efficacy of the proposed control and learning scheme.
△ Less
Submitted 5 August, 2022; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Distributed Algorithms for Linearly-Solvable Optimal Control in Networked Multi-Agent Systems
Authors:
Neng Wan,
Aditya Gahlawat,
Naira Hovakimyan,
Evangelos A. Theodorou,
Petros G. Voulgaris
Abstract:
Distributed algorithms for both discrete-time and continuous-time linearly solvable optimal control (LSOC) problems of networked multi-agent systems (MASs) are investigated in this paper. A distributed framework is proposed to partition the optimal control problem of a networked MAS into several local optimal control problems in factorial subsystems, such that each (central) agent behaves optimall…
▽ More
Distributed algorithms for both discrete-time and continuous-time linearly solvable optimal control (LSOC) problems of networked multi-agent systems (MASs) are investigated in this paper. A distributed framework is proposed to partition the optimal control problem of a networked MAS into several local optimal control problems in factorial subsystems, such that each (central) agent behaves optimally to minimize the joint cost function of a subsystem that comprises a central agent and its neighboring agents, and the local control actions (policies) only rely on the knowledge of local observations. Under this framework, we not only preserve the correlations between neighboring agents, but moderate the communication and computational complexities by decentralizing the sampling and computational processes over the network. For discrete-time systems modeled by Markov decision processes, the joint Bellman equation of each subsystem is transformed into a system of linear equations and solved using parallel programming. For continuous-time systems modeled by Itô diffusion processes, the joint optimality equation of each subsystem is converted into a linear partial differential equation, whose solution is approximated by a path integral formulation and a sample-efficient relative entropy policy search algorithm, respectively. The learned control policies are generalized to solve the unlearned tasks by resorting to the compositionality principle, and illustrative examples of cooperative UAV teams are provided to verify the effectiveness and advantages of these algorithms.
△ Less
Submitted 17 February, 2021;
originally announced February 2021.
-
$\mathcal{L}_1$ Adaptive Control for Switching Reference Systems: Application to Flight Control
Authors:
Steven Snyder,
Pan Zhao,
Naira Hovakimyan
Abstract:
This paper presents a framework for the design and analysis of an $\mathcal{L}_1$ adaptive controller with a switching reference system. The use of a switching reference system allows the desired behavior to be scheduled across the operating envelope, which is often required in aerospace applications. The analysis uses a switched reference system that assumes perfect knowledge of uncertainties and…
▽ More
This paper presents a framework for the design and analysis of an $\mathcal{L}_1$ adaptive controller with a switching reference system. The use of a switching reference system allows the desired behavior to be scheduled across the operating envelope, which is often required in aerospace applications. The analysis uses a switched reference system that assumes perfect knowledge of uncertainties and uses a corresponding non-adaptive controller. Provided that this switched reference system is stable, it is shown that the closed-loop system with unknown parameters and disturbances and the $\mathcal{L}_1$ adaptive controller can behave arbitrarily close to this reference system. Simulations of the short period dynamics of a transport class aircraft during the approach phase illustrate the theoretical results.
△ Less
Submitted 31 December, 2020;
originally announced January 2021.
-
Adaptive Robust Quadratic Programs using Control Lyapunov and Barrier Functions
Authors:
Pan Zhao,
Yanbing Mao,
Chuyuan Tao,
Naira Hovakimyan,
Xiaofeng Wang
Abstract:
This paper presents adaptive robust quadratic program (QP) based control using control Lyapunov and barrier functions for nonlinear systems subject to time-varying and state-dependent uncertainties. An adaptive estimation law is proposed to estimate the pointwise value of the uncertainties with pre-computable estimation error bounds. The estimated uncertainty and the error bounds are then used to…
▽ More
This paper presents adaptive robust quadratic program (QP) based control using control Lyapunov and barrier functions for nonlinear systems subject to time-varying and state-dependent uncertainties. An adaptive estimation law is proposed to estimate the pointwise value of the uncertainties with pre-computable estimation error bounds. The estimated uncertainty and the error bounds are then used to formulate a robust QP, which ensures that the actual uncertain system will not violate the safety constraints defined by the control barrier function. Additionally, the accuracy of the uncertainty estimation can be systematically improved by reducing the estimation sampling time, leading subsequently to reduced conservatism of the formulated robust QP. The proposed approach is validated in simulations on an adaptive cruise control problem and through comparisons with existing approaches.
△ Less
Submitted 19 October, 2020; v1 submitted 9 October, 2020;
originally announced October 2020.
-
Cooperative Path Integral Control for Stochastic Multi-Agent Systems
Authors:
Neng Wan,
Aditya Gahlawat,
Naira Hovakimyan,
Evangelos A. Theodorou,
Petros G. Voulgaris
Abstract:
A distributed stochastic optimal control solution is presented for cooperative multi-agent systems. The network of agents is partitioned into multiple factorial subsystems, each of which consists of a central agent and neighboring agents. Local control actions that rely only on agents' local observations are designed to optimize the joint cost functions of subsystems. When solving for the local co…
▽ More
A distributed stochastic optimal control solution is presented for cooperative multi-agent systems. The network of agents is partitioned into multiple factorial subsystems, each of which consists of a central agent and neighboring agents. Local control actions that rely only on agents' local observations are designed to optimize the joint cost functions of subsystems. When solving for the local control actions, the joint optimality equation for each subsystem is cast as a linear partial differential equation and solved using the Feynman-Kac formula. The solution and the optimal control action are then formulated as path integrals and approximated by a Monte-Carlo method. Numerical verification is provided through a simulation example consisting of a team of cooperative UAVs.
△ Less
Submitted 20 March, 2021; v1 submitted 30 September, 2020;
originally announced September 2020.
-
Compositionality of Linearly Solvable Optimal Control in Networked Multi-Agent Systems
Authors:
Lin Song,
Neng Wan,
Aditya Gahlawat,
Naira Hovakimyan,
Evangelos A. Theodorou
Abstract:
In this paper, we discuss the methodology of generalizing the optimal control law from learned component tasks to unlearned composite tasks on Multi-Agent Systems (MASs), by using the linearity composition principle of linearly solvable optimal control (LSOC) problems. The proposed approach achieves both the compositionality and optimality of control actions simultaneously within the cooperative M…
▽ More
In this paper, we discuss the methodology of generalizing the optimal control law from learned component tasks to unlearned composite tasks on Multi-Agent Systems (MASs), by using the linearity composition principle of linearly solvable optimal control (LSOC) problems. The proposed approach achieves both the compositionality and optimality of control actions simultaneously within the cooperative MAS framework in both discrete- and continuous-time in a sample-efficient manner, which reduces the burden of re-computation of the optimal control solutions for the new task on the MASs. We investigate the application of the proposed approach on the MAS with coordination between agents. The experiments show feasible results in investigated scenarios, including both discrete and continuous dynamical systems for task generalization without resampling.
△ Less
Submitted 22 March, 2021; v1 submitted 28 September, 2020;
originally announced September 2020.
-
Contraction $\mathcal{L}_1$-Adaptive Control using Gaussian Processes
Authors:
Aditya Gahlawat,
Arun Lakshmanan,
Lin Song,
Andrew Patterson,
Zhuohuan Wu,
Naira Hovakimyan,
Evangelos Theodorou
Abstract:
We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while…
▽ More
We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while providing safety certificates. Furthermore, $\mathcal{CL}_1$-$\mathcal{GP}$ incorporates any available data into a GP model of uncertainties, which improves performance and enables the motion planner to achieve optimality safely. This way, the safe operation of the system is always guaranteed, even during the learning transients. We provide a few illustrative examples for the safe learning and control of planar quadrotor systems in a variety of environments.
△ Less
Submitted 30 November, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Inequality Constraints in Facility Location and Other Similar Optimization Problems: An Entropy Based Approach
Authors:
Amber Srivastava,
Gabriel Barsi Haberfeld,
Naira Hovakimyan,
Srinivasa M Salapaka
Abstract:
In this paper we propose an annealing based framework to incorporate inequality constraints in optimization problems such as facility location, simultaneous facility location with path optimization, and the last mile delivery problem. These inequality constraints are used to model several application specific size and capacity limitations on the corresponding facilities, transportation paths and t…
▽ More
In this paper we propose an annealing based framework to incorporate inequality constraints in optimization problems such as facility location, simultaneous facility location with path optimization, and the last mile delivery problem. These inequality constraints are used to model several application specific size and capacity limitations on the corresponding facilities, transportation paths and the service vehicles. We design our algorithms in such a way that it allows to (possibly) violate the constraints during the initial stages of the algorithm, so as to facilitate a thorough exploration of the solution space; as the algorithm proceeds, this violation (controlled through the annealing parameter) is gradually lowered till the solution converges in the feasible region of the optimization problem. We present simulations on various datasets that demonstrate the efficacy of our algorithm.
△ Less
Submitted 9 February, 2020;
originally announced February 2020.
-
Intent-Aware Probabilistic Trajectory Estimation for Collision Prediction with Uncertainty Quantification
Authors:
Andrew Patterson,
Arun Lakshmanan,
Naira Hovakimyan
Abstract:
Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the ass…
▽ More
Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the assumption of deterministic knowledge of each obstacle's future behavior with probabilistic knowledge, to allow a larger class of obstacles to be considered. The method solely relies on position and velocity measurements to predict collisions with dynamic obstacles. We show that the uncertainty region for obstacle positions can be expressed in terms of a combination of polynomials generated with Gaussian process regression. To control the growth of uncertainty over arbitrary time horizons, a probabilistic obstacle intention is assumed as a distribution over obstacle positions and velocities, which can be naturally included in the Gaussian process framework. Our approach is demonstrated in two case studies in which (i), an obstacle overtakes the agent and (ii), an obstacle crosses the agent's path perpendicularly. In these simulations we show that the collision can be predicted despite having limited knowledge of the obstacle's behavior.
△ Less
Submitted 4 April, 2019;
originally announced April 2019.
-
Attack-resilient Estimation for Linear Discrete-time Stochastic Systems with Input and State Constraints
Authors:
Wenbin Wan,
Hunmin Kim,
Naira Hovakimyan,
Petros G. Voulgaris
Abstract:
In this paper, an attack-resilient estimation algorithm is presented for linear discrete-time stochastic systems with state and input constraints. It is shown that the state estimation errors of the proposed estimation algorithm are practically exponentially stable.
In this paper, an attack-resilient estimation algorithm is presented for linear discrete-time stochastic systems with state and input constraints. It is shown that the state estimation errors of the proposed estimation algorithm are practically exponentially stable.
△ Less
Submitted 19 March, 2019;
originally announced March 2019.
-
L1 Adaptive Output Feedback for Non-square Systems with Arbitrary Relative Degree
Authors:
Hanmin Lee,
Venanzio Cichella,
Naira Hovakimyan
Abstract:
This paper considers the problem of output feedback control for non-square multi-input multi-output systems with arbitrary relative degree. The proposed controller, based on the L1 adaptive control architecture, is designed using the right interactor matrix and a suitably defined projection matrix. A state-output predictor, a low-pass filter, and adaptive laws are introduced that achieve output tr…
▽ More
This paper considers the problem of output feedback control for non-square multi-input multi-output systems with arbitrary relative degree. The proposed controller, based on the L1 adaptive control architecture, is designed using the right interactor matrix and a suitably defined projection matrix. A state-output predictor, a low-pass filter, and adaptive laws are introduced that achieve output tracking of a desired reference signal. It is shown that the proposed control strategy guarantees closed-loop stability with arbitrarily small steady-state errors. The transient performance in the presence of non-zero initialization errors is quantified in terms of decreasing functions. Rigorous mathematical analysis and illustrative examples are provided to validate the theoretical claims.
△ Less
Submitted 22 January, 2019;
originally announced January 2019.
-
Bernstein approximation of optimal control problems
Authors:
Venanzio Cichella,
Isaac Kaminer,
Claire Walton,
Naira Hovakimyan,
Antonio Pascoal
Abstract:
Bernstein polynomial approximation to a continuous function has a slower rate of convergence as compared to other approximation methods. "The fact seems to have precluded any numerical application of Bernstein polynomials from having been made. Perhaps they will find application when the properties of the approximant in the large are of more importance than the closeness of the approximation." --…
▽ More
Bernstein polynomial approximation to a continuous function has a slower rate of convergence as compared to other approximation methods. "The fact seems to have precluded any numerical application of Bernstein polynomials from having been made. Perhaps they will find application when the properties of the approximant in the large are of more importance than the closeness of the approximation." -- has remarked P.J. Davis in his 1963 book Interpolation and Approximation. This paper presents a direct approximation method for nonlinear optimal control problems with mixed input and state constraints based on Bernstein polynomial approximation. We provide a rigorous analysis showing that the proposed method yields consistent approximations of time continuous optimal control problems. Furthermore, we demonstrate that the proposed method can also be used for costate estimation of the optimal control problems. This latter result leads to the formulation of the Covector Mapping Theorem for Bernstein polynomial approximation. Finally, we explore the numerical and geometric properties of Bernstein polynomials, and illustrate the advantages of the proposed approximation method through several numerical examples.
△ Less
Submitted 14 December, 2018;
originally announced December 2018.
-
A Simplified Approach to Analyze Complementary Sensitivity Trade-offs in Continuous-Time and Discrete-Time Systems
Authors:
Neng Wan,
Dapeng Li,
Naira Hovakimyan
Abstract:
A simplified approach is proposed to investigate the continuous-time and discrete-time complementary sensitivity Bode integrals (CSBIs) in this note. For continuous-time feedback systems with unbounded frequency domain, the CSBI weighted by $1/ω^2$ is considered, where this simplified method reveals a more explicit relationship between the value of CSBI and the structure of the open-loop transfer…
▽ More
A simplified approach is proposed to investigate the continuous-time and discrete-time complementary sensitivity Bode integrals (CSBIs) in this note. For continuous-time feedback systems with unbounded frequency domain, the CSBI weighted by $1/ω^2$ is considered, where this simplified method reveals a more explicit relationship between the value of CSBI and the structure of the open-loop transfer function. With a minor modification of this method, the CSBI of discrete-time system is derived, and illustrative examples are provided. Compared with the existing results on CSBI, neither Cauchy integral theorem nor Poisson integral formula are used throughout the analysis, and the analytic constraint on the integrand is removed.
△ Less
Submitted 24 November, 2018;
originally announced November 2018.
-
Sensitivity Analysis of Continuous-Time Linear Control Systems subject to Control and Measurement Noise: An Information-Theoretic Approach
Authors:
Neng Wan,
Dapeng Li,
Naira Hovakimyan
Abstract:
Sensitivity of linear continuous-time control systems, subject to control and measurement noise, is analyzed by deriving the lower bounds of Bode-like integrals via an information-theoretic approach. Bode integrals of four different sensitivity-like functions are employed to gauge the control trade-offs. When the signals of the control system are stationary Gaussian, these four different Bode-like…
▽ More
Sensitivity of linear continuous-time control systems, subject to control and measurement noise, is analyzed by deriving the lower bounds of Bode-like integrals via an information-theoretic approach. Bode integrals of four different sensitivity-like functions are employed to gauge the control trade-offs. When the signals of the control system are stationary Gaussian, these four different Bode-like integrals can be represented as differences between mutual information rates. These mutual information rates and hence the corresponding Bode-like integrals are proven to be bounded below by the unstable poles and zeros of the plant model, if the signals of the control system are wide-sense stationary.
△ Less
Submitted 24 November, 2018;
originally announced November 2018.
-
Primal-Dual Algorithm for Distributed Reinforcement Learning: Distributed GTD
Authors:
Donghwan Lee,
Hyungjin Yoon,
Naira Hovakimyan
Abstract:
The goal of this paper is to study a distributed version of the gradient temporal-difference (GTD) learning algorithm for multi-agent Markov decision processes (MDPs). The temporal difference (TD) learning is a reinforcement learning (RL) algorithm which learns an infinite horizon discounted cost function (or value function) for a given fixed policy without the model knowledge. In the distributed…
▽ More
The goal of this paper is to study a distributed version of the gradient temporal-difference (GTD) learning algorithm for multi-agent Markov decision processes (MDPs). The temporal difference (TD) learning is a reinforcement learning (RL) algorithm which learns an infinite horizon discounted cost function (or value function) for a given fixed policy without the model knowledge. In the distributed RL case each agent receives local reward through a local processing. Information exchange over sparse communication network allows the agents to learn the global value function corresponding to a global reward, which is a sum of local rewards. In this paper, the problem is converted into a constrained convex optimization problem with a consensus constraint. Then, we propose a primal-dual distributed GTD algorithm and prove that it almost surely converges to a set of stationary points of the optimization problem.
△ Less
Submitted 22 August, 2018; v1 submitted 21 March, 2018;
originally announced March 2018.
-
Stability Margins of $\mathcal{L}_1$ Adaptive Controller: Part II
Authors:
Chengyu Cao,
Naira Hovakimyan
Abstract:
In Part I of this paper, we have developed a novel $\mathcal{L}_1$ adaptive control architecture that enables fast adaptation and leads to uniformly bounded transient and asymptotic tracking for system's both signals, input and output, simultaneously. In this paper, we derive the stability margins of $\mathcal{L}_1$ adaptive control architecture, including time-delay and gain margins in the pres…
▽ More
In Part I of this paper, we have developed a novel $\mathcal{L}_1$ adaptive control architecture that enables fast adaptation and leads to uniformly bounded transient and asymptotic tracking for system's both signals, input and output, simultaneously. In this paper, we derive the stability margins of $\mathcal{L}_1$ adaptive control architecture, including time-delay and gain margins in the presence of time-varying bounded disturbance.
Simulations verify the theoretical findings.
△ Less
Submitted 15 August, 2006;
originally announced August 2006.
-
Guaranteed Transient Performance with $\mathcal{L}_1$ Adaptive Controller for Systems with Unknown Time-varying Parameters: Part I
Authors:
Chengyu Cao,
Naira Hovakimyan
Abstract:
This paper presents a novel adaptive control methodology for uncertain systems with time-varying unknown parameters and time-varying bounded disturbance. The adaptive controller ensures uniformly bounded transient and asymptotic tracking for system's both signals, input and output, simultaneously. The performance bounds can be systematically improved by increasing the adaptation gain. Simulation…
▽ More
This paper presents a novel adaptive control methodology for uncertain systems with time-varying unknown parameters and time-varying bounded disturbance. The adaptive controller ensures uniformly bounded transient and asymptotic tracking for system's both signals, input and output, simultaneously. The performance bounds can be systematically improved by increasing the adaptation gain. Simulations of a robotic arm with time-varying friction verify the theoretical findings.
△ Less
Submitted 15 August, 2006;
originally announced August 2006.
-
Design and Analysis of a Novel $\mathcal{L}_1$ Adaptive Control Architecture with Guaranteed Transient Performance
Authors:
Chengyu Cao,
Naira Hovakimyan
Abstract:
Novel adaptive control architecture is presented that has guaranteed transient performance for system's both signals, input and output, simultaneously.
Novel adaptive control architecture is presented that has guaranteed transient performance for system's both signals, input and output, simultaneously.
△ Less
Submitted 13 August, 2006;
originally announced August 2006.