Search | arXiv e-print repository

Sample Complexity of Linear Quadratic Regulator Without Initial Stability

Authors: Amirreza Neshaei Moghaddam, Alex Olshevsky, Bahman Gharesifard

Abstract: Inspired by REINFORCE, we introduce a novel receding-horizon algorithm for the Linear Quadratic Regulator (LQR) problem with unknown parameters. Unlike prior methods, our algorithm avoids reliance on two-point gradient estimates while maintaining the same order of sample complexity. Furthermore, it eliminates the restrictive requirement of starting with a stable initial policy, broadening its appl… ▽ More Inspired by REINFORCE, we introduce a novel receding-horizon algorithm for the Linear Quadratic Regulator (LQR) problem with unknown parameters. Unlike prior methods, our algorithm avoids reliance on two-point gradient estimates while maintaining the same order of sample complexity. Furthermore, it eliminates the restrictive requirement of starting with a stable initial policy, broadening its applicability. Beyond these improvements, we introduce a refined analysis of error propagation through the contraction of the Riemannian distance over the Riccati operator. This refinement leads to a better sample complexity and ensures improved convergence guarantees. Numerical simulations validate the theoretical results, demonstrating the method's practical feasibility and performance in realistic scenarios. △ Less

Submitted 1 March, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

arXiv:2409.03871 [pdf, ps, other]

Inferring Global Exponential Stability Properties using Lie-bracket Approximations

Authors: Marc Weber, Bahman Gharesifard, Christian Ebenbauer

Abstract: In the present paper, a novel result for inferring uniform global, not semi-global, exponential stability in the sense of Lyapunov with respect to input-affine systems from global uniform exponential stability properties with respect to their associated Lie-bracket systems is shown. The result is applied to adapt dither frequencies to find a sufficiently high gain in adaptive control of linear unk… ▽ More In the present paper, a novel result for inferring uniform global, not semi-global, exponential stability in the sense of Lyapunov with respect to input-affine systems from global uniform exponential stability properties with respect to their associated Lie-bracket systems is shown. The result is applied to adapt dither frequencies to find a sufficiently high gain in adaptive control of linear unknown systems, and a simple numerical example is simulated to support the theoretical findings. △ Less

Submitted 5 September, 2024; originally announced September 2024.

Comments: Extended Version

arXiv:2404.10851 [pdf, ps, other]

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Authors: Amirreza Neshaei Moghaddam, Alex Olshevsky, Bahman Gharesifard

Abstract: We provide the first known algorithm that provably achieves $\varepsilon$-optimality within $\widetilde{\mathcal{O}}(1/\varepsilon)$ function evaluations for the discounted discrete-time LQR problem with unknown parameters, without relying on two-point gradient estimates. These estimates are known to be unrealistic in many settings, as they depend on using the exact same initialization, which is t… ▽ More We provide the first known algorithm that provably achieves $\varepsilon$-optimality within $\widetilde{\mathcal{O}}(1/\varepsilon)$ function evaluations for the discounted discrete-time LQR problem with unknown parameters, without relying on two-point gradient estimates. These estimates are known to be unrealistic in many settings, as they depend on using the exact same initialization, which is to be selected randomly, for two different policies. Our results substantially improve upon the existing literature outside the realm of two-point gradient estimates, which either leads to $\widetilde{\mathcal{O}}(1/\varepsilon^2)$ rates or heavily relies on stability assumptions. △ Less

Submitted 18 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

arXiv:2312.09083 [pdf, ps, other]

Sparse Linear Ensemble Systems and Structural Averaged Controllability: Single-input Case

Authors: Xudong Chen, Bahman Gharesifard

Abstract: We consider continuum ensembles of linear time-invariant control systems with single inputs. A sparsity pattern is said to be structurally averaged controllability if it admits an averaged controllable linear ensemble system. We provide a necessary and sufficient condition for a sparsity pattern to be structurally averaged controllable. We consider continuum ensembles of linear time-invariant control systems with single inputs. A sparsity pattern is said to be structurally averaged controllability if it admits an averaged controllable linear ensemble system. We provide a necessary and sufficient condition for a sparsity pattern to be structurally averaged controllable. △ Less

Submitted 14 December, 2023; originally announced December 2023.

arXiv:2209.07560 [pdf, other]

doi 10.1016/j.automatica.2022.110688

Event-Triggered Control for Discrete-Time Delay Systems

Authors: Kexue Zhang, Elena Braverman, Bahman Gharesifard

Abstract: This study focuses on event-triggered control of nonlinear discrete-time systems with time delays. Based on a Lyapunov-Krasovskii type input-to-state stability result, we propose a novel event-triggered control algorithm that works as follows. The control inputs are updated only when a certain measurement error surpasses a dynamical threshold depending on both the system states and the evolution t… ▽ More This study focuses on event-triggered control of nonlinear discrete-time systems with time delays. Based on a Lyapunov-Krasovskii type input-to-state stability result, we propose a novel event-triggered control algorithm that works as follows. The control inputs are updated only when a certain measurement error surpasses a dynamical threshold depending on both the system states and the evolution time. Sufficient conditions are established to ensure that the closed-loop system maintains its asymptotic stability. It is shown that the time-dependent portion in the dynamical threshold is essential to derive the lower bound of the times between two consecutive control updates. As a special case of our results, we demonstrate the performance of the designed event-triggering algorithm for a class of linear control systems with time delays. Numerical simulations are provided to demonstrate the effectiveness of our algorithm and theoretical results. △ Less

Submitted 15 September, 2022; originally announced September 2022.

Journal ref: Automatica 147 (2023) 110688

arXiv:2207.03566 [pdf, other]

A Note on Stability of Event-Triggered Control Systems with Time Delays

Authors: Kexue Zhang, Bahman Gharesifard, Elena Braverman

Abstract: This note studies stability of event-triggered control systems with the event-triggered control algorithm proposed in [1]. We construct a novel Halanay-type inequality, which is used to show that sufficient conditions of the main results in [1] ensure stability of the event-triggered control systems that was missing in [1]. It is also shown that a positive parameter in the proposed event-triggerin… ▽ More This note studies stability of event-triggered control systems with the event-triggered control algorithm proposed in [1]. We construct a novel Halanay-type inequality, which is used to show that sufficient conditions of the main results in [1] ensure stability of the event-triggered control systems that was missing in [1]. It is also shown that a positive parameter in the proposed event-triggering condition in [1] can be freely selected to exclude Zeno behavior from the event-triggered control system. An illustrative example is investigated to demonstrate the theoretical results of this study with numerical simulations. [1] K. Zhang, B. Gharesifard, and E. Braverman, Event-triggered control for nonlinear time-delay systems, IEEE Transactions on Automatic Control, vol. 67, no. 2, pp. 1031-1037, 2022. △ Less

Submitted 29 September, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

arXiv:2206.03640 [pdf, other]

doi 10.1109/TAC.2021.3062577

Event-Triggered Control for Nonlinear Time-Delay Systems

Authors: Kexue Zhang, Bahman Gharesifard, Elena Braverman

Abstract: This article studies the event-triggered control problem of general nonlinear systems with time delay. A novel event-triggering scheme is presented with two tunable design parameters, based on a Lyapunov functional result for the input-to-state stability of time-delay systems. The proposed event-triggered control algorithm guarantees the resulting closed-loop systems to be globally asymptotically… ▽ More This article studies the event-triggered control problem of general nonlinear systems with time delay. A novel event-triggering scheme is presented with two tunable design parameters, based on a Lyapunov functional result for the input-to-state stability of time-delay systems. The proposed event-triggered control algorithm guarantees the resulting closed-loop systems to be globally asymptotically stable, uniformly bounded, and/or globally attractive for different choices of these parameters. Sufficient conditions on the parameters are derived to exclude Zeno behavior. Two illustrative examples are studied to demonstrate our theoretical results. △ Less

Submitted 7 June, 2022; originally announced June 2022.

Journal ref: IEEE Transactions on Automatic Control 67(2)(2022) 1031-1037

arXiv:2205.09241 [pdf, other]

Neural ODE Control for Trajectory Approximation of Continuity Equation

Authors: Karthik Elamvazhuthi, Bahman Gharesifard, Andrea Bertozzi, Stanley Osher

Abstract: We consider the controllability problem for the continuity equation, corresponding to neural ordinary differential equations (ODEs), which describes how a probability measure is pushedforward by the flow. We show that the controlled continuity equation has very strong controllability properties. Particularly, a given solution of the continuity equation corresponding to a bounded Lipschitz vector f… ▽ More We consider the controllability problem for the continuity equation, corresponding to neural ordinary differential equations (ODEs), which describes how a probability measure is pushedforward by the flow. We show that the controlled continuity equation has very strong controllability properties. Particularly, a given solution of the continuity equation corresponding to a bounded Lipschitz vector field defines a trajectory on the set of probability measures. For this trajectory, we show that there exist piecewise constant training weights for a neural ODE such that the solution of the continuity equation corresponding to the neural ODE is arbitrarily close to it. As a corollary to this result, we establish that the continuity equation of the neural ODE is approximately controllable on the set of compactly supported probability measures that are absolutely continuous with respect to the Lebesgue measure. △ Less

Submitted 18 May, 2022; originally announced May 2022.

arXiv:2203.02591 [pdf, ps, other]

A Small Gain Analysis of Single Timescale Actor Critic

Authors: Alex Olshevsky, Bahman Gharesifard

Abstract: We consider a version of actor-critic which uses proportional step-sizes and only one critic update with a single sample from the stationary distribution per actor step. We provide an analysis of this method using the small-gain theorem. Specifically, we prove that this method can be used to find a stationary point, and that the resulting sample complexity improves the state of the art for actor-c… ▽ More We consider a version of actor-critic which uses proportional step-sizes and only one critic update with a single sample from the stationary distribution per actor step. We provide an analysis of this method using the small-gain theorem. Specifically, we prove that this method can be used to find a stationary point, and that the resulting sample complexity improves the state of the art for actor-critic methods to $O \left(μ^{-2} ε^{-2} \right)$ to find an $ε$-approximate stationary point where $μ$ is the condition number associated with the critic. △ Less

Submitted 25 May, 2023; v1 submitted 4 March, 2022; originally announced March 2022.

arXiv:2201.00724 [pdf, other]

Submodular Maximization with Limited Function Access

Authors: Andrew Downie, Bahman Gharesifard, Stephen L. Smith

Abstract: We consider a class of submodular maximization problems in which decision-makers have limited access to the objective function. We explore scenarios where the decision-maker can observe only pairwise information, i.e., can evaluate the objective function on sets of size two. We begin with a negative result that no algorithm using only $k$-wise information can guarantee performance better than… ▽ More We consider a class of submodular maximization problems in which decision-makers have limited access to the objective function. We explore scenarios where the decision-maker can observe only pairwise information, i.e., can evaluate the objective function on sets of size two. We begin with a negative result that no algorithm using only $k$-wise information can guarantee performance better than $k/n$. We present two algorithms that utilize only pairwise information about the function and characterize their performance relative to the optimal, which depends on the curvature of the submodular function. Additionally, if the submodular function possess a property called supermodularity of conditioning, then we can provide a method to bound the performance based purely on pairwise information. The proposed algorithms offer significant computational speedups over a traditional greedy strategy. A by-product of our study is the introduction of two new notions of curvature, the $k$-Marginal Curvature and the $k$-Cardinality Curvature. Finally, we present experiments highlighting the performance of our proposed algorithms in terms of approximation and time complexity. △ Less

Submitted 7 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

Comments: 14 pages, 8 figures

arXiv:2109.13559 [pdf, other]

A Note on Nussbaum-type Control and Lie-bracket Approximation

Authors: Marc Weber, Christian Ebenbauer, Bahman Gharesifard

Abstract: In this paper, we propose an adaptive control law for completely unknown scalar linear systems based on Lie-bracket approximation methods. We investigate stability and convergence properties for the resulting Lie-bracket system, compare our proposal with existing Nussbaum-type solutions and demonstrate our results with an example. Even though we prove global stability properties of the Lie-bracket… ▽ More In this paper, we propose an adaptive control law for completely unknown scalar linear systems based on Lie-bracket approximation methods. We investigate stability and convergence properties for the resulting Lie-bracket system, compare our proposal with existing Nussbaum-type solutions and demonstrate our results with an example. Even though we prove global stability properties of the Lie-bracket system, the stability properties of the proposed dynamics remain open, making the proposed control law an object of further studies. We elaborate the difficulties of establishing stability results by investigating connections to partial stability as well as studying the corresponding Chen-Fliess expansion. △ Less

Submitted 25 October, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

Comments: This is the extended version of our conference paper for CDC2021 with additional calculation steps

MSC Class: 93D21 (Primary); 93D21 (Secondary)

arXiv:2104.13839 [pdf, ps, other]

Structural averaged controllability of linear ensemble systems

Authors: Bahman Gharesifard, Xudong Chen

Abstract: In the paper, we introduce and address the problem of structural averaged controllability for linear ensemble systems. We provide examples highlighting the differences between this problem and others. In particular, we show that structural averaged controllability is strictly weaker than structural controllability for single (or ensembles of) linear systems. We establish a set of necessary or suff… ▽ More In the paper, we introduce and address the problem of structural averaged controllability for linear ensemble systems. We provide examples highlighting the differences between this problem and others. In particular, we show that structural averaged controllability is strictly weaker than structural controllability for single (or ensembles of) linear systems. We establish a set of necessary or sufficient conditions for sparsity patterns to be structurally averaged controllable. △ Less

Submitted 28 April, 2021; originally announced April 2021.

arXiv:2007.06007 [pdf, ps, other]

Universal Approximation Power of Deep Residual Neural Networks via Nonlinear Control Theory

Authors: Paulo Tabuada, Bahman Gharesifard

Abstract: In this paper, we explain the universal approximation capabilities of deep residual neural networks through geometric nonlinear control. Inspired by recent work establishing links between residual networks and control systems, we provide a general sufficient condition for a residual network to have the power of universal approximation by asking the activation function, or one of its derivatives, t… ▽ More In this paper, we explain the universal approximation capabilities of deep residual neural networks through geometric nonlinear control. Inspired by recent work establishing links between residual networks and control systems, we provide a general sufficient condition for a residual network to have the power of universal approximation by asking the activation function, or one of its derivatives, to satisfy a quadratic differential equation. Many activation functions used in practice satisfy this assumption, exactly or approximately, and we show this property to be sufficient for an adequately deep neural network with $n+1$ neurons per layer to approximate arbitrarily well, on a compact set and with respect to the supremum norm, any continuous function from $\mathbb{R}^n$ to $\mathbb{R}^n$. We further show this result to hold for very simple architectures for which the weights only need to assume two values. The first key technical contribution consists of relating the universal approximation problem to controllability of an ensemble of control systems corresponding to a residual network and to leverage classical Lie algebraic techniques to characterize controllability. The second technical contribution is to identify monotonicity as the bridge between controllability of finite ensembles and uniform approximability on compact sets. △ Less

Submitted 9 February, 2024; v1 submitted 12 July, 2020; originally announced July 2020.

Comments: Sejun Park and Geonho Hwang brought to our atention a mistake in the proof of Theorem 5.1. This mistake is corrected in this version with the consequence of increasing the number of neurons per layer from n+1 to 2n+1

Journal ref: ICLR 2021, TAC 2023

arXiv:1912.02396 [pdf, other]

doi 10.1016/j.nahs.2021.101109

Hybrid Event-Triggered and Impulsive Control for Time-Delay Systems

Authors: Kexue Zhang, Bahman Gharesifard

Abstract: In this paper, we study the problem of hybrid event-triggered control for a class of nonlinear time-delay systems. Using a Razumikhin-type input-to-state stability result for time-delay systems, we design an event-triggered control algorithm to stabilize the given time-delay system. In order to exclude Zeno behavior, we combine the impulsive control mechanism with our event-triggered strategy. In… ▽ More In this paper, we study the problem of hybrid event-triggered control for a class of nonlinear time-delay systems. Using a Razumikhin-type input-to-state stability result for time-delay systems, we design an event-triggered control algorithm to stabilize the given time-delay system. In order to exclude Zeno behavior, we combine the impulsive control mechanism with our event-triggered strategy. In this sense, the proposed algorithm is a hybrid impulsive and event-triggered strategy. Sufficient conditions for the stabilization of the nonlinear systems with time delay are obtained by using Lyapunov method and Razumikhin technique. Numerical simulations are provided to show the effectiveness of our theoretical results. △ Less

Submitted 16 March, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

Journal ref: Nonlinear Analysis: Hybrid Systems 43 (2021) 101109

arXiv:1802.10519 [pdf, other]

On the Lie bracket approximation approach to distributed optimization: Extensions and limitations

Authors: Simon Michalowsky, Bahman Gharesifard, Christian Ebenbauer

Abstract: We consider the problem of solving a smooth convex optimization problem with equality and inequality constraints in a distributed fashion. Assuming that we have a group of agents available capable of communicating over a communication network described by a time-invariant directed graph, we derive distributed continuous-time agent dynamics that ensure convergence to a neighborhood of the optimal s… ▽ More We consider the problem of solving a smooth convex optimization problem with equality and inequality constraints in a distributed fashion. Assuming that we have a group of agents available capable of communicating over a communication network described by a time-invariant directed graph, we derive distributed continuous-time agent dynamics that ensure convergence to a neighborhood of the optimal solution of the optimization problem. Following the ideas introduced in our previous work, we combine saddle-point dynamics with Lie bracket approximation techniques. While the methodology was previously limited to linear constraints and objective functions given by a sum of strictly convex separable functions, we extend these result here and show that it applies to a very general class of optimization problems under mild assumptions on the communication topology. △ Less

Submitted 28 February, 2018; originally announced February 2018.

arXiv:1711.05486 [pdf, other]

A Lie bracket approximation approach to distributed optimization over directed graphs

Authors: Simon Michalowsky, Bahman Gharesifard, Christian Ebenbauer

Abstract: We consider a group of computation units trying to cooperatively solve a distributed optimization problem with shared linear equality and inequality constraints. Assuming that the computation units are communicating over a network whose topology is described by a time-invariant directed graph, by combining saddle-point dynamics with Lie bracket approximation techniques we derive a methodology that… ▽ More We consider a group of computation units trying to cooperatively solve a distributed optimization problem with shared linear equality and inequality constraints. Assuming that the computation units are communicating over a network whose topology is described by a time-invariant directed graph, by combining saddle-point dynamics with Lie bracket approximation techniques we derive a methodology that allows to design distributed continuous-time optimization algorithms that solve this problem under minimal assumptions on the graph topology as well as on the structure of the constraints. We discuss several extensions as well as special cases in which the proposed procedure becomes particularly simple. △ Less

Submitted 5 June, 2019; v1 submitted 15 November, 2017; originally announced November 2017.

arXiv:1710.01397 [pdf, ps, other]

Controllability of coupled parabolic systems with multiple underactuations: parts I and II

Authors: Drew Steeves, Bahman Gharesifard, Abdol-Reza Mansouri

Abstract: This work studies the null controllability of a system of coupled parabolic PDEs. In particular, our work specializes to an important subclass of these control problems which are coupled by first and zero-order couplings and are, additionally, underactuated. We pose our control problem in a fairly new framework which divides the problem into interconnected parts: we refer to the first part as the… ▽ More This work studies the null controllability of a system of coupled parabolic PDEs. In particular, our work specializes to an important subclass of these control problems which are coupled by first and zero-order couplings and are, additionally, underactuated. We pose our control problem in a fairly new framework which divides the problem into interconnected parts: we refer to the first part as the analytic control problem, where we use slightly non-classical techniques to prove null controllability by means of internal controls appearing on every equation; we refer to the second part as the algebraic control problem, where we use an algebraic method to invert a linear partial differential operator that describes our system; this allows us to recover null controllability by means of internal controls which appear on only a few of the equations. We establish a null controllability result for the original problem by solving these control problems concurrently. △ Less

Submitted 15 October, 2018; v1 submitted 3 October, 2017; originally announced October 2017.

Comments: 49 pages

arXiv:1706.04082 [pdf, other]

Distributed Submodular Maximization with Limited Information

Authors: Bahman Gharesifard, Stephen L. Smith

Abstract: We consider a class of distributed submodular maximization problems in which each agent must choose a single strategy from its strategy set. The global objective is to maximize a submodular function of the strategies chosen by each agent. When choosing a strategy, each agent has access to only a limited number of other agents' choices. For each of its strategies, an agent can evaluate its marginal… ▽ More We consider a class of distributed submodular maximization problems in which each agent must choose a single strategy from its strategy set. The global objective is to maximize a submodular function of the strategies chosen by each agent. When choosing a strategy, each agent has access to only a limited number of other agents' choices. For each of its strategies, an agent can evaluate its marginal contribution to the global objective given its information. The main objective is to investigate how this limitation of information about the strategies chosen by other agents affects the performance when agents make choices according to a local greedy algorithm. In particular, we provide lower bounds on the performance of greedy algorithms for submodular maximization, which depend on the clique number of a graph that captures the information structure. We also characterize graph-theoretic upper bounds in terms of the chromatic number of the graph. Finally, we demonstrate how certain graph properties limit the performance of the greedy algorithm. Simulations on several common models for random networks demonstrate our results. △ Less

Submitted 12 June, 2017; originally announced June 2017.

Comments: 11 pages, 8 figures

arXiv:1606.08939 [pdf, ps, other]

Distributed Optimization Under Adversarial Nodes

Authors: Shreyas Sundaram, Bahman Gharesifard

Abstract: We investigate the vulnerabilities of consensus-based distributed optimization protocols to nodes that deviate from the prescribed update rule (e.g., due to failures or adversarial attacks). We first characterize certain fundamental limitations on the performance of any distributed optimization algorithm in the presence of adversaries. We then propose a resilient distributed optimization algorithm… ▽ More We investigate the vulnerabilities of consensus-based distributed optimization protocols to nodes that deviate from the prescribed update rule (e.g., due to failures or adversarial attacks). We first characterize certain fundamental limitations on the performance of any distributed optimization algorithm in the presence of adversaries. We then propose a resilient distributed optimization algorithm that guarantees that the non-adversarial nodes converge to the convex hull of the minimizers of their local functions under certain conditions on the graph topology, regardless of the actions of a certain number of adversarial nodes. In particular, we provide sufficient conditions on the graph topology to tolerate a bounded number of adversaries in the neighborhood of every non-adversarial node, and necessary and sufficient conditions to tolerate a globally bounded number of adversaries. For situations where there are up to F adversaries in the neighborhood of every node, we use the concept of maximal F-local sets of graphs to provide lower bounds on the distance-to-optimality of achievable solutions under any algorithm. We show that finding the size of such sets is NP-hard. △ Less

Submitted 28 June, 2016; originally announced June 2016.

arXiv:1407.6076 [pdf, ps, other]

Stability of Epidemic Models over Directed Graphs: A Positive Systems Approach

Authors: Ali Khanafer, Tamer Başar, Bahman Gharesifard

Abstract: We study the stability properties of a susceptible-infected-susceptible (SIS) diffusion model, so-called the $n$-intertwined Markov model, over arbitrary directed network topologies. As in the majority of the work on infection spread dynamics, this model exhibits a threshold phenomenon. When the curing rates in the network are high, the disease-free state is the unique equilibrium over the network… ▽ More We study the stability properties of a susceptible-infected-susceptible (SIS) diffusion model, so-called the $n$-intertwined Markov model, over arbitrary directed network topologies. As in the majority of the work on infection spread dynamics, this model exhibits a threshold phenomenon. When the curing rates in the network are high, the disease-free state is the unique equilibrium over the network. Otherwise, an endemic equilibrium state emerges, where some infection remains within the network. Using notions from positive systems theory, {we provide novel proofs for the global asymptotic stability of the equilibrium points in both cases over strongly connected networks based on the value of the basic reproduction number, a fundamental quantity in the study of epidemics.} When the network topology is weakly connected, we provide conditions for the existence, uniqueness, and global asymptotic stability of an endemic state, and we study the stability of the disease-free state. Finally, we demonstrate that the $n$-intertwined Markov model can be viewed as a best-response dynamical system of a concave game among the nodes. This characterization allows us to cast new infection spread dynamics; additionally, we provide a sufficient condition for the global convergence to the disease-free state, which can be checked in a distributed fashion. Several simulations demonstrate our results. △ Less

Submitted 20 February, 2015; v1 submitted 22 July, 2014; originally announced July 2014.

Comments: 13 pages, 5 figures, submitted to Automatica

arXiv:1204.0852 [pdf, ps, other]

Distributed convergence to Nash equilibria in two-network zero-sum games

Authors: Bahman Gharesifard, Jorge Cortes

Abstract: This paper considers a class of strategic scenarios in which two networks of agents have opposing objectives with regards to the optimization of a common objective function. In the resulting zero-sum game, individual agents collaborate with neighbors in their respective network and have only partial knowledge of the state of the agents in the other network. For the case when the interaction topolo… ▽ More This paper considers a class of strategic scenarios in which two networks of agents have opposing objectives with regards to the optimization of a common objective function. In the resulting zero-sum game, individual agents collaborate with neighbors in their respective network and have only partial knowledge of the state of the agents in the other network. For the case when the interaction topology of each network is undirected, we synthesize a distributed saddle-point strategy and establish its convergence to the Nash equilibrium for the class of strictly concave-convex and locally Lipschitz objective functions. We also show that this dynamics does not converge in general if the topologies are directed. This justifies the introduction, in the directed case, of a generalization of this distributed dynamics which we show converges to the Nash equilibrium for the class of strictly concave-convex differentiable functions with locally Lipschitz gradients. The technical approach combines tools from algebraic graph theory, nonsmooth analysis, set-valued dynamical systems, and game theory. △ Less

Submitted 21 December, 2012; v1 submitted 3 April, 2012; originally announced April 2012.

arXiv:1204.0304 [pdf, ps, other]

Distributed continuous-time convex optimization on weight-balanced digraphs

Authors: Bahman Gharesifard, Jorge Cortes

Abstract: This paper studies the continuous-time distributed optimization of a sum of convex functions over directed graphs. Contrary to what is known in the consensus literature, where the same dynamics works for both undirected and directed scenarios, we show that the consensus-based dynamics that solves the continuous-time distributed optimization problem for undirected graphs fails to converge when tran… ▽ More This paper studies the continuous-time distributed optimization of a sum of convex functions over directed graphs. Contrary to what is known in the consensus literature, where the same dynamics works for both undirected and directed scenarios, we show that the consensus-based dynamics that solves the continuous-time distributed optimization problem for undirected graphs fails to converge when transcribed to the directed setting. This study sets the basis for the design of an alternative distributed dynamics which we show is guaranteed to converge, on any strongly connected weight-balanced digraph, to the set of minimizers of a sum of convex differentiable functions with globally Lipschitz gradients. Our technical approach combines notions of invariance and cocoercivity with the positive definiteness properties of graph matrices to establish the results. △ Less

Submitted 21 December, 2012; v1 submitted 1 April, 2012; originally announced April 2012.

arXiv:0911.0232 [pdf, ps, other]

Distributed strategies for generating weight-balanced and doubly stochastic digraphs

Authors: Bahman Gharesifard, Jorge Cortes

Abstract: Weight-balanced and doubly stochastic digraphs are two classes of digraphs that play an essential role in a variety of cooperative control problems, including formation control, distributed averaging, and optimization. We refer to a digraph as doubly stochasticable (weight-balanceable) if it admits a doubly stochastic (weight-balanced) adjacency matrix. This paper studies the characterization of b… ▽ More Weight-balanced and doubly stochastic digraphs are two classes of digraphs that play an essential role in a variety of cooperative control problems, including formation control, distributed averaging, and optimization. We refer to a digraph as doubly stochasticable (weight-balanceable) if it admits a doubly stochastic (weight-balanced) adjacency matrix. This paper studies the characterization of both classes of digraphs, and introduces distributed algorithms to compute the appropriate set of weights in each case. △ Less

Submitted 17 October, 2011; v1 submitted 1 November, 2009; originally announced November 2009.

Showing 1–23 of 23 results for author: Gharesifard, B