Skip to main content

Showing 1–50 of 68 results for author: Mesbahi, M

.
  1. arXiv:2503.12711  [pdf, other

    math.OC eess.SY

    Intrinsic Successive Convexification: Trajectory Optimization on Smooth Manifolds

    Authors: Spencer Kraisler, Mehran Mesbahi, Behcet Acikmese

    Abstract: A fundamental issue at the core of trajectory optimization on smooth manifolds is handling the implicit manifold constraint within the dynamics. The conventional approach is to enforce the dynamic model as a constraint. However, we show this approach leads to significantly redundant operations, as well as being heavily dependent on the state space representation. Specifically, we propose an intrin… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  2. arXiv:2501.09192  [pdf, other

    math.OC cs.RO eess.SY

    Estimation-Aware Trajectory Optimization with Set-Valued Measurement Uncertainties

    Authors: Aditya Deole, Mehran Mesbahi

    Abstract: In this paper, an optimization-based framework for generating estimation-aware trajectories is presented. In this setup, measurement (output) uncertainties are state-dependent and set-valued. Enveloping ellipsoids are employed to characterize state-dependent uncertainties with unknown distributions. The concept of regularity for set-valued output maps is then introduced, facilitating the formulati… ▽ More

    Submitted 10 May, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

    Comments: 40 pages, 9 figures

  3. arXiv:2412.15573  [pdf, other

    cs.MA cs.LG

    Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems

    Authors: Joshua Holder, Natasha Jaques, Mehran Mesbahi

    Abstract: Assignment problems are a classic combinatorial optimization problem in which a group of agents must be assigned to a group of tasks such that maximum utility is achieved while satisfying assignment constraints. Given the utility of each agent completing each task, polynomial-time algorithms exist to solve a single assignment problem in its simplest form. However, in many modern-day applications s… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  4. arXiv:2406.04243  [pdf, other

    math.OC eess.SY math.DG

    Policy Optimization in Control: Geometry and Algorithmic Implications

    Authors: Shahriar Talebi, Yang Zheng, Spencer Kraisler, Na Li, Mehran Mesbahi

    Abstract: This survey explores the geometric perspective on policy optimization within the realm of feedback control systems, emphasizing the intrinsic relationship between control design and optimization. By adopting a geometric viewpoint, we aim to provide a nuanced understanding of how various ``complete parameterization'' -- referring to the policy parameters together with its Riemannian geometry -- of… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2405.16680  [pdf, ps, other

    math.OC

    Six-Degree-of-Freedom Aircraft Landing Trajectory Planning with Runway Alignment

    Authors: Taewan Kim, Abhinav G. Kamath, Niyousha Rahimi, Jasper Corleis, Behçet Açıkmeşe, Mehran Mesbahi

    Abstract: This paper presents a numerical optimization algorithm for generating approach and landing trajectories for a six-degree-of-freedom (6-DoF) aircraft. We improve on the existing research on aircraft landing trajectory generation by formulating the trajectory optimization problem with additional real-world operational constraints, including 6-DoF aircraft dynamics, runway alignment, constant wind fi… ▽ More

    Submitted 10 June, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: This article has been accepted to JGCD

  6. Output-feedback Synthesis Orbit Geometry: Quotient Manifolds and LQG Direct Policy Optimization

    Authors: Spencer Kraisler, Mehran Mesbahi

    Abstract: We consider direct policy optimization for the linear-quadratic Gaussian (LQG) setting. Over the past few years, it has been recognized that the landscape of dynamic output-feedback controllers of relevance to LQG has an intricate geometry, particularly pertaining to the existence of degenerate stationary points, that hinders gradient methods. In order to address these challenges, in this paper, w… ▽ More

    Submitted 15 August, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Journal ref: IEEE Control Systems Letters, vol. 8, pp. 1577-1582, 2024

  7. arXiv:2311.12230  [pdf, other

    eess.SY cs.LG math.OC

    Data-Guided Regulator for Adaptive Nonlinear Control

    Authors: Niyousha Rahimi, Mehran Mesbahi

    Abstract: This paper addresses the problem of designing a data-driven feedback controller for complex nonlinear dynamical systems in the presence of time-varying disturbances with unknown dynamics. Such disturbances are modeled as the "unknown" part of the system dynamics. The goal is to achieve finite-time regulation of system states through direct policy updates while also generating informative data that… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  8. arXiv:2311.10221  [pdf, other

    eess.SY

    An Active-Sensing Approach for Bearing-based Target Localization

    Authors: Beniamino Pozzan, Giulia Michieletto, Mehran Mesbahi, Angelo Cenedese

    Abstract: Characterized by a cross-disciplinary nature, the bearing-based target localization task involves estimating the position of an entity of interest by a group of agents capable of collecting noisy bearing measurements. In this work, this problem is tackled by resting both on the weighted least square estimation approach and on the active-sensing control paradigm. Indeed, we propose an iterative alg… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  9. arXiv:2308.08054  [pdf, other

    math.OC eess.SY

    Consensus on Lie groups for the Riemannian Center of Mass

    Authors: Spencer Kraisler, Shahriar Talebi, Mehran Mesbahi

    Abstract: In this paper, we develop a consensus algorithm for distributed computation of the Riemannian center of mass (RCM) on Lie Groups. The algorithm is built upon a distributed optimization reformulation that allows developing an intrinsic, distributed (without relying on a consensus subroutine), and a computationally efficient protocol for the RCM computation. The novel idea for developing this fast d… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  10. arXiv:2305.17836  [pdf, other

    eess.SY math.OC

    Data-driven Optimal Filtering for Linear Systems with Unknown Noise Covariances

    Authors: Shahriar Talebi, Amirhossein Taghvaei, Mehran Mesbahi

    Abstract: This paper examines learning the optimal filtering policy, known as the Kalman gain, for a linear system with unknown noise covariance matrices using noisy output data. The learning problem is formulated as a stochastic policy optimization problem, aiming to minimize the output prediction error. This formulation provides a direct bridge between data-driven optimal control and, its dual, optimal fi… ▽ More

    Submitted 26 October, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.14878

  11. Optimization-based Constrained Funnel Synthesis for Systems with Lipschitz Nonlinearities via Numerical Optimal Control

    Authors: Taewan Kim, Purnanand Elango, Taylor P. Reynolds, Behçet Açıkmeşe, Mehran Mesbahi

    Abstract: This paper presents a funnel synthesis algorithm for computing controlled invariant sets and feedback control gains around a given nominal trajectory for dynamical systems with locally Lipschitz nonlinearities and bounded disturbances. The resulting funnel synthesis problem involves a differential linear matrix inequality (DLMI) whose solution satisfies a Lyapunov condition that implies invariance… ▽ More

    Submitted 1 July, 2023; v1 submitted 18 March, 2023; originally announced March 2023.

    Comments: 6 pages, 3 figures, accepted to LCSS

  12. arXiv:2210.14878  [pdf, other

    eess.SY eess.SP math.OC

    Duality-Based Stochastic Policy Optimization for Estimation with Unknown Noise Covariances

    Authors: Shahriar Talebi, Amirhossein Taghvaei, Mehran Mesbahi

    Abstract: Duality of control and estimation allows mapping recent advances in data-guided control to the estimation setup. This paper formalizes and utilizes such a mapping to consider learning the optimal (steady-state) Kalman gain when process and measurement noise statistics are unknown. Specifically, building on the duality between synthesizing optimal control and estimation gains, the filter design pro… ▽ More

    Submitted 6 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  13. arXiv:2210.04810  [pdf, other

    math.OC cs.LG stat.ML

    Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

    Authors: Bin Hu, Kaiqing Zhang, Na Li, Mehran Mesbahi, Maryam Fazel, Tamer Başar

    Abstract: Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and reinforcement learning. This article surveys some of the recent developments on policy optimization, a gradient-based iterative approach for feedback control synt… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: To Appear in Annual Review of Control, Robotics, and Autonomous Systems

  14. arXiv:2209.10786  [pdf, ps, other

    eess.SY

    Vector-valued Privacy-Preserving Average Consensus

    Authors: Lulu Pan, Haibin Shao, Yang Lu, Mehran Mesbahi, Dewei Li, Yugeng Xi

    Abstract: Achieving average consensus without disclosing sensitive information can be a critical concern for multi-agent coordination. This paper examines privacy-preserving average consensus (PPAC) for vector-valued multi-agent networks. In particular, a set of agents with vector-valued states aim to collaboratively reach an exact average consensus of their initial states, while each agent's initial state… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  15. arXiv:2208.13223  [pdf, ps, other

    eess.SY

    Structural Adaptivity of Directed Networks

    Authors: Lulu Pan, Haibin Shao, Mehran Mesbahi, Dewei Li, Yugeng Xi

    Abstract: Network structure plays a critical role in functionality and performance of network systems. This paper examines structural adaptivity of diffusively coupled, directed multi-agent networks that are subject to diffusion performance. Inspired by the observation that the link redundancy in a network may degrade its diffusion performance, a distributed data-driven neighbor selection framework is propo… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

  16. arXiv:2208.08969  [pdf, other

    math.OC eess.SY

    To charge in-flight or not: an inquiry into parallel-hybrid electric aircraft configurations via optimal control

    Authors: Mengyuan Wang, Mehran Mesbahi

    Abstract: We examine two configurations for parallel hybrid electric aircraft, one with, and one without, a mechanical connection between the engines and the electric motors. For this two designs, we then review the power allocation problem in the context of aircraft energy management for a 19-seat conceptual Hybrid Electric Aircraft. We then represent the original optimal control problem as a finite-dimens… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  17. arXiv:2203.05702  [pdf, other

    math.OC

    Vertiport Selection in Hybrid Air-Ground Transportation Networks via Mathematical Programs with Equilibrium Constraints

    Authors: Yue Yu, Mengyuan Wang, Mehran Mesbahi, Ufuk Topcu

    Abstract: Urban air mobility is a concept that promotes aerial modes of transport in urban areas. In these areas, the location and capacity of the vertiports--where the travelers embark and disembark the aircraft--not only affect the flight delays of the aircraft, but can also aggravate the congestion of ground vehicles by creating extra ground travel demands. We introduce a mathematical model for selecting… ▽ More

    Submitted 1 July, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

  18. arXiv:2201.11157  [pdf, other

    math.OC eess.SY math.DG

    Policy Optimization over Submanifolds for Linearly Constrained Feedback Synthesis

    Authors: Shahriar Talebi, Mehran Mesbahi

    Abstract: In this paper, we study linearly constrained policy optimization over the manifold of Schur stabilizing controllers, equipped with a Riemannian metric that emerges naturally in the context of optimal control problems. We provide extrinsic analysis of a generic constrained smooth cost function, that subsequently facilitates subsuming any such constrained problem into this framework. By studying the… ▽ More

    Submitted 26 October, 2023; v1 submitted 26 January, 2022; originally announced January 2022.

  19. arXiv:2109.02347  [pdf, ps, other

    math.OC

    Discrete-Time Linear-Quadratic Regulation via Optimal Transport

    Authors: Mathias Hudoba de Badyn, Erik Miehling, Dylan Janak, Behçet Açıkmeşe, Mehran Mesbahi, Tamer Başar, John Lygeros, Roy S. Smith

    Abstract: In this paper, we consider a discrete-time stochastic control problem with uncertain initial and target states. We first discuss the connection between optimal transport and stochastic control problems of this form. Next, we formulate a linear-quadratic regulator problem where the initial and terminal states are distributed according to specified probability densities. A closed-form solution for t… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 8 pages, 6 figures. To be included in the Proceedings of the 60th Conference on Decision and Control. This version includes proofs

  20. arXiv:2107.12022  [pdf, ps, other

    eess.SY cs.MA

    Distributed Neighbor Selection in Multi-agent Networks

    Authors: Haibin Shao, Lulu Pan, Mehran Mesbahi, Yugeng Xi, Dewei Li

    Abstract: Achieving consensus via nearest neighbor rules is an important prerequisite for multi-agent networks to accomplish collective tasks. A common assumption in consensus setup is that each agent interacts with all its neighbors. This paper examines whether network functionality and performance can be maintained-and even enhanced-when agents interact only with a subset of their respective (available) n… ▽ More

    Submitted 22 June, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

  21. arXiv:2107.09292  [pdf, ps, other

    eess.SY cs.MA

    Cluster Consensus on Matrix-weighted Switching Networks

    Authors: Lulu Pan, Haibin Shao, Mehran Mesbahi, Dewei Li, Yugeng Xi

    Abstract: This paper examines the cluster consensus problem of multi-agent systems on matrix-weighted switching networks. Necessary and/or sufficient conditions under which cluster consensus can be achieved are obtained and quantitative characterization of the steady-state of the cluster consensus are provided as well. Specifically, if the underlying network switches amongst finite number of networks, a nec… ▽ More

    Submitted 20 July, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

  22. arXiv:2103.11572  [pdf, other

    eess.SY cs.MA math.OC

    Data-Driven Structured Policy Iteration for Homogeneous Distributed Systems

    Authors: Siavash Alemzadeh, Shahriar Talebi, Mehran Mesbahi

    Abstract: Control of networked systems, comprised of interacting agents, is often achieved through modeling the underlying interactions. Constructing accurate models of such interactions--in the meantime--can become prohibitive in applications. Data-driven control methods avoid such complications by directly synthesizing a controller from the observed data. In this paper, we propose an algorithm referred to… ▽ More

    Submitted 16 November, 2023; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: S. Alemzadeh and S. Talebi contributed equally to this work

  23. arXiv:2102.02953  [pdf, other

    eess.SY

    On Controllability and Persistency of Excitation in Data-Driven Control: Extensions of Willems' Fundamental Lemma

    Authors: Yue Yu, Shahriar Talebi, Henk J. van Waarde, Ufuk Topcu, Mehran Mesbahi, Behçet Açıkmeşe

    Abstract: Willems' fundamental lemma asserts that all trajectories of a linear time-invariant system can be obtained from a finite number of measured ones, assuming that controllability and a persistency of excitation condition hold. We show that these two conditions can be relaxed. First, we prove that the controllability condition can be replaced by a condition on the controllable subspace, unobservable s… ▽ More

    Submitted 9 April, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

  24. arXiv:2007.10960  [pdf, other

    cs.LG stat.ML

    Adaptive Traffic Control with Deep Reinforcement Learning: Towards State-of-the-art and Beyond

    Authors: Siavash Alemzadeh, Ramin Moslemi, Ratnesh Sharma, Mehran Mesbahi

    Abstract: In this work, we study adaptive data-guided traffic planning and control using Reinforcement Learning (RL). We shift from the plain use of classic methods towards state-of-the-art in deep RL community. We embed several recent techniques in our algorithm that improve the original Deep Q-Networks (DQN) for discrete control and discuss the traffic-related interpretations that follow. We propose a nov… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  25. arXiv:2007.05880  [pdf, other

    eess.SP cs.LG

    Deep Learning-based Resource Allocation for Infrastructure Resilience

    Authors: Siavash Alemzadeh, Hesam Talebiyan, Shahriar Talebi, Leonardo Duenas-Osorio, Mehran Mesbahi

    Abstract: From an optimization point of view, resource allocation is one of the cornerstones of research for addressing limiting factors commonly arising in applications such as power outages and traffic jams. In this paper, we take a data-driven approach to estimate an optimal nodal restoration sequence for immediate recovery of the infrastructure networks after natural disasters such as earthquakes. We ge… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

  26. arXiv:2006.16201  [pdf, ps, other

    math.OC

    Graph-theoretic optimization for edge consensus

    Authors: Mathias Hudoba de Badyn, Dillon R. Foight, Daniel Calderone, Mehran Mesbahi, Roy S. Smith

    Abstract: We consider network structures that optimize the $\mathcal{H}_2$ norm of weighted, time scaled consensus networks, under a minimal representation of such consensus networks described by the edge Laplacian. We show that a greedy algorithm can be used to find the minimum-$\mathcal{H}_2$ norm spanning tree, as well as how to choose edges to optimize the $\mathcal{H}_2$ norm when edges are added back… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 8 pages, 3 figures. Accepted to the 24th International Symposium on Mathematical Theory of Networks and Systems (MTNS 2020), which has been postponed to August 2021. This version is the extended paper, which includes the proofs that were submitted for review

  27. arXiv:2006.09178  [pdf, other

    eess.SY math.OC

    Policy Gradient-based Algorithms for Continuous-time Linear Quadratic Control

    Authors: Jingjing Bu, Afshin Mesbahi, Mehran Mesbahi

    Abstract: We consider the continuous-time Linear-Quadratic-Regulator (LQR) problem in terms of optimizing a real-valued matrix function over the set of feedback gains. The results developed are in parallel to those in Bu et al. [1] for discrete-time LTI systems. In this direction, we characterize several analytical properties (smoothness, coerciveness, quadratic growth) that are crucial in the analysis of g… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1907.08921

  28. arXiv:2006.08548  [pdf, ps, other

    math.OC

    A Note on Nesterov's Accelerated Method in Nonconvex Optimization: a Weak Estimate Sequence Approach

    Authors: Jingjing Bu, Mehran Mesbahi

    Abstract: We present a variant of accelerated gradient descent algorithms, adapted from Nesterov's optimal first-order methods, for weakly-quasi-convex and weakly-quasi-strongly-convex functions. We show that by tweaking the so-called estimate sequence method, the derived algorithm achieves optimal convergence rate for weakly-quasi-convex and weakly-quasi-strongly-convex in terms of oracle complexity. In pa… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  29. Performance and design of consensus on matrix-weighted and time scaled graphs

    Authors: Dillon R. Foight, Mathias Hudoba de Badyn, Mehran Mesbahi

    Abstract: In this paper, we consider the $\mathcal{H}_2$-norm of networked systems with multi-time scale consensus dynamics and vector-valued agent states. This allows us to explore how measurement and process noise affect consensus on matrix-weighted graphs by examining edge-state consensus. In particular, we highlight an interesting case where the influences of the weighting and scaling on the… ▽ More

    Submitted 24 December, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 10 pages, 5 figures, accepted to the IEEE Transactions on Control of Network Systems. arXiv admin note: text overlap with arXiv:1909.07864

    Journal ref: IEEE Transactions on Control of Network Systems, 2020, vol. 7, no. 4, pp. 1812-1822

  30. arXiv:2006.00870  [pdf, ps, other

    math.OC

    From noisy data to feedback controllers: non-conservative design via a matrix S-lemma

    Authors: Henk J. van Waarde, M. Kanat Camlibel, Mehran Mesbahi

    Abstract: We propose a new method to obtain feedback controllers of an unknown dynamical system directly from noisy input/state data. The key ingredient of our design is a new matrix S-lemma that will be proven in this paper. We provide both strict and non-strict versions of this S-lemma, that are of interest in their own right. Thereafter, we will apply these results to data-driven control. In particular,… ▽ More

    Submitted 9 December, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

  31. On Regularizability and its Application to Online Control of Unstable LTI Systems

    Authors: Shahriar Talebi, Siavash Alemzadeh, Niyousha Rahimi, Mehran Mesbahi

    Abstract: Learning, say through direct policy updates, often requires assumptions such as knowing a priori that the initial policy (gain) is stabilizing, or persistently exciting (PE) input-output data, is available. In this paper, we examine online regulation of (possibly unstable) partially unknown linear systems with no prior access to an initial stabilizing controller nor PE input-output data; we instea… ▽ More

    Submitted 19 January, 2022; v1 submitted 29 May, 2020; originally announced June 2020.

  32. $\mathcal{H}_2$ performance of series-parallel networks: A compositional perspective

    Authors: Mathias Hudoba de Badyn, Mehran Mesbahi

    Abstract: We examine the $\mathcal{H}_2$ norm of matrix-weighted leader-follower consensus on series-parallel networks. By using an extension of electrical network theory on matrix-valued resistances, voltages and currents, we show that the computation of the $\mathcal{H}_2$ norm can be performed efficiently by decomposing the network into atomic elements and composition rules. Lastly, we examine the proble… ▽ More

    Submitted 24 December, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Provisionally accepted to the IEEE Transactions on Automatic Control. arXiv admin note: substantial text overlap with arXiv:1903.05325

    Journal ref: IEEE Transactions on Automatic Control, 2021, vol. 6, no. 1, pp. 354 - 361

  33. arXiv:2002.05023  [pdf, other

    math.OC eess.SY

    Global Convergence of Policy Gradient Algorithms for Indefinite Least Squares Stationary Optimal Control

    Authors: Jingjing Bu, Mehran Mesbahi

    Abstract: We consider policy gradient algorithms for the indefinite least squares stationary optimal control, e.g., linear-quadratic-regulator (LQR) with indefinite state and input penalization matrices. Such a setup has important applications in control design with conflicting objectives, such as linear quadratic dynamic games. We show the global convergence of gradient, natural gradient and quasi-Newton p… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1911.04672

  34. arXiv:2001.11179  [pdf, ps, other

    eess.SY

    Consensus on Matrix-weighted Time-varying Networks

    Authors: Lulu Pan, Haibin Shao, Mehran Mesbahi, Yugeng Xi, Dewei Li

    Abstract: This paper examines the consensus problem on time-varying matrix-weighed undirected networks. First, we introduce the matrix-weighted integral network for the analysis of such networks. Under mild assumptions on the switching pattern of the time-varying network, necessary and/or sufficient conditions for which average consensus can be achieved are then provided in terms of the null space of matrix… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

  35. arXiv:2001.04035  [pdf, ps, other

    eess.SY math.OC

    On the Controllability of Matrix-weighted Networks

    Authors: Lulu Pan, Haibin Shao, Mehran Mesbahi, Yugeng Xi, Dewei Li

    Abstract: This letter examines the controllability of consensus dynamics on matrix-weighed networks from a graph-theoretic perspective. Unlike the scalar-weighted networks, the rank of weight matrix introduces additional intricacies into characterizing the dimension of controllable subspace for such networks. Specifically, we investigate how the definiteness of weight matrices influences the dimension of th… ▽ More

    Submitted 12 January, 2020; originally announced January 2020.

  36. arXiv:1912.07671  [pdf, other

    math.OC

    Data-driven parameterizations of suboptimal LQR and H2 controllers

    Authors: Henk J. van Waarde, Mehran Mesbahi

    Abstract: In this paper we design suboptimal control laws for an unknown linear system on the basis of measured data. We focus on the suboptimal linear quadratic regulator problem and the suboptimal H2 control problem. For both problems, we establish conditions under which a given data set contains sufficient information for controller design. We follow up by providing a data-driven parameterization of all… ▽ More

    Submitted 7 May, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: 6 pages

  37. arXiv:1911.04672  [pdf, ps, other

    eess.SY math.OC

    Global Convergence of Policy Gradient for Sequential Zero-Sum Linear Quadratic Dynamic Games

    Authors: Jingjing Bu, Lillian J. Ratliff, Mehran Mesbahi

    Abstract: We propose projection-free sequential algorithms for linear-quadratic dynamics games. These policy gradient based algorithms are akin to Stackelberg leadership model and can be extended to model-free settings. We show that if the leader performs natural gradient descent/ascent, then the proposed algorithm has a global sublinear convergence to the Nash equilibrium. Moreover, if the leader adopts a… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  38. arXiv:1909.07864  [pdf, ps, other

    math.OC

    Time scale design for network resilience

    Authors: Dillon R. Foight, Mathias Hudoba de Badyn, Mehran Mesbahi

    Abstract: In this paper we consider the $\mathcal{H}_2$-norm of networked systems with multi-time scale consensus dynamics. We develop a general framework for such systems that allows for edge weighting, independent agent-based time scales, as well as measurement and process noise. From this general system description, we highlight an interesting case where the influences of the weighting and scaling can be… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 6 pages, accepted to 58th IEEE Conference on Decision and Control

  39. arXiv:1908.11329  [pdf, other

    eess.SY

    Augmented State Feedback for Improving Observability of Linear Systems with Nonlinear Measurements

    Authors: Atiye Alaeddini, Kristi A. Morgansen, Mehran Mesbahi

    Abstract: This paper is concerned with the design of an augmented state feedback controller for finite-dimensional linear systems with nonlinear observation dynamics. Most of the theoretical results in the area of (optimal) feedback design are based on the assumption that the state is available for measurement. In this paper, we focus on finding a feedback control that avoids state trajectories with undesir… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted in System and Control Letters

  40. Strong Structural Controllability of Signed Networks

    Authors: Shima Sadat Mousavi, Mohammad Haeri, Mehran Mesbahi

    Abstract: In this paper, we discuss the controllability of a family of linear time-invariant (LTI) networks defined on a signed graph. In this direction, we introduce the notion of positive and negative signed zero forcing sets for the controllability analysis of positive and negative eigenvalues of system matrices with the same sign pattern. A sufficient combinatorial condition that ensures the strong stru… ▽ More

    Submitted 10 October, 2019; v1 submitted 15 August, 2019; originally announced August 2019.

  41. arXiv:1907.08921  [pdf, other

    eess.SY math.OC

    LQR through the Lens of First Order Methods: Discrete-time Case

    Authors: Jingjing Bu, Afshin Mesbahi, Maryam Fazel, Mehran Mesbahi

    Abstract: We consider the Linear-Quadratic-Regulator (LQR) problem in terms of optimizing a real-valued matrix function over the set of feedback gains. Such a setup facilitates examining the implications of a natural initial-state independent formulation of LQR in designing first order algorithms. It is shown that this cost function is smooth and coercive, and provide an alternate means of noting its gradie… ▽ More

    Submitted 29 July, 2019; v1 submitted 21 July, 2019; originally announced July 2019.

  42. arXiv:1906.04857  [pdf, other

    math.OC

    Fast Trajectory Optimization via Successive Convexification for Spacecraft Rendezvous with Integer Constraints

    Authors: Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Behcet Acikmese, Mehran Mesbahi

    Abstract: In this paper we present a fast method based on successive convexification for generating fuel-optimized spacecraft rendezvous trajectories in the presence of mixed-integer constraints. A recently developed paradigm of state-triggered constraints allows to efficiently embed a subset of discrete decision constraints into the continuous optimization framework of successive convexification. As a resu… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: 23 pages, 10 figures, submitted to AIAA SciTech 2020

  43. Strong Structural Controllability of Networks under Time-Invariant and Time-Varying Topological Perturbations

    Authors: Shima Sadat Mousavi, Mohammad Haeri, Mehran Mesbahi

    Abstract: This paper investigates the robustness of strong structural controllability for linear time-invariant and linear time-varying directed networks with respect to structural perturbations, including edge deletions and additions. In this direction, we introduce a new construct referred to as a perfect graph associated with a network with a given set of control nodes. The tight upper bounds on the numb… ▽ More

    Submitted 21 May, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

  44. Dual Quaternion Based Powered Descent Guidance with State-Triggered Constraints

    Authors: Taylor P. Reynolds, Michael Szmuk, Danylo Malyuta, Mehran Mesbahi, Behcet Acikmese, John M. Carson III

    Abstract: This paper presents a numerical algorithm for computing 6-degree-of-freedom free-final-time powered descent guidance trajectories. The trajectory generation problem is formulated using a unit dual quaternion representation of the rigid body dynamics, and several standard path constraints. Our formulation also includes a special line of sight constraints that is enforced only within a specified ban… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: Submitted to the AIAA Journal of Guidance, Control, and Dynamics

  45. arXiv:1904.08451  [pdf, other

    eess.SY

    On Topological Properties of the Set of Stabilizing Feedback Gains

    Authors: Jingjing Bu, Afshin Mesbahi, Mehran Mesbahi

    Abstract: This work presents a fairly complete account on various topological and metrical aspects of feedback stabilization for single-input-single-output (SISO) continuous and discrete time linear-time-invariant (LTI) systems. In particular, we prove that the set of stabilizing output feedback gains for a SISO system with n states has at most $\lceil{\frac{n}{2}}\rceil$ connected components. Furthermore,… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  46. arXiv:1904.08449  [pdf, ps, other

    eess.SY

    Nonlinear Observability via Koopman Analysis: Characterizing the Role of Symmetry

    Authors: Afshin Mesbahi, Jingjing Bu, Mehran Mesbahi

    Abstract: This paper considers the observability of nonlinear systems from a Koopman operator theoretic perspective--and in particular--the effect of symmetry on observability. We first examine an infinite-dimensional linear system (constructed using independent Koopman eigenfunctions) such that its observability is equivalent to the observability of the original nonlinear system. Next, we derive an analyti… ▽ More

    Submitted 10 February, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

  47. arXiv:1904.02737  [pdf, ps, other

    eess.SY

    On Topological and Metrical Properties of Stabilizing Feedback Gains: the MIMO Case

    Authors: Jingjing Bu, Afshin Mesbahi, Mehran Mesbahi

    Abstract: In this paper, we discuss various topological and metrical aspects of the set of stabilizing static feedback gains for multiple-input-multiple-output (MIMO) linear-time-invariant (LTI) systems, in both continuous and discrete-time. Recently, connectivity properties of this set (for continuous time) have been reported in the literature, along with a discussion on how this connectivity is affected b… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: 17 pages

  48. Efficient Computation of H2 Performance on Series-Parallel Networks

    Authors: Mathias Hudoba de Badyn, Mehran Mesbahi

    Abstract: Series-parallel networks are a class of graphs on which many NP-hard problems have tractable solutions. In this paper, we examine performance measures on leader-follower consensus on series-parallel networks. We show that a distributed computation of the $\mathcal{H}_2$ norm can be done efficiently on this system by exploiting a decomposition of the network into atomic elements and composition rul… ▽ More

    Submitted 25 April, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 6 pages, 5 figures. To appear in proceedings of the 2019 American Control Conference

    Journal ref: Proc. 2019 American Control Conference, pp. 3364-3369

  49. arXiv:1901.02181  [pdf, other

    math.OC

    Successive Convexification for 6-DoF Powered Descent Guidance with Compound State-Triggered Constraints

    Authors: Michael Szmuk, Taylor P. Reynolds, Behcet Acikmese, Mehran Mesbahi, John M. Carson III

    Abstract: This paper introduces a continuous formulation for compound state-triggered constraints, which are generalizations of the recently introduced state-triggered constraints. State-triggered constraints are different from ordinary constraints found in optimal control in that they use a state-dependent trigger condition to enable or disable a constraint condition, and can be expressed as continuous fun… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: This paper is a modified version of the one presented at the 2019 AIAA Guidance, Navigation, and Control Conference (SciTech) in San Diego, California (17 pages, 10 figures)

  50. arXiv:1809.08745  [pdf, other

    math.OC

    Distributed Q-Learning for Dynamically Decoupled Systems

    Authors: Siavash Alemzadeh, Mehran Mesbahi

    Abstract: Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents. However in many applications, building accurate models of agents or interactions amongst them might be infeasible or computationally prohibitive due to the curse of dimensionality or the complexity of these interactions. In the meantime, data-guided control method… ▽ More

    Submitted 19 March, 2019; v1 submitted 24 September, 2018; originally announced September 2018.