-
Stability properties of gradient flow dynamics for the symmetric low-rank matrix factorization problem
Authors:
Hesameddin Mohammadi,
Mohammad Tinati,
Stephen Tu,
Mahdi Soltanolkotabi,
Mihailo R. Jovanović
Abstract:
The symmetric low-rank matrix factorization serves as a building block in many learning tasks, including matrix recovery and training of neural networks. However, despite a flurry of recent research, the dynamics of its training via non-convex factorized gradient-descent-type methods is not fully understood especially in the over-parameterized regime where the fitted rank is higher than the true r…
▽ More
The symmetric low-rank matrix factorization serves as a building block in many learning tasks, including matrix recovery and training of neural networks. However, despite a flurry of recent research, the dynamics of its training via non-convex factorized gradient-descent-type methods is not fully understood especially in the over-parameterized regime where the fitted rank is higher than the true rank of the target matrix. To overcome this challenge, we characterize equilibrium points of the gradient flow dynamics and examine their local and global stability properties. To facilitate a precise global analysis, we introduce a nonlinear change of variables that brings the dynamics into a cascade connection of three subsystems whose structure is simpler than the structure of the original system. We demonstrate that the Schur complement to a principal eigenspace of the target matrix is governed by an autonomous system that is decoupled from the rest of the dynamics. In the over-parameterized regime, we show that this Schur complement vanishes at an $O(1/t)$ rate, thereby capturing the slow dynamics that arises from excess parameters. We utilize a Lyapunov-based approach to establish exponential convergence of the other two subsystems. By decoupling the fast and slow parts of the dynamics, we offer new insight into the shape of the trajectories associated with local search algorithms and provide a complete characterization of the equilibrium points and their global stability properties. Such an analysis via nonlinear control techniques may prove useful in several related over-parameterized problems.
△ Less
Submitted 24 November, 2024;
originally announced November 2024.
-
Cohomology rings of oriented Grassmann manifolds $\widetilde G_{2^t,4}$
Authors:
Uroš A. Colović,
Milica Jovanović,
Branislav I. Prvulović
Abstract:
We give a description of the mod 2 cohomology algebra of the oriented Grassmann manifold $\widetilde G_{2^t,4}$ as the quotient of a polynomial algebra by a certain ideal. In the process we find a Gröbner basis for that ideal, which we then use to exhibit an additive basis for $H^*(\widetilde G_{2^t,4};\mathbb Z_2)$.
We give a description of the mod 2 cohomology algebra of the oriented Grassmann manifold $\widetilde G_{2^t,4}$ as the quotient of a polynomial algebra by a certain ideal. In the process we find a Gröbner basis for that ideal, which we then use to exhibit an additive basis for $H^*(\widetilde G_{2^t,4};\mathbb Z_2)$.
△ Less
Submitted 11 October, 2024;
originally announced October 2024.
-
Tannenbaum's gain-margin optimization meets Polyak's heavy-ball algorithm
Authors:
Wuwei Wu,
Jie Chen,
Mihailo R. Jovanović,
Tryphon T. Georgiou
Abstract:
The paper highlights a relatively unknown link between algorithm design in optimization and control synthesis in robust control. Specifically, quadratic optimization can be recast as a regulation problem within the framework of $\mathcal{H}_\infty$ control. From this vantage point, the optimality of Polyak's fastest heavy-ball algorithm can be ascertained as a solution to a gain margin optimizatio…
▽ More
The paper highlights a relatively unknown link between algorithm design in optimization and control synthesis in robust control. Specifically, quadratic optimization can be recast as a regulation problem within the framework of $\mathcal{H}_\infty$ control. From this vantage point, the optimality of Polyak's fastest heavy-ball algorithm can be ascertained as a solution to a gain margin optimization problem. The approach is independent of Polyak's original and brilliant argument, yet simpler, and relies on the foundational work by Tannenbaum that introduced and solved the gain margin optimization via Nevanlinna--Pick interpolation theory. The link between first-order optimization methods and robust control theory sheds new light into limits of algorithmic performance for such methods, and suggests a new framework where similar computational problems can be systematically studied and algorithms optimized. In particular, it raises the question as to whether periodically scheduled algorithms can achieve faster rates for quadratic optimization, in a manner analogous to periodic control that extends gain margin beyond that of time-invariant control. This turns out not to be the case, due to the analytic obstruction of a transmission zero that is inherent in causal optimization algorithms. Interestingly, this obstruction can be removed with implicit algorithms, cast in a similar manner as feedback regulation problems with causal, but not strictly causal dynamics, thereby devoid of the transmission zero at infinity and able to achieve superior convergence rates. The confluence of the fields of optimization algorithms and control provides a frame to tackle questions pertaining to speed, accuracy, distributed computation, and so forth, and to delineate respective limits to performance and tradeoffs in a systematic manner, utilizing the formalism of robust control.
△ Less
Submitted 29 September, 2024;
originally announced September 2024.
-
From exponential to finite/fixed-time stability: Applications to optimization
Authors:
Ibrahim K. Ozaslan,
Mihailo R. Jovanović
Abstract:
The development of finite/fixed-time stable optimization algorithms typically involves study of specific problem instances. The lack of a unified framework hinders understanding of more sophisticated algorithms, e.g., primal-dual gradient flow dynamics. The purpose of this paper is to address the following question: Given an exponentially stable optimization algorithm, can it be modified to obtain…
▽ More
The development of finite/fixed-time stable optimization algorithms typically involves study of specific problem instances. The lack of a unified framework hinders understanding of more sophisticated algorithms, e.g., primal-dual gradient flow dynamics. The purpose of this paper is to address the following question: Given an exponentially stable optimization algorithm, can it be modified to obtain a finite/fixed-time stable algorithm? We provide an affirmative answer, demonstrate how the solution can be computed on a finite-time interval via a simple scaling of the right-hand-side of the original dynamics, and certify the desired properties of the modified algorithm using the Lyapunov function that proves exponential stability of the original system. Finally, we examine nonsmooth composite optimization problems and smooth problems with linear constraints to demonstrate the merits of our approach.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Stability of Primal-Dual Gradient Flow Dynamics for Multi-Block Convex Optimization Problems
Authors:
Ibrahim K. Ozaslan,
Panagiotis Patrinos,
Mihailo R. Jovanović
Abstract:
We examine stability properties of primal-dual gradient flow dynamics for composite convex optimization problems with multiple, possibly nonsmooth, terms in the objective function under the generalized consensus constraint. The proposed dynamics are based on the proximal augmented Lagrangian and they provide a viable alternative to ADMM which faces significant challenges from both analysis and imp…
▽ More
We examine stability properties of primal-dual gradient flow dynamics for composite convex optimization problems with multiple, possibly nonsmooth, terms in the objective function under the generalized consensus constraint. The proposed dynamics are based on the proximal augmented Lagrangian and they provide a viable alternative to ADMM which faces significant challenges from both analysis and implementation viewpoints in large-scale multi-block scenarios. In contrast to customized algorithms with individualized convergence guarantees, we provide a systematic approach for solving a broad class of challenging composite optimization problems. We leverage various structural properties to establish global (exponential) convergence guarantees for the proposed dynamics. Our assumptions are much weaker than those required to prove (exponential) stability of various primal-dual dynamics as well as (linear) convergence of discrete-time methods, e.g., standard two-block and multi-block ADMM and EXTRA algorithms. Finally, we show necessity of some of our structural assumptions for exponential stability and provide computational experiments to demonstrate the convenience of the proposed dynamics for parallel and distributed computing applications.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Accelerated forward-backward and Douglas-Rachford splitting dynamics
Authors:
Ibrahim K. Ozaslan,
Mihailo R. Jovanović
Abstract:
We examine convergence properties of continuous-time variants of accelerated Forward-Backward (FB) and Douglas-Rachford (DR) splitting algorithms for nonsmooth composite optimization problems. When the objective function is given by the sum of a quadratic and a nonsmooth term, we establish accelerated sublinear and exponential convergence rates for convex and strongly convex problems, respectively…
▽ More
We examine convergence properties of continuous-time variants of accelerated Forward-Backward (FB) and Douglas-Rachford (DR) splitting algorithms for nonsmooth composite optimization problems. When the objective function is given by the sum of a quadratic and a nonsmooth term, we establish accelerated sublinear and exponential convergence rates for convex and strongly convex problems, respectively. Moreover, for FB splitting dynamics, we demonstrate that accelerated exponential convergence rate carries over to general strongly convex problems. In our Lyapunov-based analysis we exploit the variable-metric gradient interpretations of FB and DR splittings to obtain smooth Lyapunov functions that allow us to establish accelerated convergence rates. We provide computational experiments to demonstrate the merits and the effectiveness of our analysis.
△ Less
Submitted 24 November, 2024; v1 submitted 30 July, 2024;
originally announced July 2024.
-
Line graphs and Nordhaus-Gaddum-type bounds for self-loop graphs
Authors:
Saieed Akbari,
Irena M. Jovanović,
Johnny Lim
Abstract:
Let $G_S$ be the graph obtained by attaching a self-loop at every vertex in $S \subseteq V(G)$ of a simple graph $G$ of order $n.$ In this paper, we explore several new results related to the line graph $L(G_S)$ of $G_S.$ Particularly, we show that every eigenvalue of $L(G_S)$ must be at least $-2,$ and relate the characteristic polynomial of the line graph $L(G)$ of $G$ with the characteristic po…
▽ More
Let $G_S$ be the graph obtained by attaching a self-loop at every vertex in $S \subseteq V(G)$ of a simple graph $G$ of order $n.$ In this paper, we explore several new results related to the line graph $L(G_S)$ of $G_S.$ Particularly, we show that every eigenvalue of $L(G_S)$ must be at least $-2,$ and relate the characteristic polynomial of the line graph $L(G)$ of $G$ with the characteristic polynomial of the line graph $L(\widehat{G})$ of a self-loop graph $\widehat{G}$, which is obtained by attaching a self-loop at each vertex of $G$. Then, we provide some new bounds for the eigenvalues and energy of $G_S.$ As one of the consequences, we obtain that the energy of a connected regular complete multipartite graph is not greater than the energy of the corresponding self-loop graph. Lastly, we establish a lower bound of the spectral radius in terms of the first Zagreb index $M_1(G)$ and the minimum degree $δ(G),$ as well as proving two Nordhaus-Gaddum-type bounds for the spectral radius and the energy of $G_S,$ respectively.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Steenrod operations on polyhedral products
Authors:
Sanjana Agarwal,
Jelena Grbić,
Michele Intermont,
Milica Jovanović,
Evgeniya Lagoda,
Sarah Whitehouse
Abstract:
We describe the action of the mod $2$ Steenrod algebra on the cohomology of various polyhedral products and related spaces. We carry this out for Davis-Januszkiewicz spaces and their generalizations, for moment-angle complexes as well as for certain polyhedral joins. By studying the combinatorics of underlying simplicial complexes, we deduce some consequences for the lowest cohomological dimension…
▽ More
We describe the action of the mod $2$ Steenrod algebra on the cohomology of various polyhedral products and related spaces. We carry this out for Davis-Januszkiewicz spaces and their generalizations, for moment-angle complexes as well as for certain polyhedral joins. By studying the combinatorics of underlying simplicial complexes, we deduce some consequences for the lowest cohomological dimension in which non-trivial Steenrod operations can appear.
We present a version of cochain-level formulas for Steenrod operations on simplicial complexes. We explain the idea of "propagating" such formulas from a simplicial complex $K$ to polyhedral joins over $K$ and we give examples of this process. We tie the propagation of the Steenrod algebra actions on polyhedral joins to those on moment-angle complexes. Although these are cases where one can understand the Steenrod action via a stable homotopy decomposition, we anticipate applying this method to cases where there is no such decomposition.
△ Less
Submitted 20 June, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
On the mod 2 cohomology algebra of oriented Grassmannians
Authors:
Milica Jovanović,
Branislav I. Prvulović
Abstract:
For $n\in\{2^t-3,2^t-2,2^t-1\}$ ($t\ge3$) we study the cohomology algebra $H^*(\widetilde G_{n,3};\mathbb Z_2)$ of the Grassmann manifold $\widetilde G_{n,3}$ of oriented $3$-dimensional subspaces of $\mathbb R^n$. A complete description of $H^*(\widetilde G_{n,3};\mathbb Z_2)$ is given in the cases $n=2^t-3$ and $n=2^t-2$, while in the case $n=2^t-1$ we obtain a description complete up to a coeff…
▽ More
For $n\in\{2^t-3,2^t-2,2^t-1\}$ ($t\ge3$) we study the cohomology algebra $H^*(\widetilde G_{n,3};\mathbb Z_2)$ of the Grassmann manifold $\widetilde G_{n,3}$ of oriented $3$-dimensional subspaces of $\mathbb R^n$. A complete description of $H^*(\widetilde G_{n,3};\mathbb Z_2)$ is given in the cases $n=2^t-3$ and $n=2^t-2$, while in the case $n=2^t-1$ we obtain a description complete up to a coefficient from $\mathbb Z_2$.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Authors:
Dongsheng Ding,
Xiaohan Wei,
Zhuoran Yang,
Zhaoran Wang,
Mihailo R. Jovanović
Abstract:
We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic two-player zero-sum constrained Markov game with independent transition functions that are unknown to agents, adversarial reward functions, and stochastic util…
▽ More
We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic two-player zero-sum constrained Markov game with independent transition functions that are unknown to agents, adversarial reward functions, and stochastic utility functions. For such a Markov game, we employ an approach based on the occupancy measure to formulate it as an online constrained saddle-point problem with an explicit constraint. We extend the Lagrange multiplier method in constrained optimization to handle the constraint by creating a generalized Lagrangian with minimax decision primal variables and a dual variable. Next, we develop an upper confidence reinforcement learning algorithm to solve this Lagrangian problem while balancing exploration and exploitation. Our algorithm updates the minimax decision primal variables via online mirror descent and the dual variable via projected gradient step and we prove that it enjoys sublinear rate $ O((|X|+|Y|) L \sqrt{T(|A|+|B|)}))$ for both regret and constraint violation after playing $T$ episodes of the game. Here, $L$ is the horizon of each episode, $(|X|,|A|)$ and $(|Y|,|B|)$ are the state/action space sizes of the min-player and the max-player, respectively. To the best of our knowledge, we provide the first provably efficient online safe reinforcement learning algorithm in constrained Markov games.
△ Less
Submitted 31 May, 2023;
originally announced June 2023.
-
On integral cohomology algebra of some oriented Grassmann manifolds
Authors:
Milica Jovanović
Abstract:
The integral cohomology algebra of $\widetilde G_{6,3}$ has been determined in the recent work of Kalafat and Yalçinkaya. We completely determine the integral cohomology algebra of $\widetilde G_{n,3}$ for $n=8$ and $n=10$. The main method used to describe these algebras is the Leray-Serre spectral sequence. We also illustrate this method by determining the integral cohomology algebra of…
▽ More
The integral cohomology algebra of $\widetilde G_{6,3}$ has been determined in the recent work of Kalafat and Yalçinkaya. We completely determine the integral cohomology algebra of $\widetilde G_{n,3}$ for $n=8$ and $n=10$. The main method used to describe these algebras is the Leray-Serre spectral sequence. We also illustrate this method by determining the integral cohomology algebra of $\widetilde G_{n,2}$ for $n$ odd.
△ Less
Submitted 31 July, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Tradeoffs between convergence rate and noise amplification for momentum-based accelerated optimization algorithms
Authors:
Hesameddin Mohammadi,
Meisam Razaviyayn,
Mihailo R. Jovanović
Abstract:
We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic prob…
▽ More
We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic problems, we use the steady-state variance of the error in the optimization variable to quantify noise amplification and identify fundamental stochastic performance tradeoffs. Our approach utilizes the Jury stability criterion to provide a novel geometric characterization of conditions for linear convergence, and it reveals the relation between the noise amplification and convergence rate as well as their dependence on the condition number and the constant algorithmic parameters. This geometric insight leads to simple alternative proofs of standard convergence results and allows us to establish ``uncertainty principle'' of strongly convex optimization: for the two-step momentum method with linear convergence rate, the lower bound on the product between the settling time and noise amplification scales quadratically with the condition number. Our analysis also identifies a key difference between the gradient and iterate noise models: while the amplification of gradient noise can be made arbitrarily small by sufficiently decelerating the algorithm, the best achievable variance for the iterate noise model increases linearly with the settling time in the decelerating regime. Finally, we introduce two parameterized families of algorithms that strike a balance between noise amplification and settling time while preserving order-wise Pareto optimality for both noise models.
△ Less
Submitted 19 June, 2024; v1 submitted 24 September, 2022;
originally announced September 2022.
-
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Authors:
Dongsheng Ding,
Kaiqing Zhang,
Jiali Duan,
Tamer Başar,
Mihailo R. Jovanović
Abstract:
We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD)…
▽ More
We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD) method that updates the primal variable via natural policy gradient ascent and the dual variable via projected sub-gradient descent. Although the underlying maximization involves a nonconcave objective function and a nonconvex constraint set, under the softmax policy parametrization we prove that our method achieves global convergence with sublinear rates regarding both the optimality gap and the constraint violation. Such convergence is independent of the size of the state-action space, i.e., it is~dimension-free. Furthermore, for log-linear and general smooth policy parametrizations, we establish sublinear convergence rates up to a function approximation error caused by restricted policy parametrization. We also provide convergence and finite-sample complexity guarantees for two sample-based NPG-PD algorithms. Finally, we use computational experiments to showcase the merits and the effectiveness of our approach.
△ Less
Submitted 28 August, 2024; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence
Authors:
Dongsheng Ding,
Chen-Yu Wei,
Kaiqing Zhang,
Mihailo R. Jovanović
Abstract:
We examine global non-asymptotic convergence properties of policy gradient methods for multi-agent reinforcement learning (RL) problems in Markov potential games (MPG). To learn a Nash equilibrium of an MPG in which the size of state space and/or the number of players can be very large, we propose new independent policy gradient algorithms that are run by all players in tandem. When there is no un…
▽ More
We examine global non-asymptotic convergence properties of policy gradient methods for multi-agent reinforcement learning (RL) problems in Markov potential games (MPG). To learn a Nash equilibrium of an MPG in which the size of state space and/or the number of players can be very large, we propose new independent policy gradient algorithms that are run by all players in tandem. When there is no uncertainty in the gradient evaluation, we show that our algorithm finds an $ε$-Nash equilibrium with $O(1/ε^2)$ iteration complexity which does not explicitly depend on the state space size. When the exact gradient is not available, we establish $O(1/ε^5)$ sample complexity bound in a potentially infinitely large state space for a sample-based algorithm that utilizes function approximation. Moreover, we identify a class of independent policy gradient algorithms that enjoys convergence for both zero-sum Markov games and Markov cooperative games with the players that are oblivious to the types of games being played. Finally, we provide computational experiments to corroborate the merits and the effectiveness of our theoretical developments.
△ Less
Submitted 4 August, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Oblique transition in hypersonic double-wedge flow
Authors:
Anubhav Dwivedi,
G. S. Sidharth,
Mihailo R. Jovanović
Abstract:
We utilize resolvent and weakly nonlinear analyses in combination with direct numerical simulations (DNS) to identify mechanisms for oblique transition in a Mach 5 hypersonic flow over an adiabatic slender double-wedge. Even though the laminar separated flow is globally stable, resolvent analysis demonstrates significant amplification of unsteady external disturbances to the linearized flow equati…
▽ More
We utilize resolvent and weakly nonlinear analyses in combination with direct numerical simulations (DNS) to identify mechanisms for oblique transition in a Mach 5 hypersonic flow over an adiabatic slender double-wedge. Even though the laminar separated flow is globally stable, resolvent analysis demonstrates significant amplification of unsteady external disturbances to the linearized flow equations. These disturbances are introduced upstream of the separation zone and they lead to the appearance of oblique waves further downstream. We demonstrate that large amplification of oblique waves arises from the growth of fluctuation shear stress due to streamline curvature of the laminar base flow in the separated shear layer. This is in contrast to the attached boundary layers, where no such mechanism exists. We also use a weakly nonlinear analysis to show that the resolvent operator associated with linearization around the laminar base flow governs the evolution of steady reattachment streaks that arise from quadratic interactions of unsteady oblique waves. These quadratic interactions generate vortical excitations in the reattaching shear layer which lead to the formation of streaks in the recirculation zone and their subsequent amplification, breakdown, and transition to turbulence downstream. Our analysis of the energy budget shows that deceleration of the base flow near reattachment is primarily responsible for amplification of steady streaks. Finally, we employ DNS to examine latter stages of transition to turbulence and demonstrate the predictive power of a weakly nonlinear input-output framework in uncovering triggering mechanisms for oblique transition in separated high-speed boundary layer flows.
△ Less
Submitted 26 May, 2022; v1 submitted 30 November, 2021;
originally announced November 2021.
-
Can Decentralized Control Outperform Centralized? The Role of Communication Latency
Authors:
Luca Ballotta,
Mihailo R. Jovanović,
Luca Schenato
Abstract:
In this paper, we examine the influence of communication latency on performance of networked control systems. Even though distributed control architectures offer advantages in terms of communication, maintenance costs, and scalability, it is an open question how communication latency that varies with network topology influences closed-loop performance. For networks in which delays increase with th…
▽ More
In this paper, we examine the influence of communication latency on performance of networked control systems. Even though distributed control architectures offer advantages in terms of communication, maintenance costs, and scalability, it is an open question how communication latency that varies with network topology influences closed-loop performance. For networks in which delays increase with the number of links, we establish the existence of a fundamental performance trade-off that arises from control architecture. In particular, we utilize consensus dynamics with single- and double-integrator agents to show that, if delays increase fast enough, a sparse controller with nearest neighbor interactions can outperform the centralized one with all-to-all communication topology.
△ Less
Submitted 26 January, 2023; v1 submitted 1 September, 2021;
originally announced September 2021.
-
Transient growth of accelerated optimization algorithms
Authors:
Hesameddin Mohammadi,
Samantha Samuelson,
Mihailo R. Jovanović
Abstract:
Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quad…
▽ More
Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quadratic problems, we employ tools from linear systems theory to show that transient growth arises from the presence of non-normal dynamics. We identify the existence of modes that yield an algebraic growth in early iterations and quantify the transient excursion from the optimal solution caused by these modes. For strongly convex smooth optimization problems, we utilize the theory of integral quadratic constraints (IQCs) to establish an upper bound on the magnitude of the transient response of Nesterov's accelerated algorithm. We show that both the Euclidean distance between the optimization variable and the global minimizer and the rise time to the transient peak are proportional to the square root of the condition number of the problem. Finally, for problems with large condition numbers, we demonstrate tightness of the bounds that we derive up to constant factors.
△ Less
Submitted 23 December, 2021; v1 submitted 14 March, 2021;
originally announced March 2021.
-
Well-conditioned ultraspherical and spectral integration methods for resolvent analysis of channel flows of Newtonian and viscoelastic fluids
Authors:
Gokul Hariharan,
Satish Kumar,
Mihailo R. Jovanović
Abstract:
Modal and nonmodal analyses of fluid flows provide fundamental insight into the early stages of transition to turbulence. Eigenvalues of the dynamical generator govern temporal growth or decay of individual modes, while singular values of the frequency response operator quantify the amplification of disturbances for linearly stable flows. In this paper, we develop well-conditioned ultraspherical a…
▽ More
Modal and nonmodal analyses of fluid flows provide fundamental insight into the early stages of transition to turbulence. Eigenvalues of the dynamical generator govern temporal growth or decay of individual modes, while singular values of the frequency response operator quantify the amplification of disturbances for linearly stable flows. In this paper, we develop well-conditioned ultraspherical and spectral integration methods for frequency response analysis of channel flows of Newtonian and viscoelastic fluids. Even if a discretization method is well-conditioned, we demonstrate that calculations can be erroneous if singular values are computed as the eigenvalues of a cascade connection of the frequency response operator and its adjoint. To address this issue, we utilize a feedback interconnection of the frequency response operator with its adjoint to avoid computation of inverses and facilitate robust singular value decomposition. Specifically, in contrast to conventional spectral collocation methods, the proposed method (i) produces reliable results in channel flows of viscoelastic fluids at high Weissenberg numbers ($\sim 500$); and (ii) does not require a staggered grid for the equations in primitive variables.
△ Less
Submitted 23 February, 2021; v1 submitted 9 May, 2020;
originally announced May 2020.
-
From bypass transition to flow control and data-driven turbulence modeling: An input-output viewpoint
Authors:
Mihailo R. Jovanović
Abstract:
Transient growth and resolvent analyses are routinely used to assess non-asymptotic properties of fluid flows. In particular, resolvent analysis can be interpreted as a special case of viewing flow dynamics as an open system in which free-stream turbulence, surface roughness, and other irregularities provide sources of input forcing. We offer a comprehensive summary of the tools that can be employ…
▽ More
Transient growth and resolvent analyses are routinely used to assess non-asymptotic properties of fluid flows. In particular, resolvent analysis can be interpreted as a special case of viewing flow dynamics as an open system in which free-stream turbulence, surface roughness, and other irregularities provide sources of input forcing. We offer a comprehensive summary of the tools that can be employed to probe the dynamics of fluctuations around a laminar or turbulent base flow in the presence of such stochastic or deterministic input forcing and describe how input-output techniques enhance resolvent analysis. Specifically, physical insights that may remain hidden in the resolvent analysis are gained by detailed examination of input-output responses between spatially-localized body forces and selected linear combinations of state variables. This differentiating feature plays a key role in quantifying the importance of different mechanisms for bypass transition in wall-bounded shear flows and in explaining how turbulent jets generate noise. We highlight the utility of a stochastic framework, with white or colored inputs, in addressing a variety of open challenges including transition in complex fluids, flow control, and physics-aware data-driven turbulence modeling. Applications with time- or spatially-periodic base flows are discussed and future research directions are outlined.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Authors:
Dongsheng Ding,
Xiaohan Wei,
Zhuoran Yang,
Zhaoran Wang,
Mihailo R. Jovanović
Abstract:
We study the Safe Reinforcement Learning (SRL) problem using the Constrained Markov Decision Process (CMDP) formulation in which an agent aims to maximize the expected total reward subject to a safety constraint on the expected total value of a utility function. We focus on an episodic setting with the function approximation where the Markov transition kernels have a linear structure but do not im…
▽ More
We study the Safe Reinforcement Learning (SRL) problem using the Constrained Markov Decision Process (CMDP) formulation in which an agent aims to maximize the expected total reward subject to a safety constraint on the expected total value of a utility function. We focus on an episodic setting with the function approximation where the Markov transition kernels have a linear structure but do not impose any additional assumptions on the sampling model. Designing SRL algorithms with provable computational and statistical efficiency is particularly challenging under this setting because of the need to incorporate both the safety constraint and the function approximation into the fundamental exploitation/exploration tradeoff. To this end, we present an \underline{O}ptimistic \underline{P}rimal-\underline{D}ual Proximal Policy \underline{OP}timization (OPDOP) algorithm where the value function is estimated by combining the least-squares policy evaluation and an additional bonus term for safe exploration. We prove that the proposed algorithm achieves an $\tilde{O}(d H^{2.5}\sqrt{T})$ regret and an $\tilde{O}(d H^{2.5}\sqrt{T})$ constraint violation, where $d$ is the dimension of the feature mapping, $H$ is the horizon of each episode, and $T$ is the total number of steps. These bounds hold when the reward/utility functions are fixed but the feedback after each episode is bandit. Our bounds depend on the capacity of the state-action space only through the dimension of the feature mapping and thus our results hold even when the number of states goes to infinity. To the best of our knowledge, we provide the first provably efficient online policy optimization algorithm for CMDP with safe exploration in the function approximation setting.
△ Less
Submitted 25 October, 2020; v1 submitted 1 March, 2020;
originally announced March 2020.
-
Model-based design of riblets for turbulent drag reduction
Authors:
Wei Ran,
Armin Zare,
Mihailo R. Jovanović
Abstract:
Both experiments and direct numerical simulations have been used to demonstrate that riblets can reduce turbulent drag by as much as $10\%$, but their systematic design remains an open challenge. In this paper, we develop a model-based framework to quantify the effect of streamwise-aligned spanwise-periodic riblets on kinetic energy and skin-friction drag in turbulent channel flow. We model the ef…
▽ More
Both experiments and direct numerical simulations have been used to demonstrate that riblets can reduce turbulent drag by as much as $10\%$, but their systematic design remains an open challenge. In this paper, we develop a model-based framework to quantify the effect of streamwise-aligned spanwise-periodic riblets on kinetic energy and skin-friction drag in turbulent channel flow. We model the effect of riblets as a volume penalization in the Navier-Stokes equations and use the statistical response of the eddy-viscosity-enhanced linearized equations to quantify the effect of background turbulence on the mean velocity and skin-friction drag. For triangular riblets, our simulation-free approach reliably predicts drag-reducing trends as well as mechanisms that lead to performance deterioration for large riblets. We investigate the effect of height and spacing on drag reduction and demonstrate a correlation between energy suppression and drag-reduction for appropriately sized riblets. We also analyze the effect of riblets on drag reduction mechanisms and turbulent flow structures including very large scale motions. Our results demonstrate the utility of our approach in capturing the effect of riblets on turbulent flows using models that are tractable for analysis and optimization.
△ Less
Submitted 27 August, 2020; v1 submitted 5 February, 2020;
originally announced February 2020.
-
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Authors:
Hesameddin Mohammadi,
Armin Zare,
Mahdi Soltanolkotabi,
Mihailo R. Jovanović
Abstract:
Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this pape…
▽ More
Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this paper, we take a step towards demystifying the performance and efficiency of such methods by focusing on the standard infinite-horizon linear quadratic regulator problem for continuous-time systems with unknown state-space parameters. We establish exponential stability for the ordinary differential equation (ODE) that governs the gradient-flow dynamics over the set of stabilizing feedback gains and show that a similar result holds for the gradient descent method that arises from the forward Euler discretization of the corresponding ODE. We also provide theoretical bounds on the convergence rate and sample complexity of the random search method with two-point gradient estimates. We prove that the required simulation time for achieving $ε$-accuracy in the model-free setup and the total number of function evaluations both scale as $\log \, (1/ε)$.
△ Less
Submitted 15 March, 2021; v1 submitted 26 December, 2019;
originally announced December 2019.
-
Modal stability analysis of viscoelastic channel and pipe flows using a well-conditioned spectral method
Authors:
Gokul Hariharan,
Mihailo R. Jovanović,
Satish Kumar
Abstract:
Modal stability analysis provides information about the long-time growth or decay of small-amplitude perturbations around a steady-state solution of a dynamical system. In fluid flows, exponentially growing perturbations can initiate departure from laminar flow and trigger transition to turbulence. Although flow of a Newtonian fluid through a pipe is linearly stable for very large values of the Re…
▽ More
Modal stability analysis provides information about the long-time growth or decay of small-amplitude perturbations around a steady-state solution of a dynamical system. In fluid flows, exponentially growing perturbations can initiate departure from laminar flow and trigger transition to turbulence. Although flow of a Newtonian fluid through a pipe is linearly stable for very large values of the Reynolds number ($Re \sim 10^7$), a transition to turbulence often occurs for $Re$ as low as $1500$. When a dilute polymer solution is used in the place of a Newtonian fluid, the transitional value of the Reynolds number decreases even further. Using the spectral collocation method and Oldroyd-B constitutive equation, Garg et al. (Phys. Rev. Lett. 121:024502, 2018) claimed that such a transition in viscoelastic fluids is related to linear instability. Since differential matrices in the collocation method become ill-conditioned when a large number of basis functions is used, we revisit this problem using the well-conditioned spectral integration method. We show modal stability of viscoelastic pipe flow for a broad range of fluid elasticities and polymer concentrations, including cases considered by Garg et al. Similarly, we find that plane Poiseuille flow is linearly stable for cases where Garg et al. report instability. In both channel and pipe flows, we establish the existence of spurious modes that diverge slowly with finer discretization and demonstrate that these can be mistaken for grid-independent modes if the discretization is not fine enough.
△ Less
Submitted 20 November, 2019; v1 submitted 7 November, 2019;
originally announced November 2019.
-
Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach
Authors:
Dongsheng Ding,
Mihailo R. Jovanović
Abstract:
For a class of nonsmooth composite optimization problems with linear equality constraints, we utilize a Lyapunov-based approach to establish the global exponential stability of the primal-dual gradient flow dynamics based on the proximal augmented Lagrangian. The result holds when the differentiable part of the objective function is strongly convex with a Lipschitz continuous gradient; the non-dif…
▽ More
For a class of nonsmooth composite optimization problems with linear equality constraints, we utilize a Lyapunov-based approach to establish the global exponential stability of the primal-dual gradient flow dynamics based on the proximal augmented Lagrangian. The result holds when the differentiable part of the objective function is strongly convex with a Lipschitz continuous gradient; the non-differentiable part is proper, lower semi-continuous, and convex; and the matrix in the linear constraint is full row rank. Our quadratic Lyapunov function generalizes recent result from strongly convex problems with either affine equality or inequality constraints to a broader class of composite optimization problems with nonsmooth regularizers and it provides a worst-case lower bound of the exponential decay rate. Finally, we use computational experiments to demonstrate that our convergence rate estimate is less conservative than the existing alternatives.
△ Less
Submitted 2 October, 2019;
originally announced October 2019.
-
Stochastic dynamical modeling of turbulent flows
Authors:
Armin Zare,
Tryphon T. Georgiou,
Mihailo R. Jovanović
Abstract:
Advanced measurement techniques and high performance computing have made large data sets available for a wide range of turbulent flows that arise in engineering applications. Drawing on this abundance of data, dynamical models can be constructed to reproduce structural and statistical features of turbulent flows, opening the way to the design of effective model-based flow control strategies. This…
▽ More
Advanced measurement techniques and high performance computing have made large data sets available for a wide range of turbulent flows that arise in engineering applications. Drawing on this abundance of data, dynamical models can be constructed to reproduce structural and statistical features of turbulent flows, opening the way to the design of effective model-based flow control strategies. This review describes a framework for completing second-order statistics of turbulent flows by models that are based on the Navier-Stokes equations linearized around the turbulent mean velocity. Systems theory and convex optimization are combined to address the inherent uncertainty in the dynamics and the statistics of the flow by seeking a suitable parsimonious correction to the prior linearized model. Specifically, dynamical couplings between states of the linearized model dictate structural constraints on the statistics of flow fluctuations. Thence, colored-in-time stochastic forcing that drives the linearized model is sought to account for and reconcile dynamics with available data (i.e., partially known second order statistics). The number of dynamical degrees of freedom that are directly affected by stochastic excitation is minimized as a measure of model parsimony. The spectral content of the resulting colored-in-time stochastic contribution can alternatively be seen to arise from a low-rank structural perturbation of the linearized dynamical generator, pointing to suitable dynamical corrections that may account for the absence of the nonlinear interactions in the linearized model.
△ Less
Submitted 26 August, 2019;
originally announced August 2019.
-
Proximal gradient flow and Douglas-Rachford splitting dynamics: global exponential stability via integral quadratic constraints
Authors:
Sepideh Hassan-Moghaddam,
Mihailo R. Jovanović
Abstract:
Many large-scale and distributed optimization problems can be brought into a composite form in which the objective function is given by the sum of a smooth term and a nonsmooth regularizer. Such problems can be solved via a proximal gradient method and its variants, thereby generalizing gradient descent to a nonsmooth setup. In this paper, we view proximal algorithms as dynamical systems and lever…
▽ More
Many large-scale and distributed optimization problems can be brought into a composite form in which the objective function is given by the sum of a smooth term and a nonsmooth regularizer. Such problems can be solved via a proximal gradient method and its variants, thereby generalizing gradient descent to a nonsmooth setup. In this paper, we view proximal algorithms as dynamical systems and leverage techniques from control theory to study their global properties. In particular, for problems with strongly convex objective functions, we utilize the theory of integral quadratic constraints to prove the global exponential stability of the equilibrium points of the differential equations that govern the evolution of proximal gradient and Douglas-Rachford splitting flows. In our analysis, we use the fact that these algorithms can be interpreted as variable-metric gradient methods on the suitable envelopes and exploit structural properties of the nonlinear terms that arise from the gradient of the smooth part of the objective function and the proximal operator associated with the nonsmooth regularizer. We also demonstrate that these envelopes can be obtained from the augmented Lagrangian associated with the original nonsmooth problem and establish conditions for global exponential convergence even in the absence of strong convexity.
△ Less
Submitted 25 June, 2020; v1 submitted 23 August, 2019;
originally announced August 2019.
-
Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Authors:
Dongsheng Ding,
Xiaohan Wei,
Zhuoran Yang,
Zhaoran Wang,
Mihailo R. Jovanović
Abstract:
We study the policy evaluation problem in multi-agent reinforcement learning where a group of agents, with jointly observed states and private local actions and rewards, collaborate to learn the value function of a given policy via local computation and communication over a connected undirected network. This problem arises in various large-scale multi-agent systems, including power grids, intellig…
▽ More
We study the policy evaluation problem in multi-agent reinforcement learning where a group of agents, with jointly observed states and private local actions and rewards, collaborate to learn the value function of a given policy via local computation and communication over a connected undirected network. This problem arises in various large-scale multi-agent systems, including power grids, intelligent transportation systems, wireless sensor networks, and multi-agent robotics. When the dimension of state-action space is large, the temporal-difference learning with linear function approximation is widely used. In this paper, we develop a new distributed temporal-difference learning algorithm and quantify its finite-time performance. Our algorithm combines a distributed stochastic primal-dual method with a homotopy-based approach to adaptively adjust the learning rate in order to minimize the mean-square projected Bellman error by taking fresh online samples from a causal on-policy trajectory. We explicitly take into account the Markovian nature of sampling and improve the best-known finite-time error bound from $O(1/\sqrt{T})$ to~$O(1/T)$, where $T$ is the total number of iterations.
△ Less
Submitted 4 November, 2021; v1 submitted 7 August, 2019;
originally announced August 2019.
-
Robustness of accelerated first-order algorithms for strongly convex optimization problems
Authors:
Hesameddin Mohammadi,
Meisam Razaviyayn,
Mihailo R. Jovanović
Abstract:
We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradien…
▽ More
We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradient is sought through measurements of a real system or in a distributed computation over a network. Even though the underlying dynamics of first-order algorithms for this class of problems are nonlinear, we establish upper bounds on the mean-squared deviation from the optimal solution that are tight up to constant factors. Our analysis quantifies fundamental trade-offs between noise amplification and convergence rates obtained via any acceleration scheme similar to Nesterov's or heavy-ball methods. To gain additional analytical insight, for strongly convex quadratic problems, we explicitly evaluate the steady-state variance of the optimization variable in terms of the eigenvalues of the Hessian of the objective function. We demonstrate that the entire spectrum of the Hessian, rather than just the extreme eigenvalues, influence robustness of noisy algorithms. We specialize this result to the problem of distributed averaging over undirected networks and examine the role of network size and topology on the robustness of noisy accelerated algorithms.
△ Less
Submitted 20 February, 2020; v1 submitted 27 May, 2019;
originally announced May 2019.
-
Stochastic receptivity analysis of boundary layer flow
Authors:
Wei Ran,
Armin Zare,
M. J. Philipp Hack,
Mihailo R. Jovanović
Abstract:
We utilize the externally forced linearized Navier-Stokes equations to study the receptivity of pre-transitional boundary layers to persistent sources of stochastic excitation. Stochastic forcing is used to model the effect of free-stream turbulence that enters at various wall-normal locations and the fluctuation dynamics are studied via linearized models that arise from locally parallel and globa…
▽ More
We utilize the externally forced linearized Navier-Stokes equations to study the receptivity of pre-transitional boundary layers to persistent sources of stochastic excitation. Stochastic forcing is used to model the effect of free-stream turbulence that enters at various wall-normal locations and the fluctuation dynamics are studied via linearized models that arise from locally parallel and global perspectives. In contrast to the widely used resolvent analysis that quantifies the amplification of deterministic disturbances at a given temporal frequency, our approach examines the steady-state response to stochastic excitation that is uncorrelated in time. In addition to stochastic forcing with identity covariance, we utilize the spatial spectrum of homogeneous isotropic turbulence to model the effect of free-stream turbulence. Even though locally parallel analysis does not account for the effect of the spatially evolving base flow, we demonstrate that it captures the essential mechanisms and the prevailing length-scales in stochastically forced boundary layer flows. On the other hand, global analysis, which accounts for the spatially evolving nature of the boundary layer flow, predicts the amplification of a cascade of streamwise scales throughout the streamwise domain. We show that the flow structures that can be extracted from a modal decomposition of the resulting velocity covariance matrix, can be closely captured by conducting locally parallel analysis at various streamwise locations and over different wall-parallel wavenumber pairs. Our approach does not rely on costly stochastic simulations and it provides insight into mechanisms for perturbation growth including the interaction of the slowly varying base flow with streaks and Tollmien-Schlichting waves.
△ Less
Submitted 3 June, 2019; v1 submitted 20 July, 2018;
originally announced July 2018.
-
Proximal algorithms for large-scale statistical modeling and sensor/actuator selection
Authors:
Armin Zare,
Hesameddin Mohammadi,
Neil K. Dhingra,
Tryphon T. Georgiou,
Mihailo R. Jovanović
Abstract:
Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally selec…
▽ More
Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally select a subset of available sensors and actuators for control purposes. To address modeling and control of large-scale systems we develop a unified algorithmic framework using proximal methods. Our customized algorithms exploit problem structure and allow handling statistical modeling, as well as sensor and actuator selection, for substantially larger scales than what is amenable to current general-purpose solvers. We establish linear convergence of the proximal gradient algorithm, draw contrast between the proposed proximal algorithms and alternating direction method of multipliers, and provide examples that illustrate the merits and effectiveness of our framework.
△ Less
Submitted 26 December, 2019; v1 submitted 4 July, 2018;
originally announced July 2018.
-
Structured decentralized control of positive systems with applications to combination drug therapy and leader selection in directed networks
Authors:
Neil K. Dhingra,
Marcello Colombino,
Mihailo R. Jovanović
Abstract:
We study a class of structured optimal control problems in which the main diagonal of the dynamic matrix is a linear function of the design variable. While such problems are in general challenging and nonconvex, for positive systems we prove convexity of the $H_2$ and $H_\infty$ optimal control formulations which allow for arbitrary convex constraints and regularization of the control input. Moreo…
▽ More
We study a class of structured optimal control problems in which the main diagonal of the dynamic matrix is a linear function of the design variable. While such problems are in general challenging and nonconvex, for positive systems we prove convexity of the $H_2$ and $H_\infty$ optimal control formulations which allow for arbitrary convex constraints and regularization of the control input. Moreover, we establish differentiability of the $H_\infty$ norm when the graph associated with the dynamical generator is weakly connected and develop a customized algorithm for computing the optimal solution even in the absence of differentiability. We apply our results to the problems of leader selection in directed consensus networks and combination drug therapy for HIV treatment. In the context of leader selection, we address the combinatorial challenge by deriving upper and lower bounds on optimal performance. For combination drug therapy, we develop a customized subgradient method for efficient treatment of diseases whose mutation patterns are not connected.
△ Less
Submitted 4 March, 2018; v1 submitted 29 December, 2017;
originally announced December 2017.
-
A second order primal-dual method for nonsmooth convex composite optimization
Authors:
Neil K. Dhingra,
Sei Zhen Khong,
Mihailo R. Jovanović
Abstract:
We develop a second order primal-dual method for optimization problems in which the objective function is given by the sum of a strongly convex twice differentiable term and a possibly nondifferentiable convex regularizer. After introducing an auxiliary variable, we utilize the proximal operator of the nonsmooth regularizer to transform the associated augmented Lagrangian into a function that is o…
▽ More
We develop a second order primal-dual method for optimization problems in which the objective function is given by the sum of a strongly convex twice differentiable term and a possibly nondifferentiable convex regularizer. After introducing an auxiliary variable, we utilize the proximal operator of the nonsmooth regularizer to transform the associated augmented Lagrangian into a function that is once, but not twice, continuously differentiable. The saddle point of this function corresponds to the solution of the original optimization problem. We employ a generalization of the Hessian to define second order updates on this function and prove global exponential stability of the corresponding differential inclusion. Furthermore, we develop a globally convergent customized algorithm that utilizes the primal-dual augmented Lagrangian as a merit function. We show that the search direction can be computed efficiently and prove quadratic/superlinear asymptotic convergence. We use the $\ell_1$-regularized model predictive control problem and the problem of designing a distributed controller for a spatially-invariant system to demonstrate the merits and the effectiveness of our method.
△ Less
Submitted 27 August, 2020; v1 submitted 5 September, 2017;
originally announced September 2017.
-
On the optimal control problem for a class of monotone bilinear systems
Authors:
Neil K. Dhingra,
Marcello Colombino,
Mihailo R. Jovanović,
Anders Rantzer,
Roy S. Smith
Abstract:
We consider a class of monotone systems in which the control signal multiplies the state. Among other applications, such bilinear systems can be used to model the evolutionary dynamics of HIV in the presence of combination drug therapy. For this class of systems, we formulate an infinite horizon optimal control problem, prove that the optimal control signal is constant over time, and show that it…
▽ More
We consider a class of monotone systems in which the control signal multiplies the state. Among other applications, such bilinear systems can be used to model the evolutionary dynamics of HIV in the presence of combination drug therapy. For this class of systems, we formulate an infinite horizon optimal control problem, prove that the optimal control signal is constant over time, and show that it can be computed by solving a finite-dimensional non-smooth convex optimization problem. We provide an explicit expression for the subdifferential set of the objective function and use a subgradient algorithm to design the optimal controller. We further extend our results to characterize the optimal robust controller for systems with uncertain dynamics and show that computing the robust controller is no harder than computing the nominal controller. We illustrate our results with an example motivated by combination drug therapy.
△ Less
Submitted 29 November, 2016;
originally announced November 2016.
-
The proximal augmented Lagrangian method for nonsmooth composite optimization
Authors:
Neil K. Dhingra,
Sei Zhen Khong,
Mihailo R. Jovanović
Abstract:
We study a class of optimization problems in which the objective function is given by the sum of a differentiable but possibly nonconvex component and a nondifferentiable convex regularization term. We introduce an auxiliary variable to separate the objective function components and utilize the Moreau envelope of the regularization term to derive the proximal augmented Lagrangian $-$ a continuousl…
▽ More
We study a class of optimization problems in which the objective function is given by the sum of a differentiable but possibly nonconvex component and a nondifferentiable convex regularization term. We introduce an auxiliary variable to separate the objective function components and utilize the Moreau envelope of the regularization term to derive the proximal augmented Lagrangian $-$ a continuously differentiable function obtained by constraining the augmented Lagrangian to the manifold that corresponds to the explicit minimization over the variable in the nonsmooth term. The continuous differentiability of this function with respect to both primal and dual variables allows us to leverage the method of multipliers (MM) to compute optimal primal-dual pairs by solving a sequence of differentiable problems. The MM algorithm is applicable to a broader class of problems than proximal gradient methods and it has stronger convergence guarantees and a more refined step-size update rules than the alternating direction method of multipliers. These features make it an attractive option for solving structured optimal control problems. We also develop an algorithm based on the primal-descent dual-ascent gradient method and prove global (exponential) asymptotic stability when the differentiable component of the objective function is (strongly) convex and the regularization term is convex. Finally, we identify classes of problems for which the primal-dual gradient flow dynamics are convenient for distributed implementation and compare/contrast our framework to the existing approaches.
△ Less
Submitted 25 August, 2018; v1 submitted 14 October, 2016;
originally announced October 2016.
-
Color of turbulence
Authors:
Armin Zare,
Mihailo R. Jovanović,
Tryphon T. Georgiou
Abstract:
In this paper, we address the problem of how to account for second-order statistics of turbulent flows using low-complexity stochastic dynamical models based on the linearized Navier-Stokes equations. The complexity is quantified by the number of degrees of freedom in the linearized evolution model that are directly influenced by stochastic excitation sources. For the case where only a subset of v…
▽ More
In this paper, we address the problem of how to account for second-order statistics of turbulent flows using low-complexity stochastic dynamical models based on the linearized Navier-Stokes equations. The complexity is quantified by the number of degrees of freedom in the linearized evolution model that are directly influenced by stochastic excitation sources. For the case where only a subset of velocity correlations are known, we develop a framework to complete unavailable second-order statistics in a way that is consistent with linearization around turbulent mean velocity. In general, white-in-time stochastic forcing is not sufficient to explain turbulent flow statistics. We develop models for colored-in-time forcing using a maximum entropy formulation together with a regularization that serves as a proxy for rank minimization. We show that colored-in-time excitation of the Navier-Stokes equations can also be interpreted as a low-rank modification to the generator of the linearized dynamics. Our method provides a data-driven refinement of models that originate from first principles and captures complex dynamics of turbulent flows in a way that is tractable for analysis, optimization, and control design.
△ Less
Submitted 18 October, 2016; v1 submitted 16 February, 2016;
originally announced February 2016.
-
Topology design for stochastically-forced consensus networks
Authors:
Sepideh Hassan-Moghaddam,
Mihailo R. Jovanović
Abstract:
We study an optimal control problem aimed at achieving a desired tradeoff between the network coherence and communication requirements in the distributed controller. Our objective is to add a certain number of edges to an undirected network, with a known graph Laplacian, in order to optimally enhance closed-loop performance. To promote controller sparsity, we introduce $\ell_1$-regularization into…
▽ More
We study an optimal control problem aimed at achieving a desired tradeoff between the network coherence and communication requirements in the distributed controller. Our objective is to add a certain number of edges to an undirected network, with a known graph Laplacian, in order to optimally enhance closed-loop performance. To promote controller sparsity, we introduce $\ell_1$-regularization into the optimal ${\cal H}_2$ formulation and cast the design problem as a semidefinite program. We derive a Lagrange dual, provide interpretation of dual variables, and exploit structure of the optimality conditions for undirected networks to develop customized proximal gradient and Newton algorithms that are well-suited for large problems. We illustrate that our algorithms can solve the problems with more than million edges in the controller graph in a few minutes, on a PC. We also exploit structure of connected resistive networks to demonstrate how additional edges can be systematically added in order to minimize the ${\cal H}_2$ norm of the closed-loop system.
△ Less
Submitted 17 October, 2016; v1 submitted 10 June, 2015;
originally announced June 2015.
-
Input-output analysis and decentralized optimal control of inter-area oscillations in power systems
Authors:
Xiaofan Wu,
Florian Dörfler,
Mihailo R. Jovanović
Abstract:
Local and inter-area oscillations in bulk power systems are typically identified using spatial profiles of poorly damped modes, and they are mitigated via carefully tuned decentralized controllers. In this paper, we employ non-modal tools to analyze and control inter-area oscillations. Our input-output analysis examines power spectral density and variance amplification of stochastically forced sys…
▽ More
Local and inter-area oscillations in bulk power systems are typically identified using spatial profiles of poorly damped modes, and they are mitigated via carefully tuned decentralized controllers. In this paper, we employ non-modal tools to analyze and control inter-area oscillations. Our input-output analysis examines power spectral density and variance amplification of stochastically forced systems and offers new insights relative to modal approaches. To improve upon the limitations of conventional wide-area control strategies, we also study the problem of signal selection and optimal design of sparse and block-sparse wide-area controllers. In our design, we preserve rotational symmetry of the power system by allowing only relative angle measurements in the distributed controllers. For the IEEE 39 New England model, we examine performance tradeoffs and robustness of different control architectures and show that optimal retuning of fully-decentralized control strategies can effectively guard against local and inter-area oscillations.
△ Less
Submitted 18 May, 2015; v1 submitted 11 February, 2015;
originally announced February 2015.
-
Low-complexity modeling of partially available second-order statistics: theory and an efficient matrix completion algorithm
Authors:
Armin Zare,
Yongxin Chen,
Mihailo R. Jovanović,
Tryphon T. Georgiou
Abstract:
State statistics of linear systems satisfy certain structural constraints that arise from the underlying dynamics and the directionality of input disturbances. In the present paper we study the problem of completing partially known state statistics. Our aim is to develop tools that can be used in the context of control-oriented modeling of large-scale dynamical systems. For the type of application…
▽ More
State statistics of linear systems satisfy certain structural constraints that arise from the underlying dynamics and the directionality of input disturbances. In the present paper we study the problem of completing partially known state statistics. Our aim is to develop tools that can be used in the context of control-oriented modeling of large-scale dynamical systems. For the type of applications we have in mind, the dynamical interaction between state variables is known while the directionality and dynamics of input excitation is often uncertain. Thus, the goal of the mathematical problem that we formulate is to identify the dynamics and directionality of input excitation in order to explain and complete observed sample statistics. More specifically, we seek to explain correlation data with the least number of possible input disturbance channels. We formulate this inverse problem as rank minimization, and for its solution, we employ a convex relaxation based on the nuclear norm. The resulting optimization problem is cast as a semidefinite program and can be solved using general-purpose solvers. For problem sizes that these solvers cannot handle, we develop a customized alternating minimization algorithm (AMA). We interpret AMA as a proximal gradient for the dual problem and prove sub-linear convergence for the algorithm with fixed step-size. We conclude with an example that illustrates the utility of our modeling and optimization framework and draw contrast between AMA and the commonly used alternating direction method of multipliers (ADMM) algorithm.
△ Less
Submitted 11 April, 2016; v1 submitted 10 December, 2014;
originally announced December 2014.
-
Performance laws of large heterogeneous cellular networks
Authors:
Bartlomiej Blaszczyszyn,
Miodrag Jovanovic,
Mohamed Kadhem Karray
Abstract:
We propose a model for heterogeneous cellular networks assuming a space-time Poisson process of call arrivals, independently marked by data volumes, and served by different types of base stations (having different transmission powers) represented by the superposition of independent Poisson processes on the plane. Each station applies a processor sharing policy to serve users arriving in its vicini…
▽ More
We propose a model for heterogeneous cellular networks assuming a space-time Poisson process of call arrivals, independently marked by data volumes, and served by different types of base stations (having different transmission powers) represented by the superposition of independent Poisson processes on the plane. Each station applies a processor sharing policy to serve users arriving in its vicinity, modeled by the Voronoi cell perturbed by some random signal propagation effects (shadowing). Users' peak service rates depend on their signal-to-interference-and-noise ratios (SINR) with respect to the serving station. The mutual-dependence of the cells (due to the extra-cell interference) is captured via some system of cell-load equations impacting the spatial distribution of the SINR. We use this model to study in a semi-analytic way (involving only static simulations, with the temporal evolution handled by the queuing theoretic results) network performance metrics (cell loads, mean number of users) and the quality of service perceived by the users (mean throughput) served by different types of base stations. Our goal is to identify macroscopic laws regarding these performance metrics, involving averaging both over time and the network geometry. The reveled laws are validated against real field measurement in an operational network.
△ Less
Submitted 19 March, 2015; v1 submitted 28 November, 2014;
originally announced November 2014.
-
Tests of exponentiality based on Arnold-Villasenor characterization, and their efficiencies
Authors:
M. Jovanovic,
B. Milosevic,
Ya. Yu. Nikitin,
M. Obradovic,
K. Yu. Volkova
Abstract:
We propose two families of scale-free exponentiality tests based on the recent characterization of exponentiality by Arnold and Villasenor. The test statistics are based on suitable functionals of U-empirical distribution functions. The family of integral statistics can be reduced to V- or U-statistics with relatively simple non-degenerate kernels. They are asymptotically normal and have reasonabl…
▽ More
We propose two families of scale-free exponentiality tests based on the recent characterization of exponentiality by Arnold and Villasenor. The test statistics are based on suitable functionals of U-empirical distribution functions. The family of integral statistics can be reduced to V- or U-statistics with relatively simple non-degenerate kernels. They are asymptotically normal and have reasonably high local Bahadur efficiency under common alternatives. This efficiency is compared with simulated powers of new tests. On the other hand, the Kolmogorov type tests demonstrate very low local Bahadur efficiency and rather moderate power for common alternatives,and can hardly be recommended to practitioners. We also explore the conditions of local asymptotic optimality of new tests and describe for both families special "most favorable" alternatives for which the tests are fully efficient.
△ Less
Submitted 18 July, 2014;
originally announced July 2014.
-
Goodness-of-Fit Tests for Pareto Distribution Based on a Characterization and their Asymptotics
Authors:
Marko Obradović,
Milan Jovanović,
Bojana Milošević
Abstract:
In this paper we present a new characterization of Pareto distribution and consider goodness of fit tests based on it. We provide an integral and Kolmogorov- Smirnov type statistics based on U-statistics and we calculate Bahadur efficiency for various alternatives. We find locally optimal alternatives for those tests. For small sample sizes we compare the power of those tests with some common good…
▽ More
In this paper we present a new characterization of Pareto distribution and consider goodness of fit tests based on it. We provide an integral and Kolmogorov- Smirnov type statistics based on U-statistics and we calculate Bahadur efficiency for various alternatives. We find locally optimal alternatives for those tests. For small sample sizes we compare the power of those tests with some common goodness of fit tests.
△ Less
Submitted 27 April, 2014; v1 submitted 21 October, 2013;
originally announced October 2013.
-
Sparsity-promoting dynamic mode decomposition
Authors:
Mihailo R. Jovanović,
Peter J. Schmid,
Joseph W. Nichols
Abstract:
Dynamic mode decomposition (DMD) represents an effective means for capturing the essential features of numerically or experimentally generated flow fields. In order to achieve a desirable tradeoff between the quality of approximation and the number of modes that are used to approximate the given fields, we develop a sparsity-promoting variant of the standard DMD algorithm. In our method, sparsity…
▽ More
Dynamic mode decomposition (DMD) represents an effective means for capturing the essential features of numerically or experimentally generated flow fields. In order to achieve a desirable tradeoff between the quality of approximation and the number of modes that are used to approximate the given fields, we develop a sparsity-promoting variant of the standard DMD algorithm. In our method, sparsity is induced by regularizing the least-squares deviation between the matrix of snapshots and the linear combination of DMD modes with an additional term that penalizes the $\ell_1$-norm of the vector of DMD amplitudes. The globally optimal solution of the resulting regularized convex optimization problem is computed using the alternating direction method of multipliers, an algorithm well-suited for large problems. Several examples of flow fields resulting from numerical simulations and physical experiments are used to illustrate the effectiveness of the developed method.
△ Less
Submitted 16 September, 2013;
originally announced September 2013.
-
How user throughput depends on the traffic demand in large cellular networks
Authors:
Bartlomiej Blaszczyszyn,
Miodrag Jovanovic,
Mohamed Kadhem Karray
Abstract:
Little's law allows to express the mean user throughput in any region of the network as the ratio of the mean traffic demand to the steady-state mean number of users in this region. Corresponding statistics are usually collected in operational networks for each cell. Using ergodic arguments and Palm theoretic formalism, we show that the global mean user throughput in the network is equal to the ra…
▽ More
Little's law allows to express the mean user throughput in any region of the network as the ratio of the mean traffic demand to the steady-state mean number of users in this region. Corresponding statistics are usually collected in operational networks for each cell. Using ergodic arguments and Palm theoretic formalism, we show that the global mean user throughput in the network is equal to the ratio of these two means in the steady state of the "typical cell". Here, both means account for double averaging: over time and network geometry, and can be related to the per-surface traffic demand, base-station density and the spatial distribution of the SINR. This latter accounts for network irregularities, shadowing and idling cells via cell-load equations. We validate our approach comparing analytical and simulation results for Poisson network model to real-network cell-measurements.
△ Less
Submitted 24 March, 2014; v1 submitted 31 July, 2013;
originally announced July 2013.
-
Sparsity-Promoting Optimal Wide-Area Control of Power Networks
Authors:
Florian Dörfler,
Mihailo R. Jovanovic,
Michael Chertkov,
Francesco Bullo
Abstract:
Inter-area oscillations in bulk power systems are typically poorly controllable by means of local decentralized control. Recent research efforts have been aimed at developing wide- area control strategies that involve communication of remote signals. In conventional wide-area control, the control structure is fixed a priori typically based on modal criteria. In contrast, here we employ the recentl…
▽ More
Inter-area oscillations in bulk power systems are typically poorly controllable by means of local decentralized control. Recent research efforts have been aimed at developing wide- area control strategies that involve communication of remote signals. In conventional wide-area control, the control structure is fixed a priori typically based on modal criteria. In contrast, here we employ the recently-introduced paradigm of sparsity- promoting optimal control to simultaneously identify the optimal control structure and optimize the closed-loop performance. To induce a sparse control architecture, we regularize the standard quadratic performance index with an l1-penalty on the feedback matrix. The quadratic objective functions are inspired by the classic slow coherency theory and are aimed at imitating homogeneous networks without inter-area oscillations. We use the New England power grid model to demonstrate that the proposed combination of the sparsity-promoting control design with the slow coherency objectives performs almost as well as the optimal centralized control while only making use of a single wide-area communication link. In addition to this nominal performance, we also demonstrate that our control strategy yields favorable robustness margins and that it can be used to identify a sparse control architecture for control design via alternative means.
△ Less
Submitted 12 November, 2013; v1 submitted 16 July, 2013;
originally announced July 2013.
-
Quality of Real-Time Streaming in Wireless Cellular Networks - Stochastic Modeling and Analysis
Authors:
Bartlomiej Blaszczyszyn,
Miodrag Jovanovic,
Mohamed Kadhem Karray
Abstract:
We present a new stochastic service model with capacity sharing and interruptions, appropriate for the evaluation of the quality of real-time streaming (RTS), like e.g. mobile TV, in wireless cellular networks. The general model takes into account multi-class Markovian process of call arrivals, (to capture different radio channel conditions, requested streaming bit-rates and durations) and allows…
▽ More
We present a new stochastic service model with capacity sharing and interruptions, appropriate for the evaluation of the quality of real-time streaming (RTS), like e.g. mobile TV, in wireless cellular networks. The general model takes into account multi-class Markovian process of call arrivals, (to capture different radio channel conditions, requested streaming bit-rates and durations) and allows for a general resource allocation policy saying which users are temporarily denied the requested fixed streaming bit-rates (put in outage) due to resource constraints. We give expressions for several important performance characteristics of the model, including mean time spent in outage and mean number of outage incidents for a typical user of a given class. These expressions involve only stationary probabilities of the (free) traffic demand process, which is a vector of independent Poisson random variables describing the number of users of different classes. In order to analyze RTS in 3GPP Long Term Evolution (LTE) cellular networks, we specify our general model assuming orthogonal user channels with the peak bit-rates close to the theoretical Shannon's bound in the additive white Gaussian noise (AWGN) channel, which leads to the resource constraints in a multi-rate linear form. In this setting we consider a natural class of least-effort-served-first resource allocation policies, for which the characteristics of the model can be further evaluated using Fourier analysis of Poisson variables. Within this class we identify and evaluate an optimal and a fair policy, the latter being suggested by LTE implementations. We also propose some intermediate policies, which allow to solve the optimality/fairness tradeoff caused by unequal user radio-channel conditions. Our results can be used for the evaluation of the quality of RTS in LTE networks and dimensioning of these networks.
△ Less
Submitted 4 March, 2014; v1 submitted 18 April, 2013;
originally announced April 2013.
-
Worst-case amplification of disturbances in inertialess Couette flow of viscoelastic fluids
Authors:
Binh K. Lieu,
Mihailo R. Jovanović,
Satish Kumar
Abstract:
Amplification of deterministic disturbances in inertialess shear-driven channel flows of viscoelastic fluids is examined by analyzing the frequency responses from spatio-temporal body forces to the velocity and polymer stress fluctuations. In strongly elastic flows, we show that disturbances with large streamwise length scales may be significantly amplified even in the absence of inertia. For fluc…
▽ More
Amplification of deterministic disturbances in inertialess shear-driven channel flows of viscoelastic fluids is examined by analyzing the frequency responses from spatio-temporal body forces to the velocity and polymer stress fluctuations. In strongly elastic flows, we show that disturbances with large streamwise length scales may be significantly amplified even in the absence of inertia. For fluctuations without streamwise variations, we derive explicit analytical expressions for the dependence of the worst-case amplification (from different forcing to different velocity and polymer stress components) on the Weissenberg number ($We$), the maximum extensibility of the polymer chains ($L$), the viscosity ratio, and the spanwise wavenumber. For the Oldroyd-B model, the amplification of the most energetic components of velocity and polymer stress fields scales as $We^2$ and $We^4$. On the other hand, finite extensibility of polymer molecules limits the largest achievable amplification even in flows with infinitely large Weissenberg numbers: in the presence of wall-normal and spanwise forces the amplification of the streamwise velocity and polymer stress fluctuations is bounded by quadratic and quartic functions of $L$. This high amplification signals low robustness to modeling imperfections of inertialess channel flows of viscoelastic fluids. The underlying physical mechanism involves interactions of polymer stress fluctuations with a base shear, and it represents a close analog of the lift-up mechanism that initiates a bypass transition in inertial flows of Newtonian fluids.
△ Less
Submitted 19 February, 2013;
originally announced February 2013.
-
Algorithms for leader selection in stochastically forced consensus networks
Authors:
Fu Lin,
Makan Fardad,
Mihailo R. Jovanović
Abstract:
We are interested in assigning a pre-specified number of nodes as leaders in order to minimize the mean-square deviation from consensus in stochastically forced networks. This problem arises in several applications including control of vehicular formations and localization in sensor networks. For networks with leaders subject to noise, we show that the Boolean constraints (a node is either a leade…
▽ More
We are interested in assigning a pre-specified number of nodes as leaders in order to minimize the mean-square deviation from consensus in stochastically forced networks. This problem arises in several applications including control of vehicular formations and localization in sensor networks. For networks with leaders subject to noise, we show that the Boolean constraints (a node is either a leader or it is not) are the only source of nonconvexity. By relaxing these constraints to their convex hull we obtain a lower bound on the global optimal value. We also use a simple but efficient greedy algorithm to identify leaders and to compute an upper bound. For networks with leaders that perfectly follow their desired trajectories, we identify an additional source of nonconvexity in the form of a rank constraint. Removal of the rank constraint and relaxation of the Boolean constraints yields a semidefinite program for which we develop a customized algorithm well-suited for large networks. Several examples ranging from regular lattices to random graphs are provided to illustrate the effectiveness of the developed algorithms.
△ Less
Submitted 29 May, 2013; v1 submitted 2 February, 2013;
originally announced February 2013.
-
Design of optimal sparse interconnection graphs for synchronization of oscillator networks
Authors:
Makan Fardad,
Fu Lin,
Mihailo R. Jovanović
Abstract:
We study the optimal design of a conductance network as a means for synchronizing a given set of oscillators. Synchronization is achieved when all oscillator voltages reach consensus, and performance is quantified by the mean-square deviation from the consensus value. We formulate optimization problems that address the trade-off between synchronization performance and the number and strength of os…
▽ More
We study the optimal design of a conductance network as a means for synchronizing a given set of oscillators. Synchronization is achieved when all oscillator voltages reach consensus, and performance is quantified by the mean-square deviation from the consensus value. We formulate optimization problems that address the trade-off between synchronization performance and the number and strength of oscillator couplings. We promote the sparsity of the coupling network by penalizing the number of interconnection links. For identical oscillators, we establish convexity of the optimization problem and demonstrate that the design problem can be formulated as a semidefinite program. Finally, for special classes of oscillator networks we derive explicit analytical expressions for the optimal conductance values.
△ Less
Submitted 10 October, 2013; v1 submitted 2 February, 2013;
originally announced February 2013.
-
Model-based design of transverse wall oscillations for turbulent drag reduction
Authors:
Rashad Moarref,
Mihailo R. Jovanović
Abstract:
Over the last two decades, both experiments and simulations have demonstrated that transverse wall oscillations with properly selected amplitude and frequency can reduce turbulent drag by as much as 40%. In this paper, we develop a model-based approach for designing oscillations that suppress turbulence in a channel flow. We utilize eddy-viscosity-enhanced linearization of the turbulent flow with…
▽ More
Over the last two decades, both experiments and simulations have demonstrated that transverse wall oscillations with properly selected amplitude and frequency can reduce turbulent drag by as much as 40%. In this paper, we develop a model-based approach for designing oscillations that suppress turbulence in a channel flow. We utilize eddy-viscosity-enhanced linearization of the turbulent flow with control in conjunction with turbulence modeling to determine skin-friction drag in a simulation-free manner. The Boussinesq eddy viscosity hypothesis is used to quantify the effect of fluctuations on the mean velocity in the flow subject to control. In contrast to the traditional approach that relies on numerical simulations, we determine the turbulent viscosity from the second order statistics of the linearized model driven by white-in-time stochastic forcing. The spatial power spectrum of the forcing is selected to ensure that the linearized model for the uncontrolled flow reproduces the turbulent energy spectrum. The resulting correction to the turbulent mean velocity induced by small amplitude wall movements is then used to identify the optimal frequency of drag reducing oscillations. In addition, the control net efficiency and the turbulent flow structures that we obtain agree well with the results of numerical simulations and experiments. This demonstrates the predictive power of our model-based approach to controlling turbulent flows and is expected to pave the way for successful flow control at higher Reynolds numbers than currently possible.
△ Less
Submitted 1 June, 2012;
originally announced June 2012.
-
Optimal Control of Vehicular Formations with Nearest Neighbor Interactions
Authors:
Fu Lin,
Makan Fardad,
Mihailo R. Jovanović
Abstract:
We consider the design of optimal localized feedback gains for one-dimensional formations in which vehicles only use information from their immediate neighbors. The control objective is to enhance coherence of the formation by making it behave like a rigid lattice. For the single-integrator model with symmetric gains, we establish convexity, implying that the globally optimal controller can be com…
▽ More
We consider the design of optimal localized feedback gains for one-dimensional formations in which vehicles only use information from their immediate neighbors. The control objective is to enhance coherence of the formation by making it behave like a rigid lattice. For the single-integrator model with symmetric gains, we establish convexity, implying that the globally optimal controller can be computed efficiently. We also identify a class of convex problems for double-integrators by restricting the controller to symmetric position and uniform diagonal velocity gains. To obtain the optimal non-symmetric gains for both the single- and the double-integrator models, we solve a parameterized family of optimal control problems ranging from an easily solvable problem to the problem of interest as the underlying parameter increases. When this parameter is kept small, we employ perturbation analysis to decouple the matrix equations that result from the optimality conditions, thereby rendering the unique optimal feedback gain. This solution is used to initialize a homotopy-based Newton's method to find the optimal localized gain. To investigate the performance of localized controllers, we examine how the coherence of large-scale stochastically forced formations scales with the number of vehicles. We establish several explicit scaling relationships and show that the best performance is achieved by a localized controller that is both non-symmetric and spatially-varying.
△ Less
Submitted 17 December, 2011;
originally announced December 2011.