Search | arXiv e-print repository

Linear Quadratic Mean Field Games with Quantile-Dependent Cost Coefficients

Abstract: This paper studies a class of linear quadratic mean field games where the coefficients of quadratic cost functions depend on both the mean and the variance of the population's state distribution through its quantile function. Such a formulation allows for modelling agents that are sensitive to not only the population average but also the population variance. The corresponding mean field game equil… ▽ More This paper studies a class of linear quadratic mean field games where the coefficients of quadratic cost functions depend on both the mean and the variance of the population's state distribution through its quantile function. Such a formulation allows for modelling agents that are sensitive to not only the population average but also the population variance. The corresponding mean field game equilibrium is identified, which involves solving two coupled differential equations: one is a Riccati equation and the other the variance evolution equation. Furthermore, the conditions for the existence and uniqueness of the mean field equilibrium are established. Finally, numerical results are presented to illustrate the behavior of two coupled differential equations and the performance of the mean field game solution. △ Less

Submitted 3 November, 2024; originally announced November 2024.

Comments: 15 pages

arXiv:1803.00040 [pdf, other]

An integral control formulation of Mean-field game based large scale coordination of loads in smart grids

Authors: Arman C. Kizilkale, Rabih Salhab, Roland P. Malhame

Abstract: Pressure on ancillary reserves, i.e.frequency preserving, in power systems has significantly mounted due to the recent generalized increase of the fraction of (highly fluctuating) wind and solar energy sources in grid generation mixes. The energy storage associated with millions of individual customer electric thermal (heating-cooling) loads is considered as a tool for smoothing power demand/gener… ▽ More Pressure on ancillary reserves, i.e.frequency preserving, in power systems has significantly mounted due to the recent generalized increase of the fraction of (highly fluctuating) wind and solar energy sources in grid generation mixes. The energy storage associated with millions of individual customer electric thermal (heating-cooling) loads is considered as a tool for smoothing power demand/generation imbalances. The piecewise constant level tracking problem of their collective energy content is formulated as a linear quadratic mean field game problem with integral control in the cost coefficients. The introduction of integral control brings with it a robustness potential to mismodeling, but also the potential of cost coefficient unboundedness. A suitable Banach space is introduced to establish the existence of Nash equilibria for the corresponding infinite population game, and algorithms are proposed for reliably computing a class of desirable near Nash equilibria. Numerical simulations illustrate the flexibility and robustness of the approach. △ Less

Submitted 22 October, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

arXiv:1606.05272 [pdf, other]

Dynamic Collective Choice: Social Optima

Authors: Rabih Salhab, Jerome Le Ny, Roland P. Malhamé

Abstract: We consider a dynamic collective choice problem where a large number of players are cooperatively choosing between multiple destinations while being influenced by the behavior of the group. For example, in a robotic swarm exploring a new environment, a robot might have to choose between multiple sites to visit, but at the same time it should remain close to the group to achieve some coordinated ta… ▽ More We consider a dynamic collective choice problem where a large number of players are cooperatively choosing between multiple destinations while being influenced by the behavior of the group. For example, in a robotic swarm exploring a new environment, a robot might have to choose between multiple sites to visit, but at the same time it should remain close to the group to achieve some coordinated tasks. We show that to find a social optimum for our problem, one needs to solve a set of Linear Quadratic Regulator problems, whose number increases exponentially with the size of the population. Alternatively, we develop via the Mean Field Games methodology a set of decentralized strategies that are independent of the size of the population. When the number of agents is sufficiently large, these strategies qualify as approximately socially optimal. To compute the approximate social optimum, each player needs to know its own state and the statistical distributions of the players' initial states and problem parameters. Finally, we give a numerical example where the cooperative and noncooperative cases have opposite behaviors. Whereas in the former the size of the majority increases with the social effect, in the latter, the existence of a majority is disadvantaged. △ Less

Submitted 16 June, 2016; originally announced June 2016.

arXiv:1604.08136 [pdf, other]

Collective Stochastic Discrete Choice Problems: A Min-LQG Game Formulation

Authors: Rabih Salhab, Roland P. Malhamé, Jerome Le Ny

Abstract: We consider a class of dynamic collective choice models with social interactions, whereby a large number of non-uniform agents have to individually settle on one of multiple discrete alternative choices, with the relevance of their would-be choices continuously impacted by noise and the unfolding group behavior. This class of problems is modeled here as a so-called Min-LQG game, i.e., a linear qua… ▽ More We consider a class of dynamic collective choice models with social interactions, whereby a large number of non-uniform agents have to individually settle on one of multiple discrete alternative choices, with the relevance of their would-be choices continuously impacted by noise and the unfolding group behavior. This class of problems is modeled here as a so-called Min-LQG game, i.e., a linear quadratic Gaussian dynamic and non-cooperative game, with an additional combinatorial aspect in that it includes a final choice-related minimization in its terminal cost. The presence of this minimization term is key to enforcing some specific discrete choice by each individual agent. The theory of mean field games is invoked to generate a class of decentralized agent feedback control strategies which are then shown to converge to an exact Nash equilibrium of the game as the number of players increases to infinity. A key building block in our approach is an explicit solution to the problem of computing the best response of a generic agent to some arbitrarily posited smooth mean field trajectory. Ultimately, an agent is shown to face a continuously revised discrete choice problem, where greedy choices dictated by current conditions must be constantly balanced against the risk of the future process noise upsetting the wisdom of such decisions.Even though an agent's ultimately chosen alternative is random and dictated by its entire noise history and initial state, the limiting infinite population macroscopic behavior can still be predicted. It is shown that any Nash equilibrium of the game is defined by an a priori computable probability matrix characterizing the manner in which the agent population ultimately splits among the available alternatives. △ Less

Submitted 17 August, 2017; v1 submitted 27 April, 2016; originally announced April 2016.

arXiv:1506.09210 [pdf, other]

A Dynamic Game Model of Collective Choice in Multi-Agent Systems

Authors: Rabih Salhab, Roland P. Malhamé, Jerome Le Ny

Abstract: Inspired by successful biological collective decision mechanisms such as honey bees searching for a new colony or the collective navigation of fish schools, we consider a mean field games (MFG)-like scenario where a large number of agents have to make a choice among a set of different potential target destinations. Each individual both influences and is influenced by the group's decision, as well… ▽ More Inspired by successful biological collective decision mechanisms such as honey bees searching for a new colony or the collective navigation of fish schools, we consider a mean field games (MFG)-like scenario where a large number of agents have to make a choice among a set of different potential target destinations. Each individual both influences and is influenced by the group's decision, as well as the mean trajectory of all the agents. The model can be interpreted as a stylized version of opinion crystallization in an election for example. The agents' biases are dictated first by their initial spatial position and, in a subsequent generalization of the model, by a combination of initial position and a priori individual preference. The agents have linear dynamics and are coupled through a modified form of quadratic cost. Fixed point based finite population equilibrium conditions are identified and associated existence conditions are established. In general multiple equilibria may exist and the agents need to know all initial conditions to compute them precisely. However, as the number of agents increases sufficiently, we show that 1) the computed fixed point equilibria qualify as epsilon Nash equilibria, 2) agents no longer require all initial conditions to compute the equilibria but rather can do so based on a representative probability distribution of these conditions now viewed as random variables. Numerical results are reported. △ Less

Submitted 24 January, 2016; v1 submitted 30 June, 2015; originally announced June 2015.

arXiv:1409.7091 [pdf, other]

Eminence Grise Coalitions: On the Shaping of Public Opinion

Authors: Sadegh Bolouki, Roland P. Malhame, Milad Siami, Nader Motee

Abstract: We consider a network of evolving opinions. It includes multiple individuals with first-order opinion dynamics defined in continuous time and evolving based on a general exogenously defined time-varying underlying graph. In such a network, for an arbitrary fixed initial time, a subset of individuals forms an eminence grise coalition, abbreviated as EGC, if the individuals in that subset are capabl… ▽ More We consider a network of evolving opinions. It includes multiple individuals with first-order opinion dynamics defined in continuous time and evolving based on a general exogenously defined time-varying underlying graph. In such a network, for an arbitrary fixed initial time, a subset of individuals forms an eminence grise coalition, abbreviated as EGC, if the individuals in that subset are capable of leading the entire network to agreeing on any desired opinion, through a cooperative choice of their own initial opinions. In this endeavor, the coalition members are assumed to have access to full profile of the underlying graph of the network as well as the initial opinions of all other individuals. While the complete coalition of individuals always qualifies as an EGC, we establish the existence of a minimum size EGC for an arbitrary time-varying network; also, we develop a non-trivial set of upper and lower bounds on that size. As a result, we show that, even when the underlying graph does not guarantee convergence to a global or multiple consensus, a generally restricted coalition of agents can steer public opinion towards a desired global consensus without affecting any of the predefined graph interactions, provided they can cooperatively adjust their own initial opinions. Geometric insights into the structure of EGC's are given. The results are also extended to the discrete time case where the relation with Decomposition-Separation Theorem is also made explicit. △ Less

Submitted 24 September, 2014; originally announced September 2014.

Comments: 35 pages

arXiv:1303.6674 [pdf, other]

Consensus Algorithms and the Decomposition-Separation Theorem

Authors: Sadegh Bolouki, Roland P. Malhame

Abstract: Convergence properties of time inhomogeneous Markov chain based discrete and continuous time linear consensus algorithms are analyzed. Provided that a so-called infinite jet flow property is satisfied by the underlying chains, necessary conditions for both consensus and multiple consensus are established. A recenet extension by Sonin of the classical Kolmogorov-Doeblin decomposition-separation for… ▽ More Convergence properties of time inhomogeneous Markov chain based discrete and continuous time linear consensus algorithms are analyzed. Provided that a so-called infinite jet flow property is satisfied by the underlying chains, necessary conditions for both consensus and multiple consensus are established. A recenet extension by Sonin of the classical Kolmogorov-Doeblin decomposition-separation for homogeneous Markov chains to the inhomogeneous case is then employed to show that the obtained necessary conditions are also sufficient when the chain is of Class P*, as defined by Touri and Nedic. It is also shown that Sonin's theorem leads to a rediscovery and generalization of most of the existing related consensus results in the literature. △ Less

Submitted 24 September, 2014; v1 submitted 26 March, 2013; originally announced March 2013.

Comments: 33 pages

arXiv:1204.6624 [pdf, ps, other]

Theorems about Ergodicity and Class-Ergodicity of Chains with Applications in Known Consensus Models

Authors: Sadegh Bolouki, Roland P. Malhame

Abstract: In a multi-agent system, unconditional (multiple) consensus is the property of reaching to (multiple) consensus irrespective of the instant and values at which states are initialized. For linear algorithms, occurrence of unconditional (multiple) consensus turns out to be equivalent to (class-) ergodicity of the transition chain (A_n). For a wide class of chains, chains with so-called balanced asym… ▽ More In a multi-agent system, unconditional (multiple) consensus is the property of reaching to (multiple) consensus irrespective of the instant and values at which states are initialized. For linear algorithms, occurrence of unconditional (multiple) consensus turns out to be equivalent to (class-) ergodicity of the transition chain (A_n). For a wide class of chains, chains with so-called balanced asymmetry property, necessary and sufficient conditions for ergodicity and class-ergodicity are derived. The results are employed to analyze the limiting behavior of agents' states in the JLM model, the Krause model, and the Cucker-Smale model. In particular, unconditional single or multiple consensus occurs in all three models. Moreover, a necessary and sufficient condition for unconditional consensus in the JLM model and a sufficient condition for consensus in the Cucker-Smale model are obtained. △ Less

Submitted 26 March, 2013; v1 submitted 30 April, 2012; originally announced April 2012.

Comments: 7 pages

arXiv:1204.6093 [pdf, ps, other]

Linear Consensus Algorithms Based on Balanced Asymmetric Chains

Authors: Sadegh Bolouki, Roland P. Malhame

Abstract: Multi agent consensus algorithms with update steps based on so-called balanced asymmetric chains, are analyzed. For such algorithms it is shown that (i) the set of accumulation points of states is finite, (ii) the asymptotic unconditional occurrence of single consensus or multiple consensuses is directly related to the property of absolute infinite flow for the underlying update chain. The results… ▽ More Multi agent consensus algorithms with update steps based on so-called balanced asymmetric chains, are analyzed. For such algorithms it is shown that (i) the set of accumulation points of states is finite, (ii) the asymptotic unconditional occurrence of single consensus or multiple consensuses is directly related to the property of absolute infinite flow for the underlying update chain. The results are applied to well known consensus models. △ Less

Submitted 26 March, 2013; v1 submitted 26 April, 2012; originally announced April 2012.

Comments: 15 pages

Showing 1–9 of 9 results for author: Malhame, R P