-
On the Uniqueness of Nash Equilibria in Multiagent Matrix Games
Authors:
James P. Bailey
Abstract:
We provide a complete characterization for uniqueness of equilibria in unconstrained polymatrix games. We show that while uniqueness is natural for coordination and general polymatrix games, zero-sum games require that the dimension of the combined strategy space is even. Therefore, non-uniqueness is common in zero-sum polymatrix games. In addition, we study the impact of non-uniqueness on classical learning dynamics for multiagent systems and show that the classical methods still yield unique estimates even when there is not a unique equilibrium.
Submitted 21 October, 2024;
originally announced October 2024.
-
On the Approximability of the Yolk in the Spatial Model of Voting
Authors:
Ran Hu,
James P. Bailey
Abstract:
In the spatial model of voting, the yolk and LP (linear programming) yolk are important solution concepts for predicting outcomes for a committee of voters. McKelvey and Tovey showed that the LP yolk provides a lower bound approximation for the size of the yolk, and there has been considerable debate on whether the LP yolk is a good approximation of the yolk. In this paper, we show that for an odd number of voters in a two-dimensional space, the yolk radius is at most twice the LP yolk radius. However, we also show that (1) even in this setting, the LP yolk center can be arbitrarily far away from the yolk center (relative to the radius of the yolk) and (2) in all other settings (an even number of voters or dimension $k\geq 3$), the LP yolk can be arbitrarily small relative to the yolk. Thus, in general, the LP yolk can be an arbitrarily poor approximation of the yolk.
Submitted 14 October, 2024;
originally announced October 2024.
-
Analyzing Wage Theft in Day Labor Markets via Principal Agent Models
Authors:
James P. Bailey,
Bahar Cavdar,
Yanling Chang
Abstract:
In day labor markets, workers are particularly vulnerable to wage theft. This paper introduces a principal-agent model to analyze the conditions required to mitigate wage theft through fines and establishes the necessary and sufficient conditions to reduce theft. We find that the fines necessary to eliminate theft are significantly larger than those imposed by current labor laws, making wage theft likely to persist under penalty-based methods alone. Through numerical analysis, we show how wage theft disproportionately affects workers with lower reservation utilities and observe that workers with similar reservation utilities experience comparable impacts, regardless of their skill levels. To address the limitations of penalty-based approaches, we extend the model to a dynamic game incorporating worker awareness. We prove that wage theft can be fully eliminated if workers accurately predict theft using historical data and employers follow an optimal fixed wage strategy. Additionally, sharing wage theft information becomes an effective long-term solution when employers use any given fixed wage strategy, emphasizing the importance of raising worker awareness through various channels.
Submitted 11 October, 2024;
originally announced October 2024.
-
$O\left(1/T\right)$ Time-Average Convergence in a Generalization of Multiagent Zero-Sum Games
Authors:
James P. Bailey
Abstract:
We introduce a generalization of zero-sum network multiagent matrix games and prove that alternating gradient descent converges to the set of Nash equilibria at rate $O(1/T)$ for this set of games. Alternating gradient descent obtains this convergence guarantee while using fixed learning rates that are four times larger than the optimistic variant of gradient descent. Experimentally, we show with 97.5% confidence that, on average, these larger learning rates result in time-averaged strategies that are 2.585 times closer to the set of Nash equilibria than optimistic gradient descent.
Submitted 5 October, 2021;
originally announced October 2021.
-
Stochastic Multiplicative Weights Updates in Zero-Sum Games
Authors:
James P. Bailey,
Sai Ganesh Nagarajan,
Georgios Piliouras
Abstract:
We study agents competing against each other in a repeated network zero-sum game while applying the multiplicative weights update (MWU) algorithm with fixed learning rates. In our implementation, agents select their strategies probabilistically in each iteration and update their weights/strategies using the realized vector payoff of all strategies, i.e., stochastic MWU with full information. We show that the system results in an irreducible Markov chain where agent strategies diverge from the set of Nash equilibria. Further, we show that agents will play pure strategies with probability 1 in the limit.
Submitted 5 October, 2021;
originally announced October 2021.
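The stochastic MWU setup described in the abstract above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses a two-agent game rather than a general network, the exponential (Hedge-style) form of the multiplicative update, and a hypothetical function name `stochastic_mwu`. Each agent samples a pure strategy from its current mixed strategy and then updates the weight of every strategy using the payoff that strategy would have earned against the opponent's realized choice (full information).

```python
import numpy as np

def stochastic_mwu(A, eta=0.1, steps=2000, seed=0):
    """Stochastic MWU in a two-agent zero-sum game (illustrative sketch).

    The row agent receives A[i, j] and the column agent receives -A[i, j].
    Assumption: exponential-weights form of MWU with a fixed learning rate eta.
    Returns the list of mixed strategies (x, y) played in each iteration.
    """
    rng = np.random.default_rng(seed)
    n, m = A.shape
    wx = np.ones(n) / n  # row agent's normalized weights
    wy = np.ones(m) / m  # column agent's normalized weights
    history = []
    for _ in range(steps):
        x, y = wx / wx.sum(), wy / wy.sum()
        history.append((x.copy(), y.copy()))
        # Agents select pure strategies probabilistically.
        i = rng.choice(n, p=x)
        j = rng.choice(m, p=y)
        # Full-information update: payoff of every own strategy
        # against the opponent's realized pure strategy.
        wx = wx * np.exp(eta * A[:, j])
        wy = wy * np.exp(-eta * A[i, :])
        # Renormalize so the weights stay numerically well behaved.
        wx, wy = wx / wx.sum(), wy / wy.sum()
    return history

# Matching Pennies as a toy example.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
trajectory = stochastic_mwu(A, eta=0.1, steps=2000)
x_final, y_final = trajectory[-1]
print("final mixed strategies:", x_final, y_final)
```

Plotting the first coordinate of each agent's strategy over time is one way to inspect whether a given run drifts away from the uniform equilibrium of Matching Pennies; a single sampled trajectory is only suggestive and does not substitute for the paper's analysis.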
-
Conditions for Stability in Strategic Matching
Authors:
James P. Bailey,
Craig A. Tovey
Abstract:
We consider the stability of matchings when individuals strategically submit preference information to a publicly known algorithm. Most pure Nash equilibria of the ensuing game yield a matching that is unstable with respect to the individuals' sincere preferences. We introduce a well-supported minimal dishonesty constraint, and obtain conditions under which every pure Nash equilibrium yields a matching that is stable with respect to the sincere preferences. The matching algorithm must be either fully randomized, or monotonic and independent of non-spouses (INS), an IIA-like property. These conditions are significant because they support the use of algorithms other than the Gale-Shapley (man-optimal) algorithm for kidney exchange and other applications. We prove that the Gale-Shapley algorithm always yields the woman-optimal matching when individuals are minimally dishonest. However, we give a negative answer to one of Gusfield and Irving's open questions: there is no monotonic INS or fully randomized stable matching algorithm that is certain to yield the egalitarian-optimal matching when individuals are strategic and minimally dishonest. Finally, we show that these results extend to the student placement problem, where women are polyandrous but must be honest, but do not extend to the admissions problem, where women are both polyandrous and strategic.
Submitted 9 August, 2021;
originally announced August 2021.
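For readers unfamiliar with the Gale-Shapley algorithm named in the abstract above, the following is a minimal sketch of the standard man-proposing deferred acceptance procedure applied to submitted preference lists, plus a check for blocking pairs. It does not model strategic reporting or the minimal dishonesty constraint studied in the paper; the function names and the two-by-two example are hypothetical and chosen only for illustration.

```python
def gale_shapley(men_prefs, women_prefs):
    """Man-proposing deferred acceptance.

    men_prefs[m] and women_prefs[w] are complete preference lists
    (most preferred first) over indices 0..n-1.
    Returns a dict mapping each man to his matched woman.
    """
    n = len(men_prefs)
    # rank[w][m] = position of man m in woman w's list (lower is better).
    rank = [{m: r for r, m in enumerate(prefs)} for prefs in women_prefs]
    next_choice = [0] * n   # index of the next woman each man will propose to
    husband = [None] * n    # current tentative partner of each woman
    free = list(range(n))   # men who are currently unmatched
    while free:
        m = free.pop()
        w = men_prefs[m][next_choice[m]]
        next_choice[m] += 1
        current = husband[w]
        if current is None:
            husband[w] = m
        elif rank[w][m] < rank[w][current]:
            husband[w] = m          # w trades up; her old partner is free again
            free.append(current)
        else:
            free.append(m)          # w rejects m; he proposes again later
    return {m: w for w, m in enumerate(husband)}

def is_stable(matching, men_prefs, women_prefs):
    """Return True if no man-woman pair prefers each other to their partners."""
    husband = {w: m for m, w in matching.items()}
    for m, prefs in enumerate(men_prefs):
        for w in prefs:
            if w == matching[m]:
                break  # every remaining woman is ranked below m's partner
            # m prefers w to his partner; check whether w prefers m as well.
            if women_prefs[w].index(m) < women_prefs[w].index(husband[w]):
                return False
    return True

# Hypothetical sincere preferences for two men and two women.
men = [[0, 1], [0, 1]]
women = [[1, 0], [0, 1]]
match = gale_shapley(men, women)
print(match, is_stable(match, men, women))
```

The stability check is deliberately separate from the algorithm so that it can be run against the sincere preferences even when the matching was computed from different (misreported) lists, which is the comparison the abstract is concerned with.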
-
Finite Regret and Cycles with Fixed Step-Size via Alternating Gradient Descent-Ascent
Authors:
James P. Bailey,
Gauthier Gidel,
Georgios Piliouras
Abstract:
Gradient descent is arguably one of the most popular online optimization methods with a wide array of applications. However, the standard implementation where agents simultaneously update their strategies yields several undesirable properties: strategies diverge away from equilibrium and regret grows over time. In this paper, we eliminate these negative properties by introducing a different implementation to obtain finite regret with an arbitrary fixed step-size. We obtain this surprising property by having agents take turns when updating their strategies. In this setting, we show that an agent that uses gradient descent obtains bounded regret -- regardless of how their opponent updates their strategies. Furthermore, we show that, in adversarial settings, agents' strategies are bounded and cycle when both agents use the alternating gradient descent algorithm.
Submitted 9 July, 2019;
originally announced July 2019.
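The contrast between simultaneous and alternating updates described in the abstract above can be seen in a toy example. The sketch below assumes the simplest unconstrained bilinear game, $f(x, y) = xy$ with $x$ minimizing and $y$ maximizing, and a step size of 0.5; these choices and the function names are illustrative assumptions, not the paper's experimental setup. With simultaneous updates the distance from the equilibrium $(0, 0)$ grows by a factor of $\sqrt{1 + \eta^2}$ every step, while the alternating updates preserve the quantity $x^2 + y^2 - \eta x y$ and so stay bounded for this step size.

```python
import math

def simultaneous_gda(x, y, eta, steps):
    """Both agents update from the same previous iterate."""
    norms = []
    for _ in range(steps):
        x, y = x - eta * y, y + eta * x
        norms.append(math.hypot(x, y))
    return norms

def alternating_gda(x, y, eta, steps):
    """Agents take turns: y responds to the already-updated x."""
    norms = []
    for _ in range(steps):
        x = x - eta * y   # descent step for the minimizing agent
        y = y + eta * x   # ascent step using the new x
        norms.append(math.hypot(x, y))
    return norms

sim = simultaneous_gda(1.0, 1.0, eta=0.5, steps=200)
alt = alternating_gda(1.0, 1.0, eta=0.5, steps=200)
print("simultaneous, final distance from equilibrium:", sim[-1])
print("alternating, largest distance from equilibrium:", max(alt))
```

In this unconstrained toy problem the alternating iterates remain bounded only for step sizes below 2, so the example should be read as an illustration of the mechanism rather than a reproduction of the paper's guarantee for arbitrary fixed step-sizes.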
-
Fast and Furious Learning in Zero-Sum Games: Vanishing Regret with Non-Vanishing Step Sizes
Authors:
James P. Bailey,
Georgios Piliouras
Abstract:
We show for the first time, to our knowledge, that it is possible to reconcile in online learning in zero-sum games two seemingly contradictory objectives: vanishing time-average regret and non-vanishing step sizes. This phenomenon, which we coin "fast and furious" learning in games, sets a new benchmark for what is possible both in max-min optimization and in multi-agent systems. Our analysis does not depend on introducing a carefully tailored dynamic. Instead we focus on the most well studied online dynamic, gradient descent. Similarly, we focus on the simplest textbook class of games, two-agent two-strategy zero-sum games, such as Matching Pennies. Even for this simplest of benchmarks the best known bound for total regret, prior to our work, was the trivial one of $O(T)$, which is immediately applicable even to a non-learning agent. Based on a tight understanding of the geometry of the non-equilibrating trajectories in the dual space we prove a regret bound of $\Theta(\sqrt{T})$ matching the well known optimal bound for adaptive step sizes in the online setting. This guarantee holds for all fixed step-sizes without having to know the time horizon in advance and adapt the fixed step-size accordingly. As a corollary, we establish that even with fixed learning rates the time-averages of mixed strategies and utilities converge to their exact Nash equilibrium values.
Submitted 11 May, 2019;
originally announced May 2019.
-
Multi-Agent Learning in Network Zero-Sum Games is a Hamiltonian System
Authors:
James P. Bailey,
Georgios Piliouras
Abstract:
Zero-sum games are natural, if informal, analogues of closed physical systems where no energy/utility can enter or exit. This analogy can be extended even further if we consider zero-sum network (polymatrix) games where multiple agents interact in a closed economy. Typically, (network) zero-sum games are studied from the perspective of Nash equilibria. Nevertheless, this comes in contrast with the way we typically think about closed physical systems, e.g., Earth-moon systems which move perpetually along recurrent trajectories of constant energy.
We establish a formal and robust connection between multi-agent systems and Hamiltonian dynamics -- the same dynamics that describe conservative systems in physics. Specifically, we show that no matter the size or network structure of such closed economies, even if agents use different online learning dynamics from the standard class of Follow-the-Regularized-Leader, they yield Hamiltonian dynamics. This approach generalizes the known connection to Hamiltonians for the special case of replicator dynamics in two-agent zero-sum games developed by Hofbauer. Moreover, our results extend beyond zero-sum settings and provide a type of Rosetta stone (see e.g. Table 1) that helps to translate results and techniques between online optimization, convex analysis, game theory, and physics.
Submitted 5 March, 2019;
originally announced March 2019.