Robust No-Regret Learning in Min-Max Stackelberg Games

Goktas, Denizalp; Zhao, Jiayi; Greenwald, Amy

arXiv:2203.14126 (cs)

[Submitted on 26 Mar 2022 (v1), last revised 13 Apr 2022 (this version, v2)]

Abstract: The behavior of no-regret learning algorithms is well understood in two-player min-max (i.e., zero-sum) games. In this paper, we investigate the behavior of no-regret learning in min-max games with dependent strategy sets, where the strategy of the first player constrains the behavior of the second. Such games are best understood as sequential, i.e., min-max Stackelberg, games. We consider two settings, one in which only the first player chooses their actions using a no-regret algorithm while the second player best responds, and one in which both players use no-regret algorithms. For the former case, we show that no-regret dynamics converge to a Stackelberg equilibrium. For the latter case, we introduce a new type of regret, which we call Lagrangian regret, and show that if both players minimize their Lagrangian regrets, then play converges to a Stackelberg equilibrium. We then observe that online mirror descent (OMD) dynamics in these two settings correspond respectively to a known nested (i.e., sequential) gradient descent-ascent (GDA) algorithm and a new simultaneous GDA-like algorithm, thereby establishing convergence of these algorithms to Stackelberg equilibrium. Finally, we analyze the robustness of OMD dynamics to perturbations by investigating online min-max Stackelberg games. We prove that OMD dynamics are robust for a large class of online min-max games with independent strategy sets. In the dependent case, we demonstrate the robustness of OMD dynamics experimentally by simulating them in online Fisher markets, a canonical example of a min-max Stackelberg game with dependent strategy sets.
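
As a rough illustration of the first setting (the first player runs a no-regret algorithm while the second best responds), the sketch below runs projected gradient descent, which is what OMD reduces to under a Euclidean regularizer, on a toy game. The game itself, min over x in [0, 1] of max over y in [-1, 1] with 1 - x - y >= 0 of x^2 + y + 1, is an assumption chosen because the best response and the constraint multiplier have closed forms; the names best_response and nested_gda, the step size, and the iteration count are likewise hypothetical. This is not the authors' implementation.

```python
def best_response(x):
    """Inner player's best response and the coupling-constraint multiplier.

    Toy game (assumed for illustration only):
        min_{x in [0, 1]}  max_{y in [-1, 1], 1 - x - y >= 0}  x**2 + y + 1
    The payoff increases in y, so y sits on the constraint: y = 1 - x.
    Stationarity of the Lagrangian in y gives 1 - lam = 0, hence lam = 1.
    """
    return 1.0 - x, 1.0


def nested_gda(x0=0.0, step=0.1, iters=200):
    """Outer projected gradient descent against a best-responding inner player."""
    x = x0
    for _ in range(iters):
        _, lam = best_response(x)
        # Gradient in x of the Lagrangian x**2 + y + 1 + lam * (1 - x - y),
        # evaluated at the inner player's best response.
        grad_x = 2.0 * x - lam
        # Descent step followed by projection onto the outer set [0, 1].
        x = min(1.0, max(0.0, x - step * grad_x))
    y, _ = best_response(x)
    return x, y


if __name__ == "__main__":
    x, y = nested_gda()
    print(f"x = {x:.4f}, y = {y:.4f}")
```

In this toy game the outer player's value function is x^2 - x + 2 on [0, 1], so the iterates approach x = 1/2 with best response y = 1/2. Note that the outer gradient includes the multiplier term; dropping it would ignore how x shifts the inner player's feasible set.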

Comments:	15 pages, 1 figure, 2 tables, 6 Algorithms; Forthcoming AAMAS'22. arXiv admin note: text overlap with arXiv:2110.05192
Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Theoretical Economics (econ.TH)
Cite as:	arXiv:2203.14126 [cs.GT]
	(or arXiv:2203.14126v2 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2203.14126

Submission history

From: Denizalp Goktas
[v1] Sat, 26 Mar 2022 18:12:40 UTC (2,278 KB)
[v2] Wed, 13 Apr 2022 20:44:18 UTC (2,279 KB)
