Search | arXiv e-print repository

Approximating Nash Equilibria in General-Sum Games via Meta-Learning

Authors: David Sychrovský, Christopher Solinas, Revan MacQueen, Kevin Wang, James R. Wright, Nathan R. Sturtevant, Michael Bowling

Abstract: Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the problem of finding one in general-sum games is PPAD-complete, generally considered intractable. Regret minimization is an efficient framework for approximating Nas… ▽ More Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the problem of finding one in general-sum games is PPAD-complete, generally considered intractable. Regret minimization is an efficient framework for approximating Nash equilibria in two-player zero-sum games. However, in general-sum games, such algorithms are only guaranteed to converge to a coarse-correlated equilibrium (CCE), a solution concept where players can correlate their strategies. In this work, we use meta-learning to minimize the correlations in strategies produced by a regret minimizer. This encourages the regret minimizer to find strategies that are closer to a Nash equilibrium. The meta-learned regret minimizer is still guaranteed to converge to a CCE, but we give a bound on the distance to Nash equilibrium in terms of our meta-loss. We evaluate our approach in general-sum imperfect information games. Our algorithms provide significantly better approximations of Nash equilibria than state-of-the-art regret minimization techniques. △ Less

Submitted 26 April, 2025; originally announced April 2025.

arXiv:2310.11518 [pdf, other]

Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability

Authors: Revan MacQueen, James R. Wright

Abstract: Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself. Self-play is useful for generating large quantities of data for learning, but has the drawback that the agents the learner will face post-training may have dramatically different behavior than the learner came to expect by interacting with itself. For the spe… ▽ More Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself. Self-play is useful for generating large quantities of data for learning, but has the drawback that the agents the learner will face post-training may have dramatically different behavior than the learner came to expect by interacting with itself. For the special case of two-player constant-sum games, self-play that reaches Nash equilibrium is guaranteed to produce strategies that perform well against any post-training opponent; however, no such guarantee exists for multiplayer games. We show that in games that approximately decompose into a set of two-player constant-sum games (called constant-sum polymatrix games) where global $ε$-Nash equilibria are boundedly far from Nash equilibria in each subgame (called subgame stability), any no-external-regret algorithm that learns by self-play will produce a strategy with bounded vulnerability. For the first time, our results identify a structural property of multiplayer games that enable performance guarantees for the strategies produced by a broad class of self-play algorithms. We demonstrate our findings through experiments on Leduc poker. △ Less

Submitted 29 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: To appear at NeurIPS 2023

arXiv:2304.07187 [pdf, ps, other]

A Proof that Coarse Correlated Equilibrium Implies Nash Equilibrium in Two-Player Zero-Sum Games

Authors: Revan MacQueen

Abstract: We give a simple proof of the well-known result that the marginal strategies of a coarse correlated equilibrium form a Nash equilibrium in two-player zero-sum games. A corollary of this fact is that no-external-regret learning algorithms that converge to the set of coarse correlated equilibria will also converge to Nash equilibria in two-player zero-sum games. We show an approximate version: that… ▽ More We give a simple proof of the well-known result that the marginal strategies of a coarse correlated equilibrium form a Nash equilibrium in two-player zero-sum games. A corollary of this fact is that no-external-regret learning algorithms that converge to the set of coarse correlated equilibria will also converge to Nash equilibria in two-player zero-sum games. We show an approximate version: that $ε$-coarse correlated equilibria imply $2ε$-Nash equilibria. △ Less

Submitted 15 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

arXiv:2012.03587 [pdf, other]

doi 10.1063/5.0040839

Double-Layer Metasurface for Enhanced Photon Up-Conversion

Authors: Phillip Manley, Michele Segantini, Doguscan Ahiboz, Martin Hammerschmidt, Georgios Arnaoutakis, Rowan W. MacQueen, Sven Burger, Christiane Becker

Abstract: We present a double-layer dielectric metasurface obtained by stacking a silicon nanodisc array and a silicon photonic crystal slab with equal periodicity on top of each other. We focus on the investigation of electric near-field enhancement effects occurring at resonant excitation of the metasurface and study its optical properties numerically and experimentally. We find that the major difference… ▽ More We present a double-layer dielectric metasurface obtained by stacking a silicon nanodisc array and a silicon photonic crystal slab with equal periodicity on top of each other. We focus on the investigation of electric near-field enhancement effects occurring at resonant excitation of the metasurface and study its optical properties numerically and experimentally. We find that the major difference in multi-layer metasurfaces when compared to conventional single-layer structures appears to be in Rayleigh-Wood anomalies: they are split into multiple different modes which are themselves spectrally broadened. As a proof of concept we cover a double-layer metasurface with a lanthanide-doped up-conversion particle layer and study its interaction with a 1550 nm photoexcitation. We observe a 2.7-fold enhancemed up-conversion photoluminescence by using the stacked metasurface instead of a planar substrate, although only around 1% of the up-conversion material is exposed to enhanced near-fields. Two mechanisms are identified explaining this behavior: First, enhanced near-fields when exciting the metasurface resonantly, and second, light trapping by total internal reflection in the particle layer when the metasurface redirects light into high-angle diffraction orders. These results pave the way for low-threshold and, in particular, broadband photon up-conversion in future solar energy and biosensing applications. △ Less

Submitted 7 December, 2020; originally announced December 2020.

Comments: 14 pages, 6 figures

Journal ref: APL Photon. 6, 036103 (2021)

arXiv:2012.00817 [pdf, other]

Game-Theoretic Malware Detection

Authors: Revan MacQueen, Natalie Bombardieri, James R. Wright, Karim Ali

Abstract: Malware attacks are costly. To mitigate against such attacks, organizations deploy malware detection tools that help them detect and eventually resolve those threats. While running only the best available tool does not provide enough coverage of the potential attacks, running all available tools is prohibitively expensive in terms of financial cost and computing resources. Therefore, an organizati… ▽ More Malware attacks are costly. To mitigate against such attacks, organizations deploy malware detection tools that help them detect and eventually resolve those threats. While running only the best available tool does not provide enough coverage of the potential attacks, running all available tools is prohibitively expensive in terms of financial cost and computing resources. Therefore, an organization typically runs a set of tools that maximizes their coverage given a limited budget. However, how should an organization choose that set? Attackers are strategic, and will change their behavior to preferentially exploit the gaps left by a deterministic choice of tools. To avoid leaving such easily-exploited gaps, the defender must choose a random set. In this paper, we present an approach to compute an optimal randomization over size-bounded sets of available security analysis tools by modeling the relationship between attackers and security analysts as a leader-follower Stackelberg security game. We estimate the parameters of our model by combining the information from the VirusTotal dataset with the more detailed reports from the National Vulnerability Database. In an empirical comparison, our approach outperforms a set of natural baselines under a wide range of assumptions. △ Less

Submitted 7 January, 2022; v1 submitted 1 December, 2020; originally announced December 2020.

arXiv:2004.08562 [pdf]

doi 10.1021/acs.langmuir.0c01154

Tetracene ultrathin film growth on silicon

Authors: Jens Niederhausen, Hazem Aldahhak, Rowan W. MacQueen, Wolf Gero Schmidt, Uwe Gerstmann, Klaus Lips

Abstract: Inorganic-organic interfaces are important for enhancing the power conversion efficiency of silicon-based solar cells through singlet exciton fission (SF). We elucidated the structure of the first monolayers of tetracene (Tc), a SF molecule, on hydrogen-passivated Si(111) [H-Si(111)] and hydrogenated amorphous Si (a-Si:H) by combining near-edge X-ray absorption fine structure (NEXAFS) and X-ray ph… ▽ More Inorganic-organic interfaces are important for enhancing the power conversion efficiency of silicon-based solar cells through singlet exciton fission (SF). We elucidated the structure of the first monolayers of tetracene (Tc), a SF molecule, on hydrogen-passivated Si(111) [H-Si(111)] and hydrogenated amorphous Si (a-Si:H) by combining near-edge X-ray absorption fine structure (NEXAFS) and X-ray photoelectron spectroscopy (XPS) experiments with density functional theory (DFT) calculations. For samples grown at or below substrate temperatures of 265 K, the resulting ultrathin Tc films are dominated by almost upright-standing molecules. The molecular arrangement is very similar to the Tc bulk phase, with only slightly higher average angle between the conjugated molecular plane normal and the surface normal ($α$) around 77°. Judging from carbon K-edge X-ray absorption spectra, the orientation of the Tc molecules are almost identical when grown on H-Si(111) and a-Si:H substrates as well as for (sub)mono- to several-monolayer coverages. Annealing to room temperature, however, changes the film structure towards a smaller $α$ of about 63°. A detailed DFT-assisted analysis suggests that this structural transition is correlated with a lower packing density and requires a well-chosen amount of thermal energy. Therefore, we attribute the resulting structure to a distinct monolayer configuration that features less inclined, but still well-ordered molecules. The larger overlap with the substrate wavefunctions makes this arrangement attractive for an optimized interfacial electron transfer in SF-assisted silicon solar cells. △ Less

Submitted 18 April, 2020; originally announced April 2020.

Journal ref: Langmuir 2020, 36, 31, 9099-9113

Showing 1–6 of 6 results for author: MacQueen, R