Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

Bairamian, Daniel; Marcotte, Philippe; Romoff, Joshua; Robert, Gabriel; Nowrouzezahrai, Derek

arXiv:2311.17190 (cs)

[Submitted on 28 Nov 2023]

Abstract: Recent advances in Competitive Self-Play (CSP) have matched or even surpassed human-level performance in complex game environments such as Dota 2 and StarCraft II using Distributed Multi-Agent Reinforcement Learning (MARL). A core component of these methods is a pool of learning agents, consisting of the Main Agent, past versions of this agent, and Exploiter Agents, where Exploiter Agents learn counter-strategies to the Main Agents. A key drawback of these approaches is the large computational cost and wall-clock time required to train the system, which makes them impractical to deploy in highly iterative real-life settings such as video game production. In this paper, we propose the Minimax Exploiter, a game-theoretic approach to exploiting Main Agents that leverages knowledge of its opponents, leading to significant increases in data efficiency. We validate our approach in a diverse range of settings, including simple turn-based games, the Arcade Learning Environment, and For Honor, a modern video game. The Minimax Exploiter consistently outperforms strong baselines and demonstrates improved stability and data efficiency, yielding a robust CSP-MARL method that is both flexible and easy to deploy.
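The agent pool described in the abstract (a Main Agent, frozen past versions of it, and Exploiter Agents that train against the Main Agent) can be pictured with a minimal Python sketch. This is an illustration only, not the authors' implementation: the class names `Agent` and `AgentPool`, the method names, and the uniform opponent sampling are all assumptions made for the example, and the Minimax Exploiter's own training objective is not shown because the abstract does not specify it.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Agent:
    # Hypothetical placeholder for a policy; a real agent would hold network weights.
    name: str
    role: str  # "main", "past", or "exploiter"


@dataclass
class AgentPool:
    # Hypothetical container mirroring the three pool members named in the abstract.
    main: Agent
    past: list = field(default_factory=list)
    exploiters: list = field(default_factory=list)

    def snapshot_main(self) -> Agent:
        # Freeze a copy of the current Main Agent as a past version.
        frozen = Agent(name=f"{self.main.name}-v{len(self.past)}", role="past")
        self.past.append(frozen)
        return frozen

    def add_exploiter(self) -> Agent:
        # Register a new Exploiter Agent that will train against the Main Agent.
        exploiter = Agent(name=f"exploiter-{len(self.exploiters)}", role="exploiter")
        self.exploiters.append(exploiter)
        return exploiter

    def opponent_for_main(self, rng: random.Random) -> Agent:
        # Assumption: uniform sampling over the whole pool; real systems
        # typically use more elaborate matchmaking.
        candidates = [self.main] + self.past + self.exploiters
        return rng.choice(candidates)

    def opponent_for_exploiter(self) -> Agent:
        # Exploiters learn counter-strategies to the Main Agent, so they face it directly.
        return self.main


pool = AgentPool(main=Agent("main", "main"))
pool.snapshot_main()
pool.add_exploiter()
rng = random.Random(0)
print(pool.opponent_for_main(rng).role)
print(pool.opponent_for_exploiter().name)
```

In this sketch the Main Agent can be matched against itself, a past snapshot, or an exploiter, while each exploiter always plays the current Main Agent. That asymmetry is what makes the exploiter's sample efficiency matter for the overall training cost.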

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2311.17190 [cs.LG]
	(or arXiv:2311.17190v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.17190

Submission history

From: Daniel Bairamian
[v1] Tue, 28 Nov 2023 19:34:40 UTC (11,027 KB)
