MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula

Sun, Lijun; Chang, Yu-Cheng; Lyu, Chao; Lin, Chin-Teng; Shi, Yuhui

Computer Science > Multiagent Systems

arXiv:2307.14854 (cs)

[Submitted on 27 Jul 2023 (v1), last revised 5 Jun 2024 (this version, v2)]

Title:MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula

Authors:Lijun Sun, Yu-Cheng Chang, Chao Lyu, Chin-Teng Lin, Yuhui Shi

View PDF HTML (experimental)

Abstract:Multi-agent reinforcement learning (MARL) achieves encouraging performance in solving complex tasks. However, the safety of MARL policies is one critical concern that impedes their real-world applications. Popular multi-agent benchmarks focus on diverse tasks yet provide limited safety support. Therefore, this work proposes a safety-constrained multi-agent environment: MatrixWorld, based on the general pursuit-evasion game. Particularly, a safety-constrained multi-agent action execution model is proposed for the software implementation of safe multi-agent environments based on diverse safety definitions. It (1) extends the vertex conflict among homogeneous / cooperative agents to heterogeneous / adversarial settings, and (2) proposes three types of resolutions for each type of conflict, aiming at providing rational and unbiased feedback for safe MARL. Besides, MatrixWorld is also a lightweight co-evolution framework for the learning of pursuit tasks, evasion tasks, or both, where more pursuit-evasion variants can be designed based on different practical meanings of safety. As a brief survey, we review and analyze the co-evolution mechanism in the multi-agent setting, which clearly reveals its relationships with autocurricula, self-play, arms races, and adversarial learning. Thus, MatrixWorld can also serve as the first environment for autocurricula research, where ideas can be quickly verified and well understood.

Subjects:	Multiagent Systems (cs.MA)
Cite as:	arXiv:2307.14854 [cs.MA]
	(or arXiv:2307.14854v2 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2307.14854

Submission history

From: Lijun Sun [view email]
[v1] Thu, 27 Jul 2023 13:35:03 UTC (4,514 KB)
[v2] Wed, 5 Jun 2024 12:15:00 UTC (4,412 KB)

Computer Science > Multiagent Systems

Title:MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators