Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games

Liu, Ziyi; Guo, Xian; Fang, Yongchun

Computer Science > Multiagent Systems

arXiv:2205.15859 (cs)

This paper has been withdrawn by Ziyi Liu

[Submitted on 31 May 2022 (v1), last revised 4 Jan 2023 (this version, v3)]

Title:Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games

Authors:Ziyi Liu, Xian Guo, Yongchun Fang

No PDF available, click to view other formats

Abstract:While various multi-agent reinforcement learning methods have been proposed in cooperative settings, few works investigate how self-interested learning agents achieve mutual coordination in decentralized general-sum games and generalize pre-trained policies to non-cooperative opponents during execution. In this paper, we present Generalizable Risk-Sensitive Policy (GRSP). GRSP learns the distributions over agent's return and estimate a dynamic risk-seeking bonus to discover risky coordination strategies. Furthermore, to avoid overfitting to training opponents, GRSP learns an auxiliary opponent modeling task to infer opponents' types and dynamically alter corresponding strategies during execution. Empirically, agents trained via GRSP can achieve mutual coordination during training stably and avoid being exploited by non-cooperative opponents during execution. To the best of our knowledge, it is the first method to learn coordination strategies between agents both in iterated prisoner's dilemma (IPD) and iterated stag hunt (ISH) without shaping opponents or rewards, and firstly consider generalization during execution. Furthermore, we show that GRSP can be scaled to high-dimensional settings.

Comments:	the paper will be updated soon
Subjects:	Multiagent Systems (cs.MA)
Cite as:	arXiv:2205.15859 [cs.MA]
	(or arXiv:2205.15859v3 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2205.15859

Submission history

From: Ziyi Liu [view email]
[v1] Tue, 31 May 2022 15:09:50 UTC (12,082 KB)
[v2] Sat, 24 Sep 2022 02:43:55 UTC (19,687 KB)
[v3] Wed, 4 Jan 2023 02:52:27 UTC (1 KB) (withdrawn)

Computer Science > Multiagent Systems

Title:Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators