Double Distillation Network for Multi-Agent Reinforcement Learning

Zhou, Yang; Wang, Siying; Chen, Wenyu; Zhang, Ruoning; Zhao, Zhitong; Zhang, Zixuan

Computer Science > Multiagent Systems

arXiv:2502.03125v1 (cs)

[Submitted on 5 Feb 2025]

Title:Double Distillation Network for Multi-Agent Reinforcement Learning

Authors:Yang Zhou, Siying Wang, Wenyu Chen, Ruoning Zhang, Zhitong Zhao, Zixuan Zhang

View PDF HTML (experimental)

Abstract:Multi-agent reinforcement learning typically employs a centralized training-decentralized execution (CTDE) framework to alleviate the non-stationarity in environment. However, the partial observability during execution may lead to cumulative gap errors gathered by agents, impairing the training of effective collaborative policies. To overcome this challenge, we introduce the Double Distillation Network (DDN), which incorporates two distillation modules aimed at enhancing robust coordination and facilitating the collaboration process under constrained information. The external distillation module uses a global guiding network and a local policy network, employing distillation to reconcile the gap between global training and local execution. In addition, the internal distillation module introduces intrinsic rewards, drawn from state information, to enhance the exploration capabilities of agents. Extensive experiments demonstrate that DDN significantly improves performance across multiple scenarios.

Subjects:	Multiagent Systems (cs.MA); Machine Learning (cs.LG)
Cite as:	arXiv:2502.03125 [cs.MA]
	(or arXiv:2502.03125v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2502.03125

Submission history

From: Yang Zhou [view email]
[v1] Wed, 5 Feb 2025 12:31:55 UTC (865 KB)

Computer Science > Multiagent Systems

Title:Double Distillation Network for Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Double Distillation Network for Multi-Agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators