Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning

Chu, Xiangxiang; Ye, Hangjun

Computer Science > Artificial Intelligence

arXiv:1710.00336 (cs)

[Submitted on 1 Oct 2017 (v1), last revised 3 Oct 2017 (this version, v2)]

Abstract: Deep reinforcement learning for multi-agent cooperation and competition has recently attracted considerable attention. This paper focuses on cooperative multi-agent problems, using actor-critic methods under local-observation settings. Multi-agent deep deterministic policy gradient has obtained state-of-the-art results on some multi-agent games, but it does not scale well as the number of agents grows. To improve scalability, we propose a parameter-sharing deterministic policy gradient method with three neural-network-based variants: actor-critic sharing, actor sharing, and actor sharing with a partially shared critic. Benchmarks from rllab show that the proposed method has advantages in learning speed and memory efficiency, scales well with a growing number of agents, and can make full use of reward sharing and exchangeability where they are available.
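The memory argument behind the actor-sharing variant can be illustrated with a small sketch. This is not the authors' implementation: the network shape, the sizes, and the names init_actor, act and count_params are all hypothetical, and it omits the critic and any training. It only shows a single set of actor weights being applied to each agent's local observation, compared with keeping one actor per agent.

```python
import math
import random


def init_actor(obs_dim, hidden_dim, act_dim, rng):
    """Create weights for a tiny two-layer deterministic actor (hypothetical sizes)."""
    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {
        "w1": matrix(hidden_dim, obs_dim), "b1": [0.0] * hidden_dim,
        "w2": matrix(act_dim, hidden_dim), "b2": [0.0] * act_dim,
    }


def act(params, obs):
    """Deterministic policy: map one agent's local observation to actions in [-1, 1]."""
    hidden = [max(0.0, sum(w * o for w, o in zip(row, obs)) + b)
              for row, b in zip(params["w1"], params["b1"])]
    return [math.tanh(sum(w * h for w, h in zip(row, hidden)) + b)
            for row, b in zip(params["w2"], params["b2"])]


def count_params(params):
    """Count the scalar weights and biases in one actor."""
    total = 0
    for value in params.values():
        if value and isinstance(value[0], list):
            total += sum(len(row) for row in value)
        else:
            total += len(value)
    return total


rng = random.Random(0)
n_agents, obs_dim, hidden_dim, act_dim = 4, 6, 8, 2  # assumed toy sizes

# Actor sharing: a single parameter set is reused by every agent.
shared_actor = init_actor(obs_dim, hidden_dim, act_dim, rng)
# Baseline for comparison: one independent actor per agent.
independent_actors = [init_actor(obs_dim, hidden_dim, act_dim, rng)
                      for _ in range(n_agents)]

# Each agent still acts on its own local observation.
local_obs = [[rng.uniform(-1, 1) for _ in range(obs_dim)] for _ in range(n_agents)]
shared_actions = [act(shared_actor, obs) for obs in local_obs]

shared_count = count_params(shared_actor)
independent_count = sum(count_params(p) for p in independent_actors)
print(shared_count, independent_count)
```

In this toy setting the shared actor's parameter count stays fixed as agents are added, while the independent baseline grows linearly with the number of agents. How the paper's three variants share the critic is not specified in the abstract and is not modeled here.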

Comments:	12 pages, 6 figures
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1710.00336 [cs.AI]
	(or arXiv:1710.00336v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1710.00336

Submission history

From: Xiangxiang Chu
[v1] Sun, 1 Oct 2017 11:43:10 UTC (1,321 KB)
[v2] Tue, 3 Oct 2017 00:47:58 UTC (1,276 KB)
