TarMAC: Targeted Multi-Agent Communication

Das, Abhishek; Gervet, Théophile; Romoff, Joshua; Batra, Dhruv; Parikh, Devi; Rabbat, Michael; Pineau, Joelle

Computer Science > Machine Learning

arXiv:1810.11187v1 (cs)

[Submitted on 26 Oct 2018 (this version), latest version 22 Feb 2020 (v2)]

Title:TarMAC: Targeted Multi-Agent Communication

Authors:Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael Rabbat, Joelle Pineau

View PDF

Abstract:We explore a collaborative multi-agent reinforcement learning setting where a team of agents attempts to solve cooperative tasks in partially-observable environments. In this scenario, learning an effective communication protocol is key. We propose a communication architecture that allows for targeted communication, where agents learn both what messages to send and who to send them to, solely from downstream task-specific reward without any communication supervision. Additionally, we introduce a multi-stage communication approach where the agents co-ordinate via multiple rounds of communication before taking actions in the environment. We evaluate our approach on a diverse set of cooperative multi-agent tasks, of varying difficulties, with varying number of agents, in a variety of environments ranging from 2D grid layouts of shapes and simulated traffic junctions to complex 3D indoor environments. We demonstrate the benefits of targeted as well as multi-stage communication. Moreover, we show that the targeted communication strategies learned by agents are both interpretable and intuitive.

Comments:	10 pages, 4 figures, 4 tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:1810.11187 [cs.LG]
	(or arXiv:1810.11187v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.11187

Submission history

From: Abhishek Das [view email]
[v1] Fri, 26 Oct 2018 04:22:58 UTC (8,400 KB)
[v2] Sat, 22 Feb 2020 04:37:13 UTC (6,179 KB)

Computer Science > Machine Learning

Title:TarMAC: Targeted Multi-Agent Communication

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:TarMAC: Targeted Multi-Agent Communication

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators