Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning

Diddigi, Raghuram Bharadwaj; Danda, Sai Koti Reddy; J., Prabuchandran K.; Bhatnagar, Shalabh

arXiv:1905.02907 (cs)

[Submitted on 8 May 2019 (v1), last revised 12 Jul 2020 (this version, v2)]

Abstract: In cooperative stochastic games, multiple agents learn joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are imposed on the actions that the agents can jointly take. In such scenarios, the agents aim to learn joint actions that achieve the common goal (minimizing a specified cost function) while meeting the given constraints (specified via certain penalty functions). In this paper, we relax the constrained optimization problem by constructing the Lagrangian of the cost and penalty functions, and we propose a nested actor-critic approach to solve the relaxed problem. In this approach, an actor-critic scheme improves the policy for a given Lagrange parameter on a faster timescale, as in the classical actor-critic architecture. A meta actor-critic scheme then uses these faster-timescale policy updates to improve the Lagrange parameters on a slower timescale. Using the proposed nested actor-critic schemes, we develop three Nested Actor-Critic (N-AC) algorithms. Experiments on constrained cooperative tasks show the effectiveness of the proposed algorithms.
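
The two-timescale Lagrangian idea in the abstract can be illustrated with a minimal sketch. This is not one of the paper's N-AC algorithms: there is no environment, no critic, and no sampling. Exact gradients of a toy quadratic cost stand in for the actor-critic estimates, and every name and number here (the function two_timescale_lagrangian, the cost (x1 - 2)^2 + (x2 - 2)^2, the constraint x1 + x2 <= budget, the step sizes) is an assumption chosen for illustration. Two "agents" each control one scalar parameter, the parameters descend the Lagrangian with a larger step size, and the Lagrange multiplier ascends with a smaller one.

```python
def two_timescale_lagrangian(steps=5000, fast_lr=0.1, slow_lr=0.01, budget=2.0):
    """Toy primal-dual iteration on L(x, lam) = cost(x) + lam * (x1 + x2 - budget).

    Hypothetical stand-in for the nested scheme: exact gradients replace
    actor-critic estimates, and the problem is a two-variable quadratic.
    """
    x = [0.0, 0.0]   # one parameter per agent (assumed scalar "policies")
    lam = 0.0        # Lagrange multiplier, kept non-negative

    for _ in range(steps):
        # Constraint violation under the current joint parameters.
        violation = sum(x) - budget

        # Faster timescale: each agent descends the Lagrangian.
        # d/dx_i [ (x_i - 2)^2 + lam * (x1 + x2 - budget) ] = 2 (x_i - 2) + lam
        grads = [2.0 * (xi - 2.0) + lam for xi in x]
        x = [xi - fast_lr * g for xi, g in zip(x, grads)]

        # Slower timescale: the multiplier ascends on the violation,
        # projected back onto lam >= 0.
        lam = max(0.0, lam + slow_lr * violation)

    return x, lam


if __name__ == "__main__":
    x, lam = two_timescale_lagrangian()
    print("parameters:", x, "multiplier:", lam)
```

For this toy problem the constrained optimum can be worked out by hand: setting 2(x_i - 2) + lam = 0 with x1 + x2 = 2 gives x1 = x2 = 1 and lam = 2, and the iteration settles near that point. The step-size gap (0.1 versus 0.01) is what lets the parameters approximately track the best response to a slowly changing multiplier.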

Subjects:	Multiagent Systems (cs.MA)
Cite as:	arXiv:1905.02907 [cs.MA]
	(or arXiv:1905.02907v2 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.1905.02907

Submission history

From: Diddigi Raghuram Bharadwaj
[v1] Wed, 8 May 2019 04:48:08 UTC (182 KB)
[v2] Sun, 12 Jul 2020 08:39:31 UTC (238 KB)
