Approximate Linear Programming for Decentralized Policy Iteration in Cooperative Multi-agent Markov Decision Processes

Mandal, Lakshmi; Lakshminarayanan, Chandrashekar; Bhatnagar, Shalabh

arXiv:2311.11789 (cs)

[Submitted on 20 Nov 2023 (v1), last revised 29 Apr 2024 (this version, v2)]

Abstract: In this work, we consider a cooperative multi-agent Markov decision process (MDP) involving m agents. At each decision epoch, all m agents independently select actions in order to maximize a common long-term objective. In policy iteration for the multi-agent setup, the number of joint actions grows exponentially with the number of agents, incurring huge computational costs. Recent works therefore consider decentralized policy improvement, where each agent improves its decisions unilaterally, assuming that the decisions of the other agents are fixed. However, the literature assumes exact value functions, which are computationally expensive to obtain for a large number of agents with high-dimensional state-action spaces. We therefore propose approximate decentralized policy iteration algorithms that use approximate linear programming with function approximation to compute the approximate value function for decentralized policy improvement. Further, we consider both finite-horizon and infinite-horizon discounted cooperative multi-agent MDPs and propose suitable algorithms for each case. Moreover, we provide theoretical guarantees for our algorithms and demonstrate their advantages over existing state-of-the-art algorithms in the literature.
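The decentralized policy improvement step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the toy dynamics in `step`, the constants, and all function names are hypothetical, and the value function is computed by exact iterative policy evaluation, whereas the paper replaces that step with approximate linear programming and function approximation. The sketch only shows how each agent searches over its own actions while the other agents' actions stay fixed, so each improvement step scans the per-agent action set rather than the exponentially large joint action set.

```python
GAMMA = 0.9          # discount factor (assumed for this toy example)
STATES = [0, 1, 2]   # hypothetical state space
ACTIONS = [0, 1]     # per-agent action set
M = 2                # number of agents


def step(s, joint):
    """Hypothetical toy dynamics: deterministic next state, shared reward."""
    s_next = (s + sum(joint)) % len(STATES)
    reward = 1.0 if s_next == 0 else 0.0
    return s_next, reward


def evaluate(policy, tol=1e-10):
    """Iterative policy evaluation for a deterministic joint policy.

    This exact evaluation stands in for the approximate value function
    that the paper obtains via approximate linear programming.
    """
    v = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            s_next, r = step(s, policy[s])
            new = r + GAMMA * v[s_next]
            delta = max(delta, abs(new - v[s]))
            v[s] = new
        if delta < tol:
            return v


def improve_agent(policy, v, i):
    """Agent i improves unilaterally; the other agents' actions are fixed."""
    new_policy = dict(policy)
    for s in STATES:
        def q(a_i):
            joint = list(policy[s])
            joint[i] = a_i
            s_next, r = step(s, tuple(joint))
            return r + GAMMA * v[s_next]

        best = policy[s][i]
        for a in ACTIONS:
            # Switch only on strict improvement to avoid cycling on ties.
            if q(a) > q(best) + 1e-9:
                best = a
        joint = list(policy[s])
        joint[i] = best
        new_policy[s] = tuple(joint)
    return new_policy


def decentralized_policy_iteration(policy, max_sweeps=50):
    """Sweep over agents until no agent changes its action in any state."""
    for _ in range(max_sweeps):
        old = policy
        for i in range(M):
            v = evaluate(policy)
            policy = improve_agent(policy, v, i)
        if policy == old:
            break
    return policy, evaluate(policy)


if __name__ == "__main__":
    init = {s: (0, 0) for s in STATES}
    final_policy, final_values = decentralized_policy_iteration(init)
    print(final_policy, final_values)
```

Each call to `improve_agent` compares only the per-agent actions in every state, so one sweep over all m agents costs on the order of m times the per-agent action count per state, instead of the full joint action count. In general, unilateral improvement of this kind can stop at a policy that no single agent can improve on its own, which need not be globally optimal.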

Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
Cite as:	arXiv:2311.11789 [cs.LG]
	(or arXiv:2311.11789v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.11789

Submission history

From: Lakshmi Mandal
[v1] Mon, 20 Nov 2023 14:14:13 UTC (273 KB)
[v2] Mon, 29 Apr 2024 18:05:39 UTC (332 KB)
