Anytime Planning for Decentralized POMDPs using Expectation Maximization

Kumar, Akshat; Zilberstein, Shlomo

Computer Science > Artificial Intelligence

arXiv:1203.3490 (cs)

[Submitted on 15 Mar 2012]

Title:Anytime Planning for Decentralized POMDPs using Expectation Maximization

Authors:Akshat Kumar, Shlomo Zilberstein

View PDF

Abstract:Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While fnite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infnite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing agent policies. We present a promising new class of algorithms for the infnite-horizon case, which recasts the optimization problem as inference in a mixture of DBNs. An attractive feature of this approach is the straightforward adoption of existing inference techniques in DBNs for solving DEC-POMDPs and supporting richer representations such as factored or continuous states and actions. We also derive the Expectation Maximization (EM) algorithm to optimize the joint policy represented as DBNs. Experiments on benchmark domains show that EM compares favorably against the state-of-the-art solvers.

Comments:	Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)
Subjects:	Artificial Intelligence (cs.AI)
Report number:	UAI-P-2010-PG-294-301
Cite as:	arXiv:1203.3490 [cs.AI]
	(or arXiv:1203.3490v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1203.3490

Submission history

From: Akshat Kumar [view email] [via AUAI proxy]
[v1] Thu, 15 Mar 2012 11:17:56 UTC (547 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2012-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Akshat Kumar
Shlomo Zilberstein

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Anytime Planning for Decentralized POMDPs using Expectation Maximization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Anytime Planning for Decentralized POMDPs using Expectation Maximization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators