On Bellman's Optimality Principle for zs-POSGs

Buffet, Olivier; Dibangoye, Jilles; Delage, Aurélien; Saffidine, Abdallah; Thomas, Vincent

arXiv:2006.16395 (cs)

[Submitted on 29 Jun 2020 (v1), last revised 15 Nov 2022 (this version, v2)]

Abstract: Many non-trivial sequential decision-making problems are efficiently solved by relying on Bellman's optimality principle, i.e., exploiting the fact that sub-problems are nested recursively within the original problem. Here we show how it can apply to (infinite horizon) 2-player zero-sum partially observable stochastic games (zs-POSGs) by (i) taking a central planner's viewpoint, which can only reason on a sufficient statistic called occupancy state, and (ii) turning such problems into zero-sum occupancy Markov games (zs-OMGs). Then, exploiting the Lipschitz-continuity of the value function in occupancy space, one can derive a version of the HSVI algorithm (Heuristic Search Value Iteration) that provably finds an $\epsilon$-Nash equilibrium in finite time.
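As a rough illustration of why Lipschitz-continuity helps a point-based search such as HSVI, the sketch below bounds an unknown value function from a handful of evaluated points. It is generic Lipschitz interpolation, not the paper's bound operators, which the abstract does not give. The function names, the L1 norm, the constant `lam`, and the toy two-dimensional "occupancy" vectors are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple

# A stored point: (occupancy vector, value estimate at that vector).
# Hypothetical representation chosen for this sketch only.
Point = Tuple[Sequence[float], float]


def l1_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """L1 distance between two vectors of equal length."""
    return sum(abs(a - b) for a, b in zip(p, q))


def lipschitz_upper(x: Sequence[float], points: List[Point], lam: float) -> float:
    """Tightest upper bound at x implied by lam-Lipschitz continuity."""
    return min(v + lam * l1_distance(x, p) for p, v in points)


def lipschitz_lower(x: Sequence[float], points: List[Point], lam: float) -> float:
    """Tightest lower bound at x implied by lam-Lipschitz continuity."""
    return max(v - lam * l1_distance(x, p) for p, v in points)


# Toy data: two evaluated points and an assumed Lipschitz constant of 1.
points: List[Point] = [([1.0, 0.0], 2.0), ([0.0, 1.0], 0.0)]
lam = 1.0
query = [0.5, 0.5]

upper = lipschitz_upper(query, points, lam)
lower = lipschitz_lower(query, points, lam)
gap = upper - lower  # an HSVI-style search stops once such a gap is small enough
```

In a search of this kind, the upper and lower bounds would normally be kept as two separate point sets that are refined along explored trajectories; the single shared set here only keeps the example short.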

Comments:	18 pages, 0 figures, 1 algorithm
Subjects:	Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
ACM classes:	I.2.8
Cite as:	arXiv:2006.16395 [cs.AI]
	(or arXiv:2006.16395v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.16395

Submission history

From: Olivier Buffet
[v1] Mon, 29 Jun 2020 21:23:10 UTC (29 KB)
[v2] Tue, 15 Nov 2022 14:20:31 UTC (31 KB)
