Ergodic Unobservable MDPs: Decidability of Approximation

Chatterjee, Krishnendu; Lurie, David; Saona, Raimundo; Ziliotto, Bruno

Mathematics > Optimization and Control

arXiv:2405.12583 (math)

[Submitted on 21 May 2024]

Title:Ergodic Unobservable MDPs: Decidability of Approximation

Authors:Krishnendu Chatterjee, David Lurie, Raimundo Saona, Bruno Ziliotto

View PDF HTML (experimental)

Abstract:Unobservable Markov decision processes (UMDPs) serve as a prominent mathematical framework for modeling sequential decision-making problems. A key aspect in computational analysis is the consideration of decidability, which concerns the existence of algorithms. In general, the computation of the exact and approximated values is undecidable for UMDPs with the long-run average objective. Building on matrix product theory and ergodic properties, we introduce a novel subclass of UMDPs, termed ergodic UMDPs. Our main result demonstrates that approximating the value within this subclass is decidable. However, we show that the exact problem remains undecidable. Finally, we discuss the primary challenges of extending these results to partially observable Markov decision processes.

Subjects:	Optimization and Control (math.OC); Computational Complexity (cs.CC)
MSC classes:	90C40, 49M25, 90C59, 91A68, 68W25
Cite as:	arXiv:2405.12583 [math.OC]
	(or arXiv:2405.12583v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2405.12583

Submission history

From: Bruno Ziliotto [view email]
[v1] Tue, 21 May 2024 08:27:21 UTC (22 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math.OC

< prev | next >

new | recent | 2024-05

Change to browse by:

cs
cs.CC
math

References & Citations

export BibTeX citation

Mathematics > Optimization and Control

Title:Ergodic Unobservable MDPs: Decidability of Approximation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Ergodic Unobservable MDPs: Decidability of Approximation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators