When Is Partially Observable Reinforcement Learning Not Scary?

Liu, Qinghua; Chung, Alan; Szepesvári, Csaba; Jin, Chi

arXiv:2204.08967 (cs)

[Submitted on 19 Apr 2022 (v1), last revised 24 May 2022 (this version, v2)]

Abstract: Applications of Reinforcement Learning (RL), in which agents learn to make a sequence of decisions despite lacking complete information about the latent states of the controlled system, that is, they act under partial observability of the states, are ubiquitous. Partially observable RL can be notoriously difficult -- well-known information-theoretic results show that learning partially observable Markov decision processes (POMDPs) requires an exponential number of samples in the worst case. Yet, this does not rule out the existence of large subclasses of POMDPs over which learning is tractable.
In this paper we identify such a subclass, which we call weakly revealing POMDPs. This family rules out the pathological instances of POMDPs where observations are uninformative to a degree that makes learning hard. We prove that for weakly revealing POMDPs, a simple algorithm combining optimism and Maximum Likelihood Estimation (MLE) is sufficient to guarantee polynomial sample complexity. To the best of our knowledge, this is the first provably sample-efficient result for learning from interactions in overcomplete POMDPs, where the number of latent states can be larger than the number of observations.
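To make "uninformative observations" concrete, here is a minimal sketch. It is not the paper's formal definition of weakly revealing, which the abstract does not state. The two emission matrices, the beliefs, and the helper names are hypothetical and chosen only for illustration. The sketch compares the observation distributions induced by two different latent-state beliefs: when they coincide, no amount of observation data can tell the beliefs apart.

```python
def observation_distribution(emission, belief):
    """Return P(o) = sum_s emission[o][s] * belief[s] for each observation o."""
    return [sum(row[s] * belief[s] for s in range(len(belief))) for row in emission]


def l1_distance(p, q):
    """L1 distance between two distributions over the same finite set."""
    return sum(abs(a - b) for a, b in zip(p, q))


# Hypothetical emission matrices; entry [o][s] is P(observation o | latent state s).
uninformative = [[0.5, 0.5],
                 [0.5, 0.5]]  # both states emit identically
informative = [[0.9, 0.2],
               [0.1, 0.8]]    # states emit differently

# Two beliefs that place all mass on different latent states.
belief_a = [1.0, 0.0]
belief_b = [0.0, 1.0]

gap_uninformative = l1_distance(
    observation_distribution(uninformative, belief_a),
    observation_distribution(uninformative, belief_b),
)
gap_informative = l1_distance(
    observation_distribution(informative, belief_a),
    observation_distribution(informative, belief_b),
)

print("uninformative gap:", gap_uninformative)
print("informative gap:", gap_informative)
```

A zero gap for the first matrix means the two latent states cannot be distinguished from observations, which is the kind of pathological instance the abstract describes. A strictly positive gap for the second means observations carry some information about the latent state.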

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Machine Learning (stat.ML)
Cite as:	arXiv:2204.08967 [cs.LG]
	(or arXiv:2204.08967v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.08967

Submission history

From: Qinghua Liu
[v1] Tue, 19 Apr 2022 16:08:28 UTC (546 KB)
[v2] Tue, 24 May 2022 18:53:45 UTC (565 KB)
