Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

Zhang, Weitong; He, Jiafan; Zhou, Dongruo; Zhang, Amy; Gu, Quanquan

Computer Science > Machine Learning

arXiv:2106.11935 (cs)

[Submitted on 22 Jun 2021 (v1), last revised 14 Feb 2024 (this version, v2)]

Title:Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

Authors:Weitong Zhang, Jiafan He, Dongruo Zhou, Amy Zhang, Quanquan Gu

View PDF HTML (experimental)

Abstract:The success of deep reinforcement learning (DRL) lies in its ability to learn a representation that is well-suited for the exploration and exploitation task. To understand how the choice of representation can improve the efficiency of reinforcement learning (RL), we study representation selection for a class of low-rank Markov Decision Processes (MDPs) where the transition kernel can be represented in a bilinear form. We propose an efficient algorithm, called ReLEX, for representation learning in both online and offline RL. Specifically, we show that the online version of ReLEX, called ReLEX-UCB, always performs no worse than the state-of-the-art algorithm without representation selection, and achieves a strictly better constant regret if the representation function class has a "coverage" property over the entire state-action space. For the offline counterpart, ReLEX-LCB, we show that the algorithm can find the optimal policy if the representation class can cover the state-action space and achieves gap-dependent sample complexity. This is the first result with constant sample complexity for representation learning in offline RL.

Comments:	32 pages, 2 figures, 7 tables, In UAI 2023
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2106.11935 [cs.LG]
	(or arXiv:2106.11935v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.11935

Submission history

From: Weitong Zhang [view email]
[v1] Tue, 22 Jun 2021 17:16:50 UTC (29 KB)
[v2] Wed, 14 Feb 2024 07:05:06 UTC (353 KB)

Computer Science > Machine Learning

Title:Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators