Provable General Function Class Representation Learning in Multitask Bandits and MDPs

Lu, Rui; Zhao, Andrew; Du, Simon S.; Huang, Gao

arXiv:2205.15701 (cs)

[Submitted on 31 May 2022 (v1), last revised 21 Oct 2022 (this version, v3)]

Abstract: While multitask representation learning has become a popular approach in reinforcement learning (RL) to boost sample efficiency, the theoretical understanding of why and how it works is still limited. Most previous analytical works assume that the representation function is either already known to the agent or drawn from a linear function class, because analyzing general function class representations encounters non-trivial technical obstacles such as establishing generalization guarantees and formulating confidence bounds in an abstract function space. However, linear-case analysis relies heavily on properties specific to the linear function class, while real-world practice usually adopts general non-linear representation functions such as neural networks, which significantly limits the applicability of such analysis. In this work, we extend the analysis to general function class representations. Specifically, we consider an agent playing $M$ contextual bandits (or MDPs) concurrently and extracting a shared representation function $\phi$ from a specific function class $\Phi$ using our proposed Generalized Functional Upper Confidence Bound algorithm (GFUCB). We theoretically validate the benefit of multitask representation learning within a general function class for bandits and linear MDPs for the first time. Lastly, we conduct experiments to demonstrate the effectiveness of our algorithm with neural network representations.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.15701 [cs.LG]
	(or arXiv:2205.15701v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.15701

Submission history

From: Rui Lu
[v1] Tue, 31 May 2022 11:36:42 UTC (216 KB)
[v2] Thu, 6 Oct 2022 10:53:08 UTC (293 KB)
[v3] Fri, 21 Oct 2022 07:37:58 UTC (218 KB)
