On Online Learning in Kernelized Markov Decision Processes

Chowdhury, Sayak Ray; Gopalan, Aditya

Computer Science > Machine Learning

arXiv:1911.01871 (cs)

[Submitted on 4 Nov 2019]

Title:On Online Learning in Kernelized Markov Decision Processes

Authors:Sayak Ray Chowdhury, Aditya Gopalan

View PDF

Abstract:We develop algorithms with low regret for learning episodic Markov decision processes based on kernel approximation techniques. The algorithms are based on both the Upper Confidence Bound (UCB) as well as Posterior or Thompson Sampling (PSRL) philosophies, and work in the general setting of continuous state and action spaces when the true unknown transition dynamics are assumed to have smoothness induced by an appropriate Reproducing Kernel Hilbert Space (RKHS).

Comments:	arXiv admin note: text overlap with arXiv:1805.08052
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.01871 [cs.LG]
	(or arXiv:1911.01871v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.01871

Submission history

From: Sayak Ray Chowdhury [view email]
[v1] Mon, 4 Nov 2019 05:17:28 UTC (80 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-11

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sayak Ray Chowdhury
Aditya Gopalan

export BibTeX citation

Computer Science > Machine Learning

Title:On Online Learning in Kernelized Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Online Learning in Kernelized Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators