Learning Multiple Markov Chains via Adaptive Allocation

Talebi, Mohammad Sadegh; Maillard, Odalric-Ambrym

Computer Science > Machine Learning

arXiv:1905.11128 (cs)

[Submitted on 27 May 2019 (v1), last revised 13 Nov 2019 (this version, v2)]

Title:Learning Multiple Markov Chains via Adaptive Allocation

Authors:Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

View PDF

Abstract:We study the problem of learning the transition matrices of a set of Markov chains from a single stream of observations on each chain. We assume that the Markov chains are ergodic but otherwise unknown. The learner can sample Markov chains sequentially to observe their states. The goal of the learner is to sequentially select various chains to learn transition matrices uniformly well with respect to some loss function. We introduce a notion of loss that naturally extends the squared loss for learning distributions to the case of Markov chains, and further characterize the notion of being \emph{uniformly good} in all problem instances. We present a novel learning algorithm that efficiently balances \emph{exploration} and \emph{exploitation} intrinsic to this problem, without any prior knowledge of the chains. We provide finite-sample PAC-type guarantees on the performance of the algorithm. Further, we show that our algorithm asymptotically attains an optimal loss.

Comments:	Accepted to NeurIPS 2019
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.11128 [cs.LG]
	(or arXiv:1905.11128v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.11128

Submission history

From: M. Sadegh Talebi [view email]
[v1] Mon, 27 May 2019 11:25:08 UTC (46 KB)
[v2] Wed, 13 Nov 2019 11:04:29 UTC (50 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

M. Sadegh Talebi
Odalric-Ambrym Maillard

export BibTeX citation

Computer Science > Machine Learning

Title:Learning Multiple Markov Chains via Adaptive Allocation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Multiple Markov Chains via Adaptive Allocation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators