Sequential Monte Carlo Bandits

Cherkassky, Michael; Bornn, Luke

arXiv:1310.1404 (stat)

[Submitted on 4 Oct 2013]

Abstract: In this paper we propose a flexible and efficient framework for handling multi-armed bandits, combining sequential Monte Carlo algorithms with hierarchical Bayesian modeling techniques. The framework naturally encompasses restless bandits, contextual bandits, and other bandit variants under a single inferential model. Despite the model's generality, we propose efficient Monte Carlo algorithms to make inference scalable, based on recent developments in sequential Monte Carlo methods. Through two simulation studies, the framework is shown to outperform other empirical methods, while also naturally scaling to more complex problems with which existing approaches cannot cope. Additionally, we successfully apply our framework to online video-based advertising recommendation, and show its increased efficacy compared to current state-of-the-art bandit algorithms.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:1310.1404 [stat.ML]
	(or arXiv:1310.1404v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1310.1404

Submission history

From: Luke Bornn
[v1] Fri, 4 Oct 2013 20:19:56 UTC (1,425 KB)
