The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates

Reeve, Henry WJ; Mellor, Joe; Brown, Gavin

arXiv:1803.00316 (cs)

[Submitted on 1 Mar 2018]

Abstract: In this paper we propose and explore the k-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates. We focus on a setting where the covariates are supported on a metric space of low intrinsic dimension, such as a manifold embedded within a high-dimensional ambient feature space. The algorithm is conceptually simple and straightforward to implement. The k-Nearest Neighbour UCB algorithm does not require prior knowledge of either the intrinsic dimension of the marginal distribution or the time horizon. We prove a regret bound for the k-Nearest Neighbour UCB algorithm which is minimax optimal up to logarithmic factors. In particular, the algorithm automatically takes advantage of both low intrinsic dimensionality of the marginal distribution over the covariates and low noise in the data, expressed as a margin condition. In addition, focusing on the case of bounded rewards, we give corresponding regret bounds for the k-Nearest Neighbour KL-UCB algorithm, which is an analogue of the KL-UCB algorithm adapted to the setting of multi-armed bandits with covariates. Finally, we present empirical results which demonstrate the ability of both the k-Nearest Neighbour UCB and k-Nearest Neighbour KL-UCB algorithms to take advantage of situations where the data is supported on an unknown sub-manifold of a high-dimensional feature space.
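
The abstract does not spell out the index rule, so the following is only a rough sketch of how a nearest-neighbour optimistic index of this general kind might be computed. The function names (`knn_ucb_index`, `choose_arm`), the constants `theta` and `phi`, and the specific form of the uncertainty term are illustrative assumptions and are not taken from the abstract; the paper itself defines the actual algorithm. The sketch picks, for each arm, the number of neighbours k that minimises an uncertainty term balancing sample size against neighbour distance, which is one plausible way to avoid fixing k from a known dimension or time horizon.

```python
import math


def knn_ucb_index(x, history, t, theta=1.0, phi=1.0):
    """Optimistic index for one arm at covariate x (illustrative sketch).

    history: list of (covariate, reward) pairs observed for this arm.
    theta, phi: hypothetical tuning constants (assumptions, not from the source).
    Returns +inf if the arm has never been played, forcing exploration.
    """
    if not history:
        return math.inf
    # Sort this arm's past observations by Euclidean distance to x.
    by_dist = sorted((math.dist(x, z), r) for z, r in history)
    best_u, best_mean, running = math.inf, 0.0, 0.0
    for k, (dist_k, reward) in enumerate(by_dist, start=1):
        running += reward
        # Assumed uncertainty: a term shrinking with k plus a term growing
        # with the distance to the k-th nearest neighbour.
        u = math.sqrt(theta * math.log(t + 1) / k) + phi * dist_k
        if u < best_u:
            best_u, best_mean = u, running / k
    # Mean reward over the chosen neighbours plus the uncertainty bonus.
    return best_mean + best_u


def choose_arm(x, histories, t):
    """Return the position of the arm with the largest index at covariate x."""
    indices = [knn_ucb_index(x, h, t) for h in histories]
    return max(range(len(histories)), key=indices.__getitem__)
```

In use, `histories` would hold one list of `(covariate, reward)` pairs per arm, and the chosen arm's list would be extended with the new observation after each round.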

Comments:	To be presented at ALT 2018
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1803.00316 [cs.LG]
	(or arXiv:1803.00316v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1803.00316
Journal reference:	Algorithmic Learning Theory 2018

Submission history

From: Henry WJ Reeve
[v1] Thu, 1 Mar 2018 11:41:13 UTC (289 KB)
