Best Arm Identification for Contaminated Bandits

Altschuler, Jason; Brunel, Victor-Emmanuel; Malek, Alan

Mathematics > Statistics Theory

arXiv:1802.09514v2 (math)

[Submitted on 26 Feb 2018 (v1), revised 8 Apr 2018 (this version, v2), latest version 15 May 2019 (v5)]

Title: Best Arm Identification for Contaminated Bandits

Authors: Jason Altschuler, Victor-Emmanuel Brunel, Alan Malek


Abstract: This paper studies active learning in the context of robust statistics. Specifically, we propose the Contaminated Best Arm Identification variant of the multi-armed bandit problem, in which every arm pull has probability $\varepsilon$ of generating a sample from an arbitrary \emph{contamination} distribution instead of the \emph{true} underlying distribution. The goal is to identify the best (or approximately best) true distribution with high probability, with a secondary goal of providing guarantees on the quality of that arm's underlying distribution. It is simple to see that in this contamination model there are no consistent estimators for statistics (e.g., the median) of the underlying distribution, and that even with infinite samples, statistics can be estimated only up to some unavoidable bias. We present tight, non-asymptotic sample complexity bounds for estimating the first two robust moments (median and median absolute deviation) with high probability. We then show how to use these estimation guarantees algorithmically for our problem by adapting Best Arm Identification algorithms from the classical multi-armed bandit literature. We give matching upper and lower bounds (up to a small logarithmic factor) on these algorithms' sample complexities. These results suggest an inherent robustness of classical Best Arm Identification algorithms.
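The unavoidable bias mentioned in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes a unit-variance Gaussian as the true distribution and a point mass far to the right as the contamination, and all function names are hypothetical. Under those assumptions, the median of the contaminated samples converges to the $\frac{1}{2(1-\varepsilon)}$ quantile of the true distribution rather than to its median, so more samples do not remove the shift.

```python
import random
from statistics import NormalDist, median


def pull_contaminated_arm(rng, true_mean, eps, contamination_value):
    """One arm pull: with probability eps return the contamination value,
    otherwise a sample from the (assumed) true distribution N(true_mean, 1)."""
    if rng.random() < eps:
        return contamination_value
    return rng.gauss(true_mean, 1.0)


def contaminated_median(n, true_mean, eps, contamination_value, seed=0):
    """Empirical median of n contaminated pulls (hypothetical helper)."""
    rng = random.Random(seed)
    samples = [
        pull_contaminated_arm(rng, true_mean, eps, contamination_value)
        for _ in range(n)
    ]
    return median(samples)


def limiting_median(true_mean, eps):
    """Median of the mixture when all contamination sits far to the right.

    Below the contamination point the mixture CDF is (1 - eps) * F(x), so the
    median solves F(x) = 1 / (2 * (1 - eps)). Requires eps < 1/2.
    """
    return NormalDist(true_mean, 1.0).inv_cdf(0.5 / (1.0 - eps))
```

Comparing `contaminated_median(200_000, 0.0, 0.2, 1e6)` with `limiting_median(0.0, 0.2)` shows the two agreeing closely with each other while both sit away from the true median of 0. This particular contamination is only one choice an adversary could make; it is used here because its limiting median has a closed form.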

Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1802.09514 [math.ST]
(or arXiv:1802.09514v2 [math.ST] for this version)
https://doi.org/10.48550/arXiv.1802.09514

Submission history

From: Jason Altschuler
[v1] Mon, 26 Feb 2018 18:59:30 UTC (46 KB)
[v2] Sun, 8 Apr 2018 22:23:06 UTC (48 KB)
[v3] Tue, 19 Jun 2018 00:15:37 UTC (47 KB)
[v4] Fri, 19 Oct 2018 15:31:29 UTC (49 KB)
[v5] Wed, 15 May 2019 15:32:00 UTC (50 KB)
