Best Arm Identification with Minimal Regret

Yang, Junwen; Tan, Vincent Y. F.; Jin, Tianyuan

arXiv:2409.18909 (cs)

[Submitted on 27 Sep 2024]

Abstract: Motivated by real-world applications that necessitate responsible experimentation, we introduce the problem of best arm identification (BAI) with minimal regret. This innovative variant of the multi-armed bandit problem elegantly amalgamates two of its most ubiquitous objectives: regret minimization and BAI. More precisely, the agent's goal is to identify the best arm with a prescribed confidence level $\delta$, while minimizing the cumulative regret up to the stopping time. Focusing on single-parameter exponential families of distributions, we leverage information-theoretic techniques to establish an instance-dependent lower bound on the expected cumulative regret. Moreover, we present an intriguing impossibility result that underscores the tension between cumulative regret and sample complexity in fixed-confidence BAI. Complementarily, we design and analyze the Double KL-UCB algorithm, which achieves asymptotic optimality as the confidence level tends to zero. Notably, this algorithm employs two distinct confidence bounds to guide arm selection in a randomized manner. Our findings elucidate a fresh perspective on the inherent connections between regret minimization and BAI.
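
The abstract does not spell out the confidence bounds or the sampling rule of Double KL-UCB, so the following is only a minimal sketch of the generic ingredients it names, not the authors' algorithm. It assumes Bernoulli rewards (one single-parameter exponential family) and a hypothetical exploration threshold of $\log(1/\delta)$; the function names are illustrative. It shows how KL-based upper and lower confidence bounds are commonly computed by bisection, and how cumulative regret is tallied from per-arm pull counts.

```python
import math


def bernoulli_kl(p, q, eps=1e-12):
    """KL divergence between Bernoulli(p) and Bernoulli(q), clipped for stability."""
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def kl_upper_bound(mean, pulls, threshold, iters=50):
    """Approximate the largest q >= mean with pulls * KL(mean, q) <= threshold."""
    lo, hi = mean, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if pulls * bernoulli_kl(mean, mid) <= threshold:
            lo = mid  # mid is still inside the confidence region
        else:
            hi = mid
    return lo


def kl_lower_bound(mean, pulls, threshold, iters=50):
    """Approximate the smallest q <= mean with pulls * KL(mean, q) <= threshold."""
    lo, hi = 0.0, mean
    for _ in range(iters):
        mid = (lo + hi) / 2
        if pulls * bernoulli_kl(mean, mid) <= threshold:
            hi = mid  # mid is still inside the confidence region
        else:
            lo = mid
    return hi


def cumulative_regret(true_means, pull_counts):
    """Sum over arms of (gap to the best mean) * (number of pulls)."""
    best = max(true_means)
    return sum((best - m) * n for m, n in zip(true_means, pull_counts))


if __name__ == "__main__":
    delta = 0.05                      # assumed confidence parameter
    threshold = math.log(1 / delta)   # hypothetical threshold, not from the paper
    print(kl_lower_bound(0.5, 100, threshold), kl_upper_bound(0.5, 100, threshold))
    print(cumulative_regret([0.5, 0.4, 0.2], [100, 10, 5]))
```

A smaller $\delta$ raises the threshold and widens both bounds, so more pulls are needed before the bounds of different arms separate; each of those pulls on a suboptimal arm adds its gap to the regret. This is one way to see the tension between regret and sample complexity that the abstract describes.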

Comments:	Preprint
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:2409.18909 [cs.LG]
	(or arXiv:2409.18909v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.18909

Submission history

From: Junwen Yang
[v1] Fri, 27 Sep 2024 16:46:02 UTC (42 KB)
