Multi-Agent Best Arm Identification in Stochastic Linear Bandits

Agrawal, Sanjana; Blanco, Saúl A.

Computer Science > Machine Learning

arXiv:2411.13690 (cs)

[Submitted on 20 Nov 2024 (v1), last revised 24 May 2025 (this version, v2)]

Title:Multi-Agent Best Arm Identification in Stochastic Linear Bandits

Authors:Sanjana Agrawal, Saúl A. Blanco

View PDF HTML (experimental)

Abstract:We study the problem of collaborative best-arm identification in stochastic linear bandits under a fixed-budget scenario. In our learning model, we first consider multiple agents connected through a star network, interacting with a linear bandit instance in parallel. We then extend our analysis to arbitrary network topologies. The objective of the agents is to collaboratively identify the best arm of the given bandit instance with the help of a central server while minimizing the probability of error in best arm estimation. To this end, we propose two algorithms, MaLinBAI-Star and MaLinBAI-Gen for star networks and networks with arbitrary structure, respectively. Both algorithms utilize the technique of G-optimal design along with the successive elimination based strategy where agents share their knowledge through a central server at each communication round. We demonstrate, both theoretically and empirically, that our algorithms achieve exponentially decaying probability of error in the allocated time budget. Furthermore, experimental results on both synthetic and real-world data validate the effectiveness of our algorithms over the state-of-the art existing multi-agent algorithms.

Comments:	Updated algorithms, corrected proofs, fixed typos
Subjects:	Machine Learning (cs.LG)
MSC classes:	93E35
ACM classes:	I.2.6
Cite as:	arXiv:2411.13690 [cs.LG]
	(or arXiv:2411.13690v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.13690

Submission history

From: Saúl Blanco [view email]
[v1] Wed, 20 Nov 2024 20:09:44 UTC (888 KB)
[v2] Sat, 24 May 2025 18:55:53 UTC (423 KB)

Computer Science > Machine Learning

Title:Multi-Agent Best Arm Identification in Stochastic Linear Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Multi-Agent Best Arm Identification in Stochastic Linear Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators