Quantifying the Burden of Exploration and the Unfairness of Free Riding

Jung, Christopher; Kannan, Sampath; Lutz, Neil

arXiv:1810.08743v2 (cs)

[Submitted on 20 Oct 2018 (v1), revised 23 Jan 2019 (this version, v2), latest version 4 Feb 2022 (v5)]

Abstract: We consider the multi-armed bandit setting with a twist. Rather than having just one decision maker deciding which arm to pull in each round, we have $n$ different decision makers (agents). In the simple stochastic setting, we show that one of the agents (called the free rider), who has access to the history of other agents playing some zero-regret algorithm, can achieve just $O(1)$ regret, as opposed to the regret lower bound of $\Omega(\log T)$ when one decision maker is playing in isolation. In the linear contextual setting, we show that if the other agents play a particular, popular zero-regret algorithm (UCB), then the free rider can again achieve $O(1)$ regret. In order to prove this result, we give a deterministic lower bound on the number of times each suboptimal arm must be pulled in UCB. In contrast, we show that the free rider cannot beat the standard single-player regret bounds in certain partial-information settings. Lastly, we complement our theoretical results with simulations for both stochastic and contextual settings.
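
To make the stochastic setting concrete, here is a minimal simulation sketch, not the authors' code or their free-riding strategy. It assumes Bernoulli arms, a single self-reliant agent running the textbook UCB1 index, and a hypothetical free rider that never uses its own rewards and simply plays the arm with the highest empirical mean in the UCB1 agent's history. The function name `ucb1_with_free_rider` and its parameters are illustrative assumptions. The sketch only tracks expected (pseudo-)regret for both players so they can be compared; it does not demonstrate the $O(1)$ bound.

```python
import math
import random


def ucb1_with_free_rider(means, horizon, seed=0):
    """Simulate one UCB1 agent and one greedy free rider (hypothetical rule).

    means   -- true Bernoulli success probability of each arm (assumed setup)
    horizon -- number of rounds T
    Returns (ucb_regret, rider_regret, pulls), where the regrets are
    pseudo-regrets: sums of (best mean - mean of the arm played).
    """
    rng = random.Random(seed)
    k = len(means)
    best = max(means)
    pulls = [0] * k        # UCB1 agent's pull counts per arm
    totals = [0.0] * k     # UCB1 agent's summed rewards per arm
    ucb_regret = 0.0
    rider_regret = 0.0

    for t in range(1, horizon + 1):
        # Free rider acts on the UCB1 agent's history from earlier rounds only.
        if t > k:
            # Every arm has been pulled at least once, so the means are defined.
            rider_arm = max(range(k), key=lambda a: totals[a] / pulls[a])
        else:
            # No complete history yet: pick an arm uniformly at random.
            rider_arm = rng.randrange(k)
        rider_regret += best - means[rider_arm]

        # UCB1 agent: pull each arm once, then maximize mean + confidence bonus.
        if t <= k:
            arm = t - 1
        else:
            arm = max(
                range(k),
                key=lambda a: totals[a] / pulls[a]
                + math.sqrt(2.0 * math.log(t) / pulls[a]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pulls[arm] += 1
        totals[arm] += reward
        ucb_regret += best - means[arm]

    return ucb_regret, rider_regret, pulls


if __name__ == "__main__":
    ucb_r, rider_r, counts = ucb1_with_free_rider([0.9, 0.5, 0.4], horizon=5000)
    print("UCB1 pseudo-regret:      ", round(ucb_r, 2))
    print("Free-rider pseudo-regret:", round(rider_r, 2))
    print("UCB1 pull counts:        ", counts)
```

Running the script with different seeds and gaps between arm means gives a rough feel for how much of the exploration cost is borne by the UCB1 agent alone. The greedy rule above is only one simple way to exploit another agent's history and carries no guarantee from the paper.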

Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:1810.08743 [cs.LG]
	(or arXiv:1810.08743v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.08743

Submission history

From: Neil Lutz
[v1] Sat, 20 Oct 2018 03:08:52 UTC (15 KB)
[v2] Wed, 23 Jan 2019 22:22:35 UTC (136 KB)
[v3] Wed, 17 Jul 2019 01:16:56 UTC (98 KB)
[v4] Tue, 22 Sep 2020 17:29:49 UTC (306 KB)
[v5] Fri, 4 Feb 2022 15:57:48 UTC (306 KB)
