Thompson Sampling-like Algorithms for Stochastic Rising Bandits

Fiandri, Marco; Metelli, Alberto Maria; Trovò, Francesco

arXiv:2505.12092 (stat)

[Submitted on 17 May 2025 (v1), last revised 20 May 2025 (this version, v2)]

Abstract: The stochastic rising rested bandit (SRRB) is a setting in which the arms' expected rewards increase as they are pulled. It models scenarios in which the performance of the different options grows as an effect of an underlying learning process (e.g., online model selection). Although the bandit literature provides algorithms based on upper-confidence bounds that are specifically crafted for this setting, no study of Thompson sampling (TS)-like algorithms has been performed so far. The strong regularity of the expected rewards in the SRRB setting suggests that specific instances may be tackled effectively using adapted and sliding-window TS approaches. This work provides novel regret analyses for such algorithms in SRRBs, highlighting the challenges and providing new technical tools of independent interest. Our results allow us to identify under which assumptions TS-like algorithms succeed in achieving sublinear regret and which properties of the environment govern the complexity of the regret minimization problem when approached with TS. Furthermore, we provide a regret lower bound based on a complexity index we introduce. Finally, we conduct numerical simulations comparing TS-like algorithms with state-of-the-art approaches for SRRBs in synthetic and real-world settings.
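
To make the setting concrete, the following is a minimal illustrative sketch, not the authors' algorithm. It assumes Bernoulli rewards, a hypothetical concave reward curve (`rising_mean`), a uniform Beta(1, 1) prior, and a per-arm window over each arm's most recent pulls; all names and parameter values are assumptions introduced here for illustration only.

```python
import random
from collections import deque


def rising_mean(n_pulls, cap, rate):
    """Hypothetical expected reward of an arm after n_pulls pulls.

    Non-decreasing and concave in n_pulls, approaching `cap` from below.
    """
    return cap * (1.0 - (1.0 - rate) ** n_pulls)


def sliding_window_ts(arms, horizon, window, seed=0):
    """Beta-Bernoulli Thompson sampling on a rested rising bandit.

    Each arm's posterior uses only that arm's last `window` rewards
    (an assumed windowing scheme, chosen for illustration).
    `arms` is a list of (cap, rate) pairs; returns pull counts per arm.
    """
    rng = random.Random(seed)
    pulls = [0] * len(arms)
    recent = [deque(maxlen=window) for _ in arms]
    for _ in range(horizon):
        # Draw one posterior sample per arm from Beta(1 + successes, 1 + failures).
        samples = []
        for rewards in recent:
            successes = sum(rewards)
            failures = len(rewards) - successes
            samples.append(rng.betavariate(1 + successes, 1 + failures))
        chosen = max(range(len(arms)), key=samples.__getitem__)
        # Rested dynamics: the mean depends only on how often this arm was pulled.
        cap, rate = arms[chosen]
        mean = rising_mean(pulls[chosen], cap, rate)
        recent[chosen].append(1 if rng.random() < mean else 0)
        pulls[chosen] += 1
    return pulls
```

For example, `sliding_window_ts([(0.9, 0.05), (0.5, 0.5)], horizon=200, window=20)` pits a slow-rising arm with a high ceiling against a fast-rising arm with a low ceiling, the kind of instance in which discarding old, pessimistic observations can matter.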

Comments: 57 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2505.12092 [stat.ML] (or arXiv:2505.12092v2 [stat.ML] for this version)
DOI: https://doi.org/10.48550/arXiv.2505.12092

Submission history

From: Marco Fiandri
[v1] Sat, 17 May 2025 17:19:07 UTC (910 KB)
[v2] Tue, 20 May 2025 15:08:08 UTC (901 KB)
