Skip to main content

Showing 1–3 of 3 results for author: Fiandri, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.12092  [pdf, other

    stat.ML cs.LG

    Thompson Sampling-like Algorithms for Stochastic Rising Bandits

    Authors: Marco Fiandri, Alberto Maria Metelli, Francesco Trovò

    Abstract: Stochastic rising rested bandit (SRRB) is a setting where the arms' expected rewards increase as they are pulled. It models scenarios in which the performances of the different options grow as an effect of an underlying learning process (e.g., online model selection). Even if the bandit literature provides specifically crafted algorithms based on upper-confidence bounds for such a setting, no stud… ▽ More

    Submitted 20 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

    Comments: 57 pages

  2. arXiv:2411.14446  [pdf, other

    stat.ML cs.LG

    Rising Rested Bandits: Lower Bounds and Efficient Algorithms

    Authors: Marco Fiandri, Alberto Maria Metelli, Francesco Trov`o

    Abstract: This paper is in the field of stochastic Multi-Armed Bandits (MABs), i.e. those sequential selection techniques able to learn online using only the feedback given by the chosen option (a.k.a. $arm$). We study a particular case of the rested bandits in which the arms' expected reward is monotonically non-decreasing and concave. We study the inherent sample complexity of the regret minimization prob… ▽ More

    Submitted 26 November, 2024; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: 63 pages. arXiv admin note: substantial text overlap with arXiv:2212.03798

  3. arXiv:2409.05181  [pdf, ps, other

    stat.ML cs.LG

    Sliding-Window Thompson Sampling for Non-Stationary Settings

    Authors: Marco Fiandri, Alberto Maria Metelli, Francesco Trovò

    Abstract: Non-stationary multi-armed bandits (NS-MABs) model sequential decision-making problems in which the expected rewards of a set of actions, a.k.a.~arms, evolve over time. In this paper, we fill a gap in the literature by providing a novel analysis of Thompson sampling-inspired (TS) algorithms for NS-MABs that both corrects and generalizes existing work. Specifically, we study the cumulative frequent… ▽ More

    Submitted 14 June, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

    Comments: 32 pages