Showing 1–2 of 2 results for author: Boudart, P

  1. arXiv:2507.05306  [pdf, ps, other]

    stat.ML cs.AI cs.LG math.ST

    Enjoying Non-linearity in Multinomial Logistic Bandits

    Authors: Pierre Boudart, Pierre Gaillard, Alessandro Rudi

    Abstract: We consider the multinomial logistic bandit problem, a variant of generalized linear bandits where a learner interacts with an environment by selecting actions to maximize expected rewards based on probabilistic feedback from multiple possible outcomes. In the binary setting, recent work has focused on understanding the impact of the non-linearity of the logistic model (Faury et al., 2020; Abeille…

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2406.12366  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Structured Prediction in Online Learning

    Authors: Pierre Boudart, Alessandro Rudi, Pierre Gaillard

    Abstract: We study a theoretical and algorithmic framework for structured prediction in the online learning setting. The problem of structured prediction, i.e., estimating a function whose output space lacks a vectorial structure, is well studied in the literature of supervised statistical learning. We show that our algorithm is a generalisation of optimal algorithms from the supervised learning setting, a…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 29 pages