Skip to main content

Showing 1–14 of 14 results for author: Jourdan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23557  [pdf, ps, other

    stat.ML cs.LG

    Learning Parametric Distributions from Samples and Preferences

    Authors: Marc Jourdan, Gizem Yüce, Nicolas Flammarion

    Abstract: Recent advances in language modeling have underscored the role of preference feedback in enhancing model performance. This paper investigates the conditions under which preference feedback improves parameter estimation in classes of continuous parametric distributions. In our framework, the learner observes pairs of samples from an unknown distribution along with their relative preferences dependi… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 28 pages, 8 figures. To be published in the Forty-Second International Conference on Machine Learning

  2. arXiv:2411.04939  [pdf, other

    stat.ML cs.LG

    Pareto Set Identification With Posterior Sampling

    Authors: Cyrille Kone, Marc Jourdan, Emilie Kaufmann

    Abstract: The problem of identifying the best answer among a collection of items having real-valued distribution is well-understood. Despite its practical relevance for many applications, fewer works have studied its extension when multiple and potentially conflicting metrics are available to assess an item's quality. Pareto set identification (PSI) aims to identify the set of answers whose means are no… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  3. arXiv:2411.01898  [pdf, other

    cs.LG cs.AI

    Best-Arm Identification in Unimodal Bandits

    Authors: Riccardo Poiani, Marc Jourdan, Emilie Kaufmann, Rémy Degenne

    Abstract: We study the fixed-confidence best-arm identification problem in unimodal bandits, in which the means of the arms increase with the index of the arm up to their maximum, then decrease. We derive two lower bounds on the stopping time of any algorithm. The instance-dependent lower bound suggests that due to the unimodal structure, only three arms contribute to the leading confidence-dependent cost.… ▽ More

    Submitted 26 May, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  4. arXiv:2406.06408  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    Differentially Private Best-Arm Identification

    Authors: Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu

    Abstract: Best Arm Identification (BAI) problems are progressively used for data-sensitive applications, such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user studies. Motivated by the data privacy concerns invoked by these applications, we study the problem of BAI with fixed confidence in both the local and central models, i.e. $ε$-local and $ε$-global Differential Privac… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.02202

  5. arXiv:2310.10359  [pdf, other

    stat.ML cs.LG

    An Anytime Algorithm for Good Arm Identification

    Authors: Marc Jourdan, Clémence Réda

    Abstract: In good arm identification (GAI), the goal is to identify one arm whose average performance exceeds a given threshold, referred to as good arm, if it exists. Few works have studied GAI in the fixed-budget setting, when the sampling budget is fixed beforehand, or the anytime setting, when a recommendation can be asked at any time. We propose APGAI, an anytime and parameter-free sampling rule for GA… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 56 pages, 23 figures, 11 tables

  6. arXiv:2309.02202  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence

    Authors: Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu

    Abstract: Best Arm Identification (BAI) problems are progressively used for data-sensitive applications, such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user studies to name a few. Motivated by the data privacy concerns invoked by these applications, we study the problem of BAI with fixed confidence under $ε$-global Differential Privacy (DP). First, to quantify the cost o… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  7. arXiv:2305.16041  [pdf, other

    stat.ML cs.LG

    An $\varepsilon$-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond

    Authors: Marc Jourdan, Rémy Degenne, Emilie Kaufmann

    Abstract: We propose EB-TC$\varepsilon$, a novel sampling rule for $\varepsilon$-best arm identification in stochastic bandits. It is the first instance of Top Two algorithm analyzed for approximate best arm identification. EB-TC$\varepsilon$ is an *anytime* sampling rule that can therefore be employed without modification for fixed confidence or fixed budget identification (without prior knowledge of the b… ▽ More

    Submitted 6 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 68 pages, 14 figures, 4 tables. To be published in the Thirty-seventh Conference on Neural Information Processing Systems

  8. arXiv:2210.05431  [pdf, other

    stat.ML cs.LG

    Non-Asymptotic Analysis of a UCB-based Top Two Algorithm

    Authors: Marc Jourdan, Rémy Degenne

    Abstract: A Top Two sampling rule for bandit identification is a method which selects the next arm to sample from among two candidate arms, a leader and a challenger. Due to their simplicity and good empirical performance, they have received increased attention in recent years. However, for fixed-confidence best arm identification, theoretical guarantees for Top Two methods have only been obtained in the as… ▽ More

    Submitted 6 November, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: 32 pages, 5 figures, 3 tables. To be published in the Thirty-seventh Conference on Neural Information Processing Systems

  9. arXiv:2210.00974  [pdf, other

    stat.ML cs.LG

    Dealing with Unknown Variances in Best-Arm Identification

    Authors: Marc Jourdan, Rémy Degenne, Emilie Kaufmann

    Abstract: The problem of identifying the best arm among a collection of items having Gaussian rewards distribution is well understood when the variances are known. Despite its practical relevance for many applications, few works studied it for unknown variances. In this paper we introduce and analyze two approaches to deal with unknown variances, either by plugging in the empirical variance or by adapting t… ▽ More

    Submitted 23 January, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 73 pages, 5 figures, 3 tables. To be published in the 34th International Conference on Algorithmic Learning Theory, Singapore, 2023

  10. arXiv:2206.05979  [pdf, other

    stat.ML cs.LG

    Top Two Algorithms Revisited

    Authors: Marc Jourdan, Rémy Degenne, Dorian Baudry, Rianne de Heide, Emilie Kaufmann

    Abstract: Top Two algorithms arose as an adaptation of Thompson sampling to best arm identification in multi-armed bandit models (Russo, 2016), for parametric families of arms. They select the next arm to sample from by randomizing among two candidate arms, a leader and a challenger. Despite their good empirical performance, theoretical guarantees for fixed-confidence best arm identification have only been… ▽ More

    Submitted 4 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 75 pages, 8 figures, 3 tables

  11. arXiv:2206.04456  [pdf, other

    stat.ML cs.LG

    Choosing Answers in $\varepsilon$-Best-Answer Identification for Linear Bandits

    Authors: Marc Jourdan, Rémy Degenne

    Abstract: In pure-exploration problems, information is gathered sequentially to answer a question on the stochastic environment. While best-arm identification for linear bandits has been extensively studied in recent years, few works have been dedicated to identifying one arm that is $\varepsilon$-close to the best one (and not exactly the best one). In this problem with several correct answers, an identifi… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 47 pages, 10 figures, 8 tables. To be published in the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  12. arXiv:2101.08534  [pdf, other

    stat.ML cs.LG

    Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

    Authors: Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause

    Abstract: Combinatorial bandits with semi-bandit feedback generalize multi-armed bandits, where the agent chooses sets of arms and observes a noisy reward for each arm contained in the chosen set. The action set satisfies a given structure such as forming a base of a matroid or a path in a graph. We focus on the pure-exploration problem of identifying the best arm with fixed confidence, as well as a more ge… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 45 pages. 3 tables. Appendices: from A to I. Figures: 1(a), 1(b), 2(a), 2(b), 3(a), 3(b), 3(c), 4(a), 4(b), 5(a), 5(b), 5(c), 5(d), 6(a), 6(b). To be published in the 32nd International Conference on Algorithmic Learning Theory and the Proceedings of Machine Learning Research vol 132:1-45, 2021

  13. arXiv:1812.05451  [pdf, other

    cs.CR cs.AI cs.LG stat.ML

    A Probabilistic Model of the Bitcoin Blockchain

    Authors: Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande

    Abstract: The Bitcoin transaction graph is a public data structure organized as transactions between addresses, each associated with a logical entity. In this work, we introduce a complete probabilistic model of the Bitcoin Blockchain. We first formulate a set of conditional dependencies induced by the Bitcoin protocol at the block level and derive a corresponding fully observed graphical model of a Bitcoin… ▽ More

    Submitted 6 November, 2018; originally announced December 2018.

  14. arXiv:1810.11956  [pdf, other

    cs.CR cs.LG

    Characterizing Entities in the Bitcoin Blockchain

    Authors: Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande

    Abstract: Bitcoin has created a new exchange paradigm within which financial transactions can be trusted without an intermediary. This premise of a free decentralized transactional network however requires, in its current implementation, unrestricted access to the ledger for peer-based transaction verification. A number of studies have shown that, in this pseudonymous context, identities can be leaked based… ▽ More

    Submitted 29 October, 2018; originally announced October 2018.