Showing 1–9 of 9 results for author: Gilotte, A

  1. arXiv:2506.10677  [pdf, ps, other]

    stat.ML cs.LG

    Practical Improvements of A/B Testing with Off-Policy Estimation

    Authors: Otmane Sakhi, Alexandre Gilotte, David Rohde

    Abstract: We address the problem of A/B testing, a widely used protocol for evaluating the potential improvement achieved by a new decision system compared to a baseline. This protocol segments the population into two subgroups, each exposed to a version of the system, and estimates the improvement as the difference between the measured effects. In this work, we demonstrate that the commonly used difference-…

    Submitted 13 June, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2502.12103  [pdf, other]

    cs.CR stat.CO

    CriteoPrivateAds: A Real-World Bidding Dataset to Design Private Advertising Systems

    Authors: Mehdi Sebbar, Corentin Odic, Mathieu Léchine, Aloïs Bissuel, Nicolas Chrysanthos, Anthony D'Amato, Alexandre Gilotte, Fabian Höring, Sarah Nogueira, Maxime Vono

    Abstract: In the past years, many proposals have emerged to address online advertising use cases without access to third-party cookies. All these proposals leverage some privacy-enhancing technologies such as aggregation or differential privacy. Yet, no public and rich-enough ground truth is currently available to assess the relevance of the aforementioned private advertising frameworks. We are releasi…

    Submitted 14 April, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: 11 pages

  3. arXiv:2407.10504  [pdf, other]

    cs.GT cs.LG math.OC

    A pragmatic policy learning approach to account for users' fatigue in repeated auctions

    Authors: Benjamin Heymann, Rémi Chan--Renous-Legoubin, Alexandre Gilotte

    Abstract: Online advertising banners are sold in real time through auctions. Typically, the more banners a user is shown, the smaller the marginal value of the next banner for this user is. This fact can be detected by basic ML models, which can be used to predict how previously won auctions decrease the current opportunity value. However, learning is not enough to produce a bid that correctly accounts for how wi…

    Submitted 15 July, 2024; originally announced July 2024.

  4. arXiv:2210.02450  [pdf, other]

    cs.LG cs.AI

    Learning from aggregated data with a maximum entropy model

    Authors: Alexandre Gilotte, Ahmed Ben Yahmed, David Rohde

    Abstract: Aggregating a dataset, then injecting some noise, is a simple and common way to release differentially private data. However, aggregated data -- even without noise -- is not an appropriate input for machine learning classifiers. In this work, we show how a new model, similar to a logistic regression, may be learned from aggregated data only by approximating the unobserved feature distribution with a…

    Submitted 5 October, 2022; originally announced October 2022.

  5. arXiv:2208.05327  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Fast Offline Policy Optimization for Large Scale Recommendation

    Authors: Otmane Sakhi, David Rohde, Alexandre Gilotte

    Abstract: Personalised interactive systems such as recommender systems require selecting relevant items from massive catalogs dependent on context. Reward-driven offline optimisation of these systems can be achieved by a relaxation of the discrete problem resulting in policy learning or REINFORCE style learning algorithms. Unfortunately, this relaxation step requires computing a sum over the entire catalogu…

    Submitted 27 May, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted at AAAI 2023

  6. arXiv:2201.13123  [pdf, other]

    cs.LG cs.CR

    Lessons from the AdKDD'21 Privacy-Preserving ML Challenge

    Authors: Eustache Diemert, Romain Fabre, Alexandre Gilotte, Fei Jia, Basile Leparmentier, Jérémie Mary, Zhonghua Qu, Ugo Tanielian, Hui Yang

    Abstract: Designing data sharing mechanisms providing performance and strong privacy guarantees is a hot topic for the Online Advertising industry. Namely, a prominent proposal discussed under the Improving Web Advertising Business Group at W3C only allows sharing advertising signals through aggregated, differentially private reports of past displays. To study this proposal extensively, an open Privacy-Pres…

    Submitted 31 January, 2022; originally announced January 2022.

  7. arXiv:1909.08471  [pdf, other]

    cs.IR cs.LG stat.ML

    Learning from Bandit Feedback: An Overview of the State-of-the-art

    Authors: Olivier Jeunen, Dmytro Mykhaylov, David Rohde, Flavian Vasile, Alexandre Gilotte, Martin Bompaire

    Abstract: In machine learning we often try to optimise a decision rule that would have worked well over a historical dataset; this is the so-called empirical risk minimisation principle. In the context of learning from recommender system logs, applying this principle becomes a problem because we do not observe the reward of decisions we did not take. In order to handle this "bandit-feedback" setting, s…

    Submitted 18 September, 2019; originally announced September 2019.

  8. arXiv:1909.07926  [pdf, other]

    stat.ML cs.LG

    Ranking metrics on non-shuffled traffic

    Authors: Alexandre Gilotte

    Abstract: Ranking metrics are a family of metrics widely used to evaluate recommender systems. However, they typically suffer from the fact that the reward is affected by the order in which recommended items are displayed to the user. A classical way to overcome this position bias is to uniformly shuffle a proportion of the recommendations, but this method may result in a bad user experience. It is nevertheless…

    Submitted 17 September, 2019; originally announced September 2019.

  9. Offline A/B testing for Recommender Systems

    Authors: Alexandre Gilotte, Clément Calauzènes, Thomas Nedelec, Alexandre Abraham, Simon Dollé

    Abstract: Before A/B testing a new version of a recommender system online, it is usual to perform some offline evaluations on historical data. We focus on evaluation methods that compute an estimator of the potential uplift in revenue that this new technology could generate. This helps to iterate faster and to avoid losing money by detecting poor policies. These estimators are known as counterfactual or off-p…

    Submitted 22 January, 2018; originally announced January 2018.