Skip to main content

Showing 1–21 of 21 results for author: Cappé, O

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.02023  [pdf, other

    cs.LG stat.ML

    Optimal Classification under Performative Distribution Shift

    Authors: Edwige Cyffers, Muni Sreenivas Pydi, Jamal Atif, Olivier Cappé

    Abstract: Performative learning addresses the increasingly pervasive situations in which algorithmic decisions may induce changes in the data distribution as a consequence of their public deployment. We propose a novel view in which these performative effects are modelled as push-forward measures. This general framework encompasses existing models and enables novel performative gradient estimation methods,… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 38th Conference on Neural Information Processing Systems, Dec 2024, Vancouver (Canada), Canada

  2. arXiv:2110.15573  [pdf, other

    stat.ML cs.LG

    A/B/n Testing with Control in the Presence of Subpopulations

    Authors: Yoan Russac, Christina Katsimerou, Dennis Bohle, Olivier Cappé, Aurélien Garivier, Wouter Koolen

    Abstract: Motivated by A/B/n testing applications, we consider a finite set of distributions (called \emph{arms}), one of which is treated as a \emph{control}. We assume that the population is stratified into homogeneous subpopulations. At every time step, a subpopulation is sampled and an arm is chosen: the resulting observation is an independent draw from the arm conditioned on the subpopulation. The qual… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Journal ref: NeurIPS 2021, Dec 2021, Virtual, France

  3. arXiv:2107.01835  [pdf, other

    cs.LG stat.ML

    Fast Rate Learning in Stochastic First Price Bidding

    Authors: Juliette Achddou, Olivier Cappé, Aurélien Garivier

    Abstract: First-price auctions have largely replaced traditional bidding approaches based on Vickrey auctions in programmatic advertising. As far as learning is concerned, first-price auctions are more challenging because the optimal bidding strategy does not only depend on the value of the item but also requires some knowledge of the other bids. They have already given rise to several works in sequen… ▽ More

    Submitted 22 November, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Journal ref: ACML 2021 - Proceedings of Machine Learning Research 157, 2021, Nov 2021, SIngapore, Singapore

  4. arXiv:2011.05072  [pdf, other

    cs.LG stat.ML

    Efficient Algorithms for Stochastic Repeated Second-price Auctions

    Authors: Juliette Achddou, Olivier Cappé, Aurélien Garivier

    Abstract: Developing efficient sequential bidding strategies for repeated auctions is an important practical challenge in various marketing tasks. In this setting, the bidding agent obtains information, on both the value of the item at sale and the behavior of the other bidders, only when she wins the auction. Standard bandit theory does not apply to this problem due to the presence of action-dependent cen… ▽ More

    Submitted 26 February, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Journal ref: ALT 2021, Mar 2021, Paris, France

  5. arXiv:2011.00819  [pdf, other

    cs.LG cs.AI stat.ML

    Self-Concordant Analysis of Generalized Linear Bandits with Forgetting

    Authors: Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

    Abstract: Contextual sequential decision problems with categorical or numerical observations are ubiquitous and Generalized Linear Bandits (GLB) offer a solid theoretical framework to address them. In contrast to the case of linear bandits, existing algorithms for GLB have two drawbacks undermining their applicability. First, they rely on excessively pessimistic concentration bounds due to the non-linear na… ▽ More

    Submitted 4 March, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Journal ref: AISTATS 2021 - International Conference on Artificial Intelligence and Statistics, Apr 2021, San Diego / Virtual, United States

  6. A Comparative Study of Gamma Markov Chains for Temporal Non-Negative Matrix Factorization

    Authors: Louis Filstroff, Olivier Gouvert, Cédric Févotte, Olivier Cappé

    Abstract: Non-negative matrix factorization (NMF) has become a well-established class of methods for the analysis of non-negative data. In particular, a lot of effort has been devoted to probabilistic NMF, namely estimation or inference tasks in probabilistic models describing the data, based for example on Poisson or exponential likelihoods. When dealing with time series data, several works have proposed t… ▽ More

    Submitted 25 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Code available at https://github.com/lfilstro/TemporalNMF

  7. arXiv:2003.10113  [pdf, other

    cs.LG stat.ML

    Algorithms for Non-Stationary Generalized Linear Bandits

    Authors: Yoan Russac, Olivier Cappé, Aurélien Garivier

    Abstract: The statistical framework of Generalized Linear Models (GLM) can be applied to sequential problems involving categorical or ordinal rewards associated, for instance, with clicks, likes or ratings. In the example of binary rewards, logistic regression is well-known to be preferable to the use of standard linear modeling. Previous works have shown how to deal with GLMs in contextual online learning… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  8. arXiv:1909.09146  [pdf, other

    cs.LG stat.ML

    Weighted Linear Bandits for Non-Stationary Environments

    Authors: Yoan Russac, Claire Vernade, Olivier Cappé

    Abstract: We consider a stochastic linear bandit model in which the available actions correspond to arbitrary context vectors whose associated rewards follow a non-stationary linear regression model. In this setting, the unknown regression parameter is allowed to vary in time. To address this problem, we propose D-LinUCB, a novel optimistic algorithm based on discounted linear regression, where exponential… ▽ More

    Submitted 20 March, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

    Journal ref: NeurIPS 2019 - 33rd Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada

  9. arXiv:1509.09130  [pdf, ps, other

    stat.ML cs.IR cs.LG cs.SI

    Learning From Missing Data Using Selection Bias in Movie Recommendation

    Authors: Claire Vernade, Olivier Cappé

    Abstract: Recommending items to users is a challenging task due to the large amount of missing information. In many cases, the data solely consist of ratings or tags voluntarily contributed by each user on a very limited subset of the available items, so that most of the data of potential interest is actually missing. Current approaches to recommendation usually assume that the unobserved data is missing at… ▽ More

    Submitted 30 September, 2015; originally announced September 2015.

  10. arXiv:1407.4443  [pdf, other

    stat.ML cs.LG

    On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

    Authors: Emilie Kaufmann, Olivier Cappé, Aurélien Garivier

    Abstract: The stochastic multi-armed bandit model is a simple abstraction that has proven useful in many different contexts in statistics and machine learning. Whereas the achievable limit in terms of regret minimization is now well known, our aim is to contribute to a better understanding of the performance in terms of identifying the m best arms. We introduce generic notions of complexity for the two domi… ▽ More

    Submitted 14 November, 2016; v1 submitted 16 July, 2014; originally announced July 2014.

    Comments: arXiv admin note: text overlap with arXiv:1405.3224

    Journal ref: Journal of Machine Learning Research, Journal of Machine Learning Research, 2016, 17, pp.1-42

  11. arXiv:1405.3224  [pdf, other

    math.ST cs.LG stat.ML

    On the Complexity of A/B Testing

    Authors: Emilie Kaufmann, Olivier Cappé, Aurélien Garivier

    Abstract: A/B testing refers to the task of determining the best option among two alternatives that yield random outcomes. We provide distribution-dependent lower bounds for the performance of A/B testing that improve over the results currently available both in the fixed-confidence (or delta-PAC) and fixed-budget settings. When the distribution of the outcomes are Gaussian, we prove that the complexity… ▽ More

    Submitted 24 February, 2015; v1 submitted 13 May, 2014; originally announced May 2014.

    Journal ref: Conference on Learning Theory, Jun 2014, Barcelona, Spain. JMLR: Workshop and Conference Proceedings, 35, pp.461-481

  12. arXiv:1210.2601  [pdf, ps, other

    stat.CO math.PR math.ST stat.ME

    Adaptive MCMC with online relabeling

    Authors: Rémi Bardenet, Olivier Cappé, Gersende Fort, Balázs Kégl

    Abstract: When targeting a distribution that is artificially invariant under some permutations, Markov chain Monte Carlo (MCMC) algorithms face the label-switching problem, rendering marginal inference particularly cumbersome. Such a situation arises, for example, in the Bayesian analysis of finite mixture models. Adaptive MCMC algorithms such as adaptive Metropolis (AM), which self-calibrates its proposal… ▽ More

    Submitted 27 July, 2015; v1 submitted 9 October, 2012; originally announced October 2012.

    Comments: Published at http://dx.doi.org/10.3150/13-BEJ578 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ578

    Journal ref: Bernoulli 2015, Vol. 21, No. 3, 1304-1340

  13. arXiv:1102.1796  [pdf, ps, other

    stat.ME stat.CO

    Robust Retrospective Multiple Change-point Estimation for Multivariate Data

    Authors: Alexandre Lung-Yut-Fong, Céline Lévy-Leduc, Olivier Cappé

    Abstract: We propose a non-parametric statistical procedure for detecting multiple change-points in multidimensional signals. The method is based on a test statistic that generalizes the well-known Kruskal-Wallis procedure to the multivariate setting. The proposed approach does not require any knowledge about the distribution of the observations and is parameter-free. It is computationally efficient thanks… ▽ More

    Submitted 10 February, 2011; v1 submitted 9 February, 2011; originally announced February 2011.

    Comments: submitted to IEEE Workshop on Statistical Signal Processing 2011

  14. arXiv:1011.1745  [pdf, ps, other

    stat.CO

    Online Expectation-Maximisation

    Authors: Olivier Cappé

    Abstract: Tutorial chapter on the Online EM algorithm to appear in the volume 'Mixtures' edited by Kerrie Mengersen, Mike Titterington and Christian P. Robert.

    Submitted 8 November, 2010; originally announced November 2010.

  15. arXiv:1004.5229  [pdf, ps, other

    cs.LG math.ST stat.ML

    Optimism in Reinforcement Learning and Kullback-Leibler Divergence

    Authors: Sarah Filippi, Olivier Cappé, Aurélien Garivier

    Abstract: We consider model-based reinforcement learning in finite Markov De- cision Processes (MDPs), focussing on so-called optimistic strategies. In MDPs, optimism can be implemented by carrying out extended value it- erations under a constraint of consistency with the estimated model tran- sition probabilities. The UCRL2 algorithm by Auer, Jaksch and Ortner (2009), which follows this strategy, has recen… ▽ More

    Submitted 13 October, 2010; v1 submitted 29 April, 2010; originally announced April 2010.

    Comments: This work has been accepted and presented at ALLERTON 2010; Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on, Monticello (Illinois) : États-Unis (2010)

  16. arXiv:0909.5524  [pdf, ps, other

    stat.AP cs.NI math.ST

    Distributed detection/localization of change-points in high-dimensional network traffic data

    Authors: Alexandre Lung-Yut-Fong, Céline Lévy-Leduc, Olivier Cappé

    Abstract: We propose a novel approach for distributed statistical detection of change-points in high-volume network traffic. We consider more specifically the task of detecting and identifying the targets of Distributed Denial of Service (DDoS) attacks. The proposed algorithm, called DTopRank, performs distributed network anomaly detection by aggregating the partial information gathered in a set of network… ▽ More

    Submitted 20 September, 2011; v1 submitted 30 September, 2009; originally announced September 2009.

    Comments: Statistics and Computing (2011) 1-12

  17. arXiv:0908.2359  [pdf, ps, other

    stat.CO stat.ML

    Online EM Algorithm for Hidden Markov Models

    Authors: Olivier Cappé

    Abstract: Online (also called "recursive" or "adaptive") estimation of fixed model parameters in hidden Markov models is a topic of much interest in times series modelling. In this work, we propose an online parameter estimation algorithm that combines two key ideas. The first one, which is deeply rooted in the Expectation-Maximization (EM) methodology consists in reparameterizing the problem using complete… ▽ More

    Submitted 15 February, 2011; v1 submitted 17 August, 2009; originally announced August 2009.

    Comments: Revised version, to appear in J. Comput. Graph. Statist

  18. arXiv:0908.0319  [pdf, ps, other

    stat.ML cs.AI cs.LG cs.NI

    Regret Bounds for Opportunistic Channel Access

    Authors: Sarah Filippi, Olivier Cappé, Aurélien Garivier

    Abstract: We consider the task of opportunistic channel access in a primary system composed of independent Gilbert-Elliot channels where the secondary (or opportunistic) user does not dispose of a priori information regarding the statistical characteristics of the system. It is shown that this problem may be cast into the framework of model-based learning in a specific class of Partially Observed Markov D… ▽ More

    Submitted 3 August, 2009; originally announced August 2009.

  19. arXiv:0903.0837  [pdf, ps, other

    astro-ph.CO stat.CO

    Estimation of cosmological parameters using adaptive importance sampling

    Authors: Darren Wraith, Martin Kilbinger, Karim Benabed, Olivier Cappé, Jean-François Cardoso, Gersende Fort, Simon Prunet, Christian P. Robert

    Abstract: We present a Bayesian sampling algorithm called adaptive importance sampling or Population Monte Carlo (PMC), whose computational workload is easily parallelizable and thus has the potential to considerably reduce the wall-clock time required for sampling, along with providing other benefits. To assess the performance of the approach for cosmological problems, we use simulated and actual data co… ▽ More

    Submitted 4 March, 2009; originally announced March 2009.

    Comments: 17 pages, 11 figures

    Journal ref: Phys.Rev.D80:023507,2009

  20. Online EM Algorithm for Latent Data Models

    Authors: Olivier Cappé, Eric Moulines

    Abstract: In this contribution, we propose a generic online (also sometimes called adaptive or recursive) version of the Expectation-Maximisation (EM) algorithm applicable to latent variable models of independent observations. Compared to the algorithm of Titterington (1984), this approach is more directly connected to the usual EM algorithm and does not rely on integration with respect to the complete data… ▽ More

    Submitted 1 March, 2017; v1 submitted 27 December, 2007; originally announced December 2007.

    Comments: Version that includes the corrigendum published in volume 73, part 5 (2011), of the Journal of the Royal Statistical Society, Series B + the correction of a typo in Eqs. (32-33)

    Journal ref: Journal of the Royal Statistical Society: Series B, Royal Statistical Society, 2009, 71 (3), pp.593-613

  21. Adaptive Importance Sampling in General Mixture Classes

    Authors: Olivier Cappé, Randal Douc, Arnaud Guillin, Jean-Michel Marin, Christian P. Robert

    Abstract: In this paper, we propose an adaptive algorithm that iteratively updates both the weights and component parameters of a mixture importance sampling density so as to optimise the importance sampling performances, as measured by an entropy criterion. The method is shown to be applicable to a wide class of importance sampling densities, which includes in particular mixtures of multivariate Student… ▽ More

    Submitted 30 May, 2008; v1 submitted 23 October, 2007; originally announced October 2007.

    Comments: Removed misleading comment in Section 2

    Journal ref: Statistics and Computing 18, 4 (2008) 447-459