Skip to main content

Showing 1–8 of 8 results for author: Loustau, S

Searching in archive math. Search in all archives.
.
  1. arXiv:1602.00522  [pdf, other

    stat.ML math.ST

    A Quasi-Bayesian Perspective to Online Clustering

    Authors: Le Li, Benjamin Guedj, Sébastien Loustau

    Abstract: When faced with high frequency streams of data, clustering raises theoretical and algorithmic pitfalls. We introduce a new and adaptive online clustering algorithm relying on a quasi-Bayesian approach, with a dynamic (i.e., time-dependent) estimation of the (unknown and changing) number of clusters. We prove that our approach is supported by minimax regret bounds. We also provide an RJMCMC-flavore… ▽ More

    Submitted 25 May, 2018; v1 submitted 1 February, 2016; originally announced February 2016.

    Journal ref: Electronic Journal of Statistics (2018), vol. 12(2), 3071--3113

  2. Bandwidth selection in kernel empirical risk minimization via the gradient

    Authors: Michaël Chichignoud, Sébastien Loustau

    Abstract: In this paper, we deal with the data-driven selection of multidimensional and possibly anisotropic bandwidths in the general framework of kernel empirical risk minimization. We propose a universal selection rule, which leads to optimal adaptive results in a large variety of statistical models such as nonparametric robust regression and statistical learning with errors in variables. These results a… ▽ More

    Submitted 18 August, 2015; v1 submitted 27 January, 2014; originally announced January 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1318 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1318

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 4, 1617-1646

  3. arXiv:1307.3369  [pdf, ps, other

    math.ST

    Noisy classification with boundary assumptions

    Authors: Sébastien Loustau, Clément Marteau

    Abstract: We address the problem of classification when data are collected from two samples with measurement errors. This problem turns to be an inverse problem and requires a specific treatment. In this context, we investigate the minimax rates of convergence using both a margin assumption, and a smoothness condition on the boundary of the set associated to the Bayes classifier. We establish lower and uppe… ▽ More

    Submitted 12 July, 2013; originally announced July 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1201.3283

  4. arXiv:1306.2194  [pdf, other

    math.ST stat.ML

    Adaptive Noisy Clustering

    Authors: Michael Chichignoud, Sébastien Loustau

    Abstract: The problem of adaptive noisy clustering is investigated. Given a set of noisy observations $Z_i=X_i+ε_i$, $i=1,...,n$, the goal is to design clusters associated with the law of $X_i$'s, with unknown density $f$ with respect to the Lebesgue measure. Since we observe a corrupted sample, a direct approach as the popular {\it $k$-means} is not suitable in this case. In this paper, we propose a noisy… ▽ More

    Submitted 10 June, 2013; originally announced June 2013.

    Comments: 22 pages

  5. arXiv:1305.0630  [pdf, ps, other

    math.ST stat.ML

    Anisotropic oracle inequalities in noisy quantization

    Authors: Sébastien Loustau

    Abstract: The effect of errors in variables in quantization is investigated. We prove general exact and non-exact oracle inequalities with fast rates for an empirical minimization based on a noisy sample $Z_i=X_i+ε_i,i=1,\ldots,n$, where $X_i$ are i.i.d. with density $f$ and $ε_i$ are i.i.d. with density $η$. These rates depend on the geometry of the density $f$ and the asymptotic behaviour of the character… ▽ More

    Submitted 3 May, 2013; originally announced May 2013.

    Comments: 30 pages. arXiv admin note: text overlap with arXiv:1205.1417

  6. arXiv:1205.1417  [pdf, ps, other

    math.ST

    Fast rates for noisy clustering

    Authors: Sébastien Loustau

    Abstract: The effect of errors in variables in empirical minimization is investigated. Given a loss $l$ and a set of decision rules $\mathcal{G}$, we prove a general upper bound for an empirical minimization based on a deconvolution kernel and a noisy sample $Z_i=X_i+ε_i,i=1,...,n$. We apply this general upper bound to give the rate of convergence for the expected excess risk in noisy clustering. A recent b… ▽ More

    Submitted 7 May, 2012; originally announced May 2012.

  7. arXiv:1201.6115  [pdf, ps, other

    math.ST

    Statistical learning with indirect observations

    Authors: Sébastien Loustau

    Abstract: Let $(X,Y)\in\mathcal{X}\times \mathcal{Y}$ be a random couple with unknown distribution $P$. Let $\GG$ be a class of measurable functions and $\ell$ a loss function. The problem of statistical learning deals with the estimation of the Bayes: $$g^*=\arg\min_{g\in\GG}\E_P \ell(g(X),Y). $$ In this paper, we study this problem when we deal with a contaminated sample $(Z_1,Y_1),..., (Z_n,Y_n)$ of i.i.… ▽ More

    Submitted 10 July, 2012; v1 submitted 30 January, 2012; originally announced January 2012.

  8. Minimax fast rates for discriminant analysis with errors in variables

    Authors: Sébastien Loustau, Clément Marteau

    Abstract: The effect of measurement errors in discriminant analysis is investigated. Given observations $Z=X+ε$, where $ε$ denotes a random noise, the goal is to predict the density of $X$ among two possible candidates $f$ and $g$. We suppose that we have at our disposal two learning samples. The aim is to approach the best possible decision rule $G^\star$ defined as a minimizer of the Bayes risk. In the fr… ▽ More

    Submitted 12 May, 2015; v1 submitted 16 January, 2012; originally announced January 2012.

    Journal ref: Bernoulli, Bernoulli Society for Mathematical Statistics and Probability, 2015, pp.30