Skip to main content

Showing 1–4 of 4 results for author: Catoni, O

Searching in archive stat. Search in all archives.
.
  1. arXiv:1603.07850  [pdf, ps, other

    stat.ML math.ST

    Markov substitute processes : a new model for linguistics and beyond

    Authors: Olivier Catoni, Thomas Mainguy

    Abstract: We introduce Markov substitute processes, a new model at the crossroad of statistics and formal grammars, and prove its main property : Markov substitute processes with a given support form an exponential family.

    Submitted 25 March, 2016; originally announced March 2016.

    Comments: 22 pages

    MSC Class: 62M09; 60J10; 91F20; 68T50

  2. arXiv:1302.2569  [pdf, ps, other

    stat.ML cs.CL math.PR

    Toric grammars: a new statistical approach to natural language modeling

    Authors: Olivier Catoni, Thomas Mainguy

    Abstract: We propose a new statistical model for computational linguistics. Rather than trying to estimate directly the probability distribution of a random sentence of the language, we define a Markov chain on finite sets of sentences with many finite recurrent communicating classes and define our language model as the invariant probability measures of the chain on each recurrent communicating class. This… ▽ More

    Submitted 11 February, 2013; originally announced February 2013.

    MSC Class: 62M09; 62P99; 68T50; 91F20; 03B65; 91E40; 60J20

  3. arXiv:0902.1733  [pdf, ps, other

    stat.ML math.ST

    Risk bounds in linear regression through PAC-Bayesian truncation

    Authors: Jean-Yves Audibert, Olivier Catoni

    Abstract: We consider the problem of predicting as well as the best linear combination of d given functions in least squares regression, and variants of this problem including constraints on the parameters of the linear combination. When the input distribution is known, there already exists an algorithm having an expected excess risk of order d/n, where n is the size of the training data. Without this stron… ▽ More

    Submitted 4 July, 2010; v1 submitted 10 February, 2009; originally announced February 2009.

    Comments: 78 pages

  4. Pac-Bayesian Supervised Classification: The Thermodynamics of Statistical Learning

    Authors: Olivier Catoni

    Abstract: This monograph deals with adaptive supervised classification, using tools borrowed from statistical mechanics and information theory, stemming from the PACBayesian approach pioneered by David McAllester and applied to a conception of statistical learning theory forged by Vladimir Vapnik. Using convex analysis on the set of posterior probability measures, we show how to get local measures of the… ▽ More

    Submitted 3 December, 2007; originally announced December 2007.

    Comments: Published in at http://dx.doi.org/10.1214/074921707000000391 the IMS Lecture Notes Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-LNMS56-LNMS5601 MSC Class: 62H30; 68T05; 62B10 (Primary)

    Journal ref: IMS Lecture Notes Monograph Series 2007, Vol. 56, i-xii, 1-163