Skip to main content

Showing 1–10 of 10 results for author: Clerico, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.16555  [pdf, other

    math.ST cs.LG stat.ML

    Confidence Sequences for Generalized Linear Models via Regret Analysis

    Authors: Eugenio Clerico, Hamish Flynn, Wojciech Kotłowski, Gergely Neu

    Abstract: We develop a methodology for constructing confidence sets for parameters of statistical models via a reduction to sequential prediction. Our key observation is that for any generalized linear model (GLM), one can construct an associated game of sequential probability assignment such that achieving low regret in the game implies a high-probability upper bound on the excess likelihood of the true pa… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2503.08231  [pdf, other

    stat.ML cs.LG

    How good is PAC-Bayes at explaining generalisation?

    Authors: Antoine Picard-Weibel, Eugenio Clerico, Roman Moscoviz, Benjamin Guedj

    Abstract: We discuss necessary conditions for a PAC-Bayes bound to provide a meaningful generalisation guarantee. Our analysis reveals that the optimal generalisation guarantee depends solely on the distribution of the risk induced by the prior distribution. In particular, achieving a target generalisation level is only achievable if the prior places sufficient mass on high-performing predictors. We relate… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  3. arXiv:2412.02640  [pdf, ps, other

    math.ST stat.ME

    On the optimality of coin-betting for mean estimation

    Authors: Eugenio Clerico

    Abstract: Confidence sequences are sequences of confidence sets that adapt to incoming data while maintaining validity. Recent advances have introduced an algorithmic formulation for constructing some of the tightest confidence sequences for bounded real random variables. These approaches use a coin-betting framework, where a player sequentially bets on differences between potential mean values and observed… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  4. arXiv:2410.08977  [pdf, ps, other

    stat.ML cs.LG

    Online-to-PAC generalization bounds under graph-mixing dependencies

    Authors: Baptiste Abélès, Eugenio Clerico, Gergely Neu

    Abstract: Traditional generalization results in statistical learning require a training data set made of independently drawn examples. Most of the recent efforts to relax this independence assumption have considered either purely temporal (mixing) dependencies, or graph-dependencies, where non-adjacent vertices correspond to independent random variables. Both approaches have their own limitations, the forme… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 13 pages (10 main + 3 supplementary material). All authors contributed equally

  5. arXiv:2312.13259  [pdf, ps, other

    stat.ML cs.LG

    A note on regularised NTK dynamics with an application to PAC-Bayesian training

    Authors: Eugenio Clerico, Benjamin Guedj

    Abstract: We establish explicit dynamics for neural networks whose training objective has a regularising term that constrains the parameters to remain close to their initial value. This keeps the network in a lazy training regime, where the dynamics can be linearised around the initialisation. The standard neural tangent kernel (NTK) governs the evolution during the training in the infinite-width limit, alt… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  6. arXiv:2209.02525  [pdf, ps, other

    stat.ML cs.LG

    Generalisation under gradient descent via deterministic PAC-Bayes

    Authors: Eugenio Clerico, Tyler Farghly, George Deligiannidis, Benjamin Guedj, Arnaud Doucet

    Abstract: We establish disintegrated PAC-Bayesian generalisation bounds for models trained with gradient descent methods or continuous gradient flows. Contrary to standard practice in the PAC-Bayesian setting, our result applies to optimisation algorithms that are deterministic, without requiring any de-randomisation step. Our bounds are fully computable, depending on the density of the initial distribution… ▽ More

    Submitted 11 February, 2025; v1 submitted 6 September, 2022; originally announced September 2022.

  7. arXiv:2203.00977  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Chained Generalisation Bounds

    Authors: Eugenio Clerico, Amitis Shidani, George Deligiannidis, Arnaud Doucet

    Abstract: This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By developing a general theoretical framework, we establish a duality between generalisation bounds based on the regularity of the loss function, and their chained counterparts, which can be obtained by lifting the regularity assumption from the… ▽ More

    Submitted 30 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the 35th Conference on Learning Theory, PMLR 178:4212-4257, 2022

  8. arXiv:2110.11886  [pdf, other

    cs.LG stat.ML

    Conditionally Gaussian PAC-Bayes

    Authors: Eugenio Clerico, George Deligiannidis, Arnaud Doucet

    Abstract: Recent studies have empirically investigated different methods to train stochastic neural networks on a classification task by optimising a PAC-Bayesian bound via stochastic gradient descent. Most of these procedures need to replace the misclassification error with a surrogate loss, leading to a mismatch between the optimisation objective and the actual generalisation bound. The present paper prop… ▽ More

    Submitted 24 February, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:2311-2329, 2022

  9. arXiv:2106.09798  [pdf, other

    stat.ML cs.LG

    Wide stochastic networks: Gaussian limit and PAC-Bayesian training

    Authors: Eugenio Clerico, George Deligiannidis, Arnaud Doucet

    Abstract: The limit of infinite width allows for substantial simplifications in the analytical study of over-parameterised neural networks. With a suitable random initialisation, an extremely large network exhibits an approximately Gaussian behaviour. In the present work, we establish a similar result for a simple stochastic architecture whose parameters are random variables, holding both before and during… ▽ More

    Submitted 13 February, 2023; v1 submitted 17 June, 2021; originally announced June 2021.

    Journal ref: The 34th International Conference on Algorithmic Learning Theory (ALT 2023)

  10. arXiv:2010.12859  [pdf, other

    cs.LG stat.ML

    Stable ResNet

    Authors: Soufiane Hayou, Eugenio Clerico, Bobby He, George Deligiannidis, Arnaud Doucet, Judith Rousseau

    Abstract: Deep ResNet architectures have achieved state of the art performance on many tasks. While they solve the problem of gradient vanishing, they might suffer from gradient exploding as the depth becomes large (Yang et al. 2017). Moreover, recent results have shown that ResNet might lose expressivity as the depth goes to infinity (Yang et al. 2017, Hayou et al. 2019). To resolve these issues, we introd… ▽ More

    Submitted 18 March, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 43 pages, 4 figures