Skip to main content

Showing 1–11 of 11 results for author: Rivasplata, O

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.19002  [pdf, other

    cs.LG cs.AI stat.ML

    Semi-pessimistic Reinforcement Learning

    Authors: Jin Zhu, Xin Zhou, Jiaang Yao, Gholamali Aminian, Omar Rivasplata, Simon Little, Lexin Li, Chengchun Shi

    Abstract: Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected data. However, it faces challenges of distributional shift, where the learned policy may encounter unseen scenarios not covered in the offline data. Additionally, numerous applications suffer from a scarcity of labeled reward data. Relying on labeled data alone often leads to a narrow state-action distribution,… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  2. arXiv:2403.16681  [pdf, other

    stat.ML cs.LG

    A note on generalization bounds for losses with finite moments

    Authors: Borja Rodríguez-Gálvez, Omar Rivasplata, Ragnar Thobaben, Mikael Skoglund

    Abstract: This paper studies the truncation method from Alquier [1] to derive high-probability PAC-Bayes bounds for unbounded losses with heavy tails. Assuming that the $p$-th moment is bounded, the resulting bounds interpolate between a slow rate $1 / \sqrt{n}$ when $p=2$, and a fast rate $1 / n$ when $p \to \infty$ and the loss is essentially bounded. Moreover, the paper derives a high-probability PAC-Bay… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 9 pages: 5 of main text, 1 of references, and 3 of appendices

  3. arXiv:2107.12685  [pdf, other

    cs.LG math.OC stat.ML

    On the Role of Optimization in Double Descent: A Least Squares Study

    Authors: Ilja Kuzborskij, Csaba Szepesvári, Omar Rivasplata, Amal Rannen-Triki, Razvan Pascanu

    Abstract: Empirically it has been observed that the performance of deep neural networks steadily improves as we increase model size, contradicting the classical view on overfitting and generalization. Recently, the double descent phenomena has been proposed to reconcile this observation with theory, suggesting that the test error has a second descent when the model becomes sufficiently overparameterized, as… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

  4. arXiv:2101.04671  [pdf, ps, other

    math.PR stat.ML

    A note on a confidence bound of Kuzborskij and Szepesvári

    Authors: Omar Rivasplata

    Abstract: In an interesting recent work, Kuzborskij and Szepesvári derived a confidence bound for functions of independent random variables, which is based on an inequality that relates concentration to squared perturbations of the chosen function. Kuzborskij and Szepesvári also established the PAC-Bayes-ification of their confidence bound. Two important aspects of their work are that the random variables c… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

  5. arXiv:2007.12911  [pdf, other

    cs.LG cs.CV stat.ML

    Tighter risk certificates for neural networks

    Authors: María Pérez-Ortiz, Omar Rivasplata, John Shawe-Taylor, Csaba Szepesvári

    Abstract: This paper presents an empirical study regarding training probabilistic neural networks using training objectives derived from PAC-Bayes bounds. In the context of probabilistic neural networks, the output of training is a probability distribution over network weights. We present two training objectives, used here for the first time in connection with training neural networks. These two training ob… ▽ More

    Submitted 22 September, 2021; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: New version includes: i) experiment showing the potential of the risk certificate for neural architecture search (Fig. 2); ii) experiments spanning uncertainty quantification and analysis of prior/posterior (Section 7.8); iii) an outline of the strengths of probabilistic neural networks trained by PBB (Section 7.9) and iv) a strengthened discussion on the connection to Bayesian learning

    Journal ref: Journal of Machine Learning Research, 2021

  6. arXiv:2006.13057  [pdf, ps, other

    stat.ML cs.LG stat.AP

    PAC-Bayes Analysis Beyond the Usual Bounds

    Authors: Omar Rivasplata, Ilja Kuzborskij, Csaba Szepesvari, John Shawe-Taylor

    Abstract: We focus on a stochastic learning model where the learner observes a finite set of training examples and the output of the learning process is a data-dependent distribution over a space of hypotheses. The learned data-dependent distribution is then used to make randomized predictions, and the high-level theme addressed here is guaranteeing the quality of predictions on examples that were not seen… ▽ More

    Submitted 28 December, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: In NeurIPS 2020. Version 3 is the final published paper. Note that this paper is an enhanced version of the short paper with the same title that was presented at the NeurIPS 2019 Workshop on Machine Learning with Guarantees. Important update: the PAC-Bayes type inequality for unbounded loss functions (Section 2.3) is new

  7. arXiv:2006.12156  [pdf, other

    cs.LG stat.ML

    Logarithmic Pruning is All You Need

    Authors: Laurent Orseau, Marcus Hutter, Omar Rivasplata

    Abstract: The Lottery Ticket Hypothesis is a conjecture that every large neural network contains a subnetwork that, when trained in isolation, achieves comparable performance to the large network. An even stronger conjecture has been proven recently: Every sufficiently overparameterized network contains a subnetwork that, at random initialization, but without training, achieves comparable accuracy to the tr… ▽ More

    Submitted 25 October, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  8. arXiv:2006.07279  [pdf, other

    stat.ML cs.LG math.ST

    PAC-Bayes unleashed: generalisation bounds with unbounded losses

    Authors: Maxime Haddouche, Benjamin Guedj, Omar Rivasplata, John Shawe-Taylor

    Abstract: We present new PAC-Bayesian generalisation bounds for learning problems with unbounded loss functions. This extends the relevance and applicability of the PAC-Bayes learning framework, where most of the existing literature focuses on supervised learning problems with a bounded loss function (typically assumed to take values in the interval [0;1]). In order to relax this assumption, we propose a ne… ▽ More

    Submitted 30 September, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 24 pages

    Journal ref: Entropy 2021

  9. arXiv:1908.07380  [pdf, other

    cs.LG stat.ML

    PAC-Bayes with Backprop

    Authors: Omar Rivasplata, Vikram M Tankasali, Csaba Szepesvari

    Abstract: We explore the family of methods "PAC-Bayes with Backprop" (PBB) to train probabilistic neural networks by minimizing PAC-Bayes bounds. We present two training objectives, one derived from a previously known PAC-Bayes bound, and a second one derived from a novel PAC-Bayes bound. Both training objectives are evaluated on MNIST and on various UCI data sets. Our experiments show two striking observat… ▽ More

    Submitted 4 October, 2019; v1 submitted 19 August, 2019; originally announced August 2019.

  10. Meta-analysis of Bayesian analyses

    Authors: Paul Blomstedt, Diego Mesquita, Omar Rivasplata, Jarno Lintusaari, Tuomas Sivula, Jukka Corander, Samuel Kaski

    Abstract: Meta-analysis aims to generalize results from multiple related statistical analyses through a combined analysis. While the natural outcome of a Bayesian study is a posterior distribution, traditional Bayesian meta-analyses proceed by combining summary statistics (i.e., point-valued estimates) computed from data. In this paper, we develop a framework for combining posterior distributions from multi… ▽ More

    Submitted 14 October, 2024; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: Published at Bayesian Analysis

  11. arXiv:1806.06827  [pdf, other

    stat.ML cs.LG

    PAC-Bayes bounds for stable algorithms with instance-dependent priors

    Authors: Omar Rivasplata, Emilio Parrado-Hernandez, John Shawe-Taylor, Shiliang Sun, Csaba Szepesvari

    Abstract: PAC-Bayes bounds have been proposed to get risk estimates based on a training sample. In this paper the PAC-Bayes approach is combined with stability of the hypothesis learned by a Hilbert space valued algorithm. The PAC-Bayes setting is used with a Gaussian prior centered at the expected output. Thus a novelty of our paper is using priors defined in terms of the data-generating distribution. Our… ▽ More

    Submitted 30 August, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: 16 pages, discussion of theory and experiments in the main body, detailed proofs and experimental details in the appendices