Skip to main content

Showing 1–7 of 7 results for author: Ortega, L A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.04177  [pdf, other

    cs.LG stat.ML

    Fixed-Mean Gaussian Processes for Post-hoc Bayesian Deep Learning

    Authors: Luis A. Ortega, Simón Rodríguez-Santana, Daniel Hernández-Lobato

    Abstract: Recently, there has been an increasing interest in performing post-hoc uncertainty estimation about the predictions of pre-trained deep neural networks (DNNs). Given a pre-trained DNN via back-propagation, these methods enhance the original network by adding output confidence measures, such as error bars, without compromising its initial accuracy. In this context, we introduce a novel family of sp… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 12 pages, 6 figures and 2 tables. Submitted to IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

  2. arXiv:2401.01148  [pdf, other

    stat.ML cs.LG

    PAC-Bayes-Chernoff bounds for unbounded losses

    Authors: Ioar Casado, Luis A. Ortega, Aritz Pérez, Andrés R. Masegosa

    Abstract: We introduce a new PAC-Bayes oracle bound for unbounded losses that extends Cramér-Chernoff bounds to the PAC-Bayesian setting. The proof technique relies on controlling the tails of certain random variables involving the Cramér transform of the loss. Our approach naturally leverages properties of Cramér-Chernoff bounds, such as exact optimization of the free parameter in many PAC-Bayes bounds. We… ▽ More

    Submitted 30 October, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Camera-ready version for NeurIPS 2024

  3. arXiv:2310.01189  [pdf, other

    stat.ML cs.LG

    If there is no underfitting, there is no Cold Posterior Effect

    Authors: Yijie Zhang, Yi-Shan Wu, Luis A. Ortega, Andrés R. Masegosa

    Abstract: The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$). As the Bayesian posterior is known to be optimal under perfect model specification, many recent works have studied the presence of CPE as a model misspecification p… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 9 pages, 3 figures, ICLR 2024

  4. arXiv:2306.10947  [pdf, other

    cs.LG math.ST stat.ML

    PAC-Chernoff Bounds: Understanding Generalization in the Interpolation Regime

    Authors: Andrés R. Masegosa, Luis A. Ortega

    Abstract: This paper introduces a distribution-dependent PAC-Chernoff bound that exhibits perfect tightness for interpolators, even within over-parameterized model classes. This bound, which relies on basic principles of Large Deviation Theory, defines a natural measure of the smoothness of a model, characterized by simple real-valued functions. Building upon this bound and the new concept of smoothness, we… ▽ More

    Submitted 10 February, 2025; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 60 pages, 12 figures, published at JAIR 2025

    Journal ref: Journal of Artificial Intelligence Research 82 (2025) 503-562

  5. arXiv:2302.12565  [pdf, other

    stat.ML cs.LG

    Variational Linearized Laplace Approximation for Bayesian Deep Learning

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: The Linearized Laplace Approximation (LLA) has been recently used to perform uncertainty estimation on the predictions of pre-trained deep neural networks (DNNs). However, its widespread application is hindered by significant computational costs, particularly in scenarios with a large number of training points or DNN parameters. Consequently, additional approximations of LLA, such as Kronecker-fac… ▽ More

    Submitted 22 May, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 22 pages, 8 figures, ICML 2024

    Journal ref: PMLR 235 (2024)

  6. arXiv:2207.10673  [pdf, other

    stat.ML cs.LG stat.CO

    Correcting Model Bias with Sparse Implicit Processes

    Authors: Simón Rodríguez Santana, Luis A. Ortega, Daniel Hernández-Lobato, Bryan Zaldívar

    Abstract: Model selection in machine learning (ML) is a crucial part of the Bayesian learning procedure. Model choice may impose strong biases on the resulting predictions, which can hinder the performance of methods such as Bayesian neural networks and neural samplers. On the other hand, newly proposed approaches for Bayesian ML exploit features of approximate inference in function space with implicit stoc… ▽ More

    Submitted 8 August, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 4 pages, 1 double figure. Included in ICML 2022 workshop "Beyond Bayes: Paths Towards Universal Reasoning Systems". Extension of previous work on Sparse Implicit Processes (arXiv:2110.07618)

  7. arXiv:2206.06720  [pdf, other

    stat.ML cs.LG

    Deep Variational Implicit Processes

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: Implicit processes (IPs) are a generalization of Gaussian processes (GPs). IPs may lack a closed-form expression but are easy to sample from. Examples include, among others, Bayesian neural networks or neural samplers. IPs can be used as priors over functions, resulting in flexible models with well-calibrated prediction uncertainty estimates. Methods based on IPs usually carry out function-space a… ▽ More

    Submitted 16 February, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 19 pages, 6 figures, ICLR 2023