Skip to main content

Showing 1–3 of 3 results for author: Steffen, M F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.14027  [pdf, other

    stat.ML cs.LG hep-ph stat.CO

    AdamMCMC: Combining Metropolis Adjusted Langevin with Momentum-based Optimization

    Authors: Sebastian Bieringer, Gregor Kasieczka, Maximilian F. Steffen, Mathias Trabs

    Abstract: Uncertainty estimation is a key issue when considering the application of deep neural network methods in science and engineering. In this work, we introduce a novel algorithm that quantifies epistemic uncertainty via Monte Carlo sampling from a tempered posterior distribution. It combines the well established Metropolis Adjusted Langevin Algorithm (MALA) with momentum-based optimization using Adam… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 16 pages, 5 figures; adapted Theorem 2

  2. arXiv:2310.09335  [pdf, ps, other

    stat.ML cs.LG math.ST

    The surrogate Gibbs-posterior of a corrected stochastic MALA: Towards uncertainty quantification for neural networks

    Authors: Sebastian Bieringer, Gregor Kasieczka, Maximilian F. Steffen, Mathias Trabs

    Abstract: MALA is a popular gradient-based Markov chain Monte Carlo method to access the Gibbs-posterior distribution. Stochastic MALA (sMALA) scales to large data sets, but changes the target distribution from the Gibbs-posterior to a surrogate posterior which only exploits a reduced sample size. We introduce a corrected stochastic MALA (csMALA) with a simple correction term for which distance between the… ▽ More

    Submitted 3 July, 2025; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: The first version of this manuscript was entitled "Statistical guarantees for stochastic Metropolis-Hastings''. Some preliminary results were initially presented in the first version of arXiv:2204.12392, but have been moved to this manuscript, where they have been further developed

  3. arXiv:2204.12392  [pdf, ps, other

    math.ST stat.ML

    A PAC-Bayes oracle inequality for sparse neural networks

    Authors: Maximilian F. Steffen, Mathias Trabs

    Abstract: We study the Gibbs posterior distribution for sparse deep neural nets in a nonparametric regression setting. The posterior can be accessed via Metropolis-adjusted Langevin algorithms. Using a mixture over uniform priors on sparse sets of network weights, we prove an oracle inequality which shows that the method adapts to the unknown regularity and hierarchical structure of the regression function.… ▽ More

    Submitted 2 October, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    MSC Class: 62G08; 62F15; 68T05