Skip to main content

Showing 1–2 of 2 results for author: Finke, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.19695  [pdf, ps, other

    stat.ML cs.LG math.PR

    Near-optimal estimates for the $\ell^p$-Lipschitz constants of deep random ReLU neural networks

    Authors: Sjoerd Dirksen, Patrick Finke, Paul Geuchen, Dominik Stöger, Felix Voigtlaender

    Abstract: This paper studies the $\ell^p$-Lipschitz constants of ReLU neural networks $Φ: \mathbb{R}^d \to \mathbb{R}$ with random parameters for $p \in [1,\infty]$. The distribution of the weights follows a variant of the He initialization and the biases are drawn from symmetric distributions. We derive high probability upper and lower bounds for wide networks that differ at most by a factor that is logari… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: The introduction will still be expanded with additional references

    MSC Class: 68T07; 26A16; 60B20; 60G15

  2. arXiv:2310.00327  [pdf, other

    stat.ML cs.LG math.ST

    Memorization With Neural Nets: Going Beyond the Worst Case

    Authors: Sjoerd Dirksen, Patrick Finke, Martin Genzel

    Abstract: In practice, deep neural networks are often able to easily interpolate their training data. To understand this phenomenon, many works have aimed to quantify the memorization capacity of a neural network architecture: the largest number of points such that the architecture can interpolate any placement of these points with any assignment of labels. For real-world data, however, one intuitively expe… ▽ More

    Submitted 6 December, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: The current version of the manuscript has been accepted to Journal of Machine Learning Research

    Journal ref: J. Mach. Learn. Res. 25:347 (2024) 1-38