Skip to main content

Showing 1–4 of 4 results for author: Schröder, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.13999  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Asymptotics of Learning with Deep Structured (Random) Features

    Authors: Dominik Schröder, Daniil Dmitriev, Hugo Cui, Bruno Loureiro

    Abstract: For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of training samples are proportionally large. This characterization is formulated in terms of the population covariance of the features. Our work is partially motivated… ▽ More

    Submitted 10 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ICML camera-ready version

  2. arXiv:2302.00401  [pdf, other

    stat.ML cs.LG

    Deterministic equivalent and error universality of deep random features learning

    Authors: Dominik Schröder, Hugo Cui, Daniil Dmitriev, Bruno Loureiro

    Abstract: This manuscript considers the problem of learning a random Gaussian network function using a fully connected network with frozen intermediate layers and trainable readout layer. This problem can be seen as a natural generalization of the widely studied random features model to deeper architectures. First, we prove Gaussian universality of the test error in a ridge regression setting where the lear… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  3. arXiv:2105.05115  [pdf, ps, other

    stat.ML cs.LG math.PR

    Analysis of One-Hidden-Layer Neural Networks via the Resolvent Method

    Authors: Vanessa Piccolo, Dominik Schröder

    Abstract: In this work, we investigate the asymptotic spectral density of the random feature matrix $M = Y Y^\ast$ with $Y = f(WX)$ generated by a single-hidden-layer neural network, where $W$ and $X$ are random rectangular matrices with i.i.d. centred entries and $f$ is a non-linear smooth function which is applied entry-wise. We prove that the Stieltjes transform of the limiting spectral distribution appr… ▽ More

    Submitted 11 November, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: Final version, NeurIPS 2021. 22 pages, 4 figures

    MSC Class: 60B20; 68T07

  4. Quantifying the Preferential Direction of the Model Gradient in Adversarial Training With Projected Gradient Descent

    Authors: Ricardo Bigolin Lanfredi, Joyce D. Schroeder, Tolga Tasdizen

    Abstract: Adversarial training, especially projected gradient descent (PGD), has proven to be a successful approach for improving robustness against adversarial attacks. After adversarial training, gradients of models with respect to their inputs have a preferential direction. However, the direction of alignment is not mathematically well established, making it difficult to evaluate quantitatively. We propo… ▽ More

    Submitted 19 April, 2023; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: This paper was published in Pattern Recognition