Showing 1–9 of 9 results for author: Roith, T

Searching in archive cs.
  1. arXiv:2501.12189  [pdf, other]

    math.OC cs.LG

    MirrorCBO: A consensus-based optimization method in the spirit of mirror descent

    Authors: Leon Bungert, Franca Hoffmann, Doh Yeon Kim, Tim Roith

    Abstract: In this work we propose MirrorCBO, a consensus-based optimization (CBO) method which generalizes standard CBO in the same way that mirror descent generalizes gradient descent. For this we apply the CBO methodology to a swarm of dual particles and retain the primal particle positions by applying the inverse of the mirror map, which we parametrize as the subdifferential of a strongly convex function…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: 64 pages, 18 figures, 19 tables

    MSC Class: 35B40; 35Q84; 35Q89; 35Q90; 65K10; 90C26; 90C56

  2. arXiv:2406.05376  [pdf, other]

    cs.LG math.AP

    Adversarial flows: A gradient flow characterization of adversarial attacks

    Authors: Lukas Weigand, Tim Roith, Martin Burger

    Abstract: A popular method to perform adversarial attacks on neuronal networks is the so-called fast gradient sign method and its iterative variant. In this paper, we interpret this method as an explicit Euler discretization of a differential inclusion, where we also show convergence of the discretization to the associated gradient flow. To do so, we consider the concept of p-curves of maximal slope in the…

    Submitted 11 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    MSC Class: 49Q20; 34A60; 68Q32; 65K15

  3. arXiv:2312.02671  [pdf, ps, other]

    stat.ML cs.LG math.FA math.NA

    Learning a Sparse Representation of Barron Functions with the Inverse Scale Space Flow

    Authors: Tjeerd Jan Heeringa, Tim Roith, Christoph Brune, Martin Burger

    Abstract: This paper presents a method for finding a sparse representation of Barron functions. Specifically, given an $L^2$ function $f$, the inverse scale space flow is used to find a sparse measure $μ$ minimising the $L^2$ loss between the Barron function associated to the measure $μ$ and the function $f$. The convergence properties of this method are analysed in an ideal setting and in the cases of meas…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 30 pages, 0 figures

    MSC Class: 47A52; 68T07; 65K10; 90C25

    ACM Class: I.2.6; F.2.1; G.1.6

  4. arXiv:2304.01227  [pdf, other]

    cs.CV cs.LG math.NA

    Resolution-Invariant Image Classification based on Fourier Neural Operators

    Authors: Samira Kabri, Tim Roith, Daniel Tenbrinck, Martin Burger

    Abstract: In this paper we investigate the use of Fourier Neural Operators (FNOs) for image classification in comparison to standard Convolutional Neural Networks (CNNs). Neural operators are a discretization-invariant generalization of neural networks to approximate operators between infinite dimensional function spaces. FNOs - which are neural operators with a specific parametrization - have been applied…

    Submitted 2 April, 2023; originally announced April 2023.

    MSC Class: 68T45; 65T40

  5. arXiv:2111.12370  [pdf, other]

    math.NA cs.LG math.AP

    Uniform Convergence Rates for Lipschitz Learning on Graphs

    Authors: Leon Bungert, Jeff Calder, Tim Roith

    Abstract: Lipschitz learning is a graph-based semi-supervised learning method where one extends labels from a labeled to an unlabeled data set by solving the infinity Laplace equation on a weighted graph. In this work we prove uniform convergence rates for solutions of the graph infinity Laplace equation as the number of vertices grows to infinity. Their continuum limits are absolutely minimizing Lipschitz…

    Submitted 29 June, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    MSC Class: 35J20; 35R02; 65N12; 68T05

    Journal ref: IMA Journal of Numerical Analysis, 2022

  6. arXiv:2106.02479  [pdf, other]

    cs.LG cs.NE math.OC

    Neural Architecture Search via Bregman Iterations

    Authors: Leon Bungert, Tim Roith, Daniel Tenbrinck, Martin Burger

    Abstract: We propose a novel strategy for Neural Architecture Search (NAS) based on Bregman iterations. Starting from a sparse neural network our gradient-based one-shot algorithm gradually adds relevant parameters in an inverse scale space manner. This allows the network to choose the best architecture in the search space which makes it well-designed for a given task, e.g., by adding neurons or skip connec…

    Submitted 4 June, 2021; originally announced June 2021.

    MSC Class: 65K10; 68T05; 90C26

    ACM Class: I.2.6; F.2.1; G.1.6

  7. arXiv:2105.04319  [pdf, other]

    cs.LG math.NA math.OC

    A Bregman Learning Framework for Sparse Neural Networks

    Authors: Leon Bungert, Tim Roith, Daniel Tenbrinck, Martin Burger

    Abstract: We propose a learning framework based on stochastic Bregman iterations, also known as mirror descent, to train sparse neural networks with an inverse scale space approach. We derive a baseline algorithm called LinBreg, an accelerated version using momentum, and AdaBreg, which is a Bregmanized generalization of the Adam algorithm. In contrast to established methods for sparse training the proposed…

    Submitted 17 February, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 43 pages, 5 figures, some minor modifications, weakened assumptions

    MSC Class: 65K10; 68T05; 90C26

    ACM Class: I.2.6; F.2.1; G.1.6

    Journal ref: Journal of Machine Learning Research, 23(192), 1-43, 2022

  8. arXiv:2103.12531  [pdf, other]

    cs.LG math.OC stat.ML

    CLIP: Cheap Lipschitz Training of Neural Networks

    Authors: Leon Bungert, René Raab, Tim Roith, Leo Schwinn, Daniel Tenbrinck

    Abstract: Despite the large success of deep neural networks (DNN) in recent years, most neural networks still lack mathematical guarantees in terms of stability. For instance, DNNs are vulnerable to small or even imperceptible input perturbations, so called adversarial examples, that can cause false predictions. This instability can have severe consequences in applications which influence the health and saf…

    Submitted 31 October, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: 12 pages, 2 figures, fixed a small mistake in the proof of Proposition 3, published at SSVM 2021

    MSC Class: 65K10; 68T07

    Journal ref: International Conference on Scale Space and Variational Methods in Computer Vision, 307-319, 2021

  9. arXiv:2012.03772  [pdf, ps, other]

    cs.LG math.AP math.NA stat.ML

    Continuum Limit of Lipschitz Learning on Graphs

    Authors: Tim Roith, Leon Bungert

    Abstract: Tackling semi-supervised learning problems with graph-based methods has become a trend in recent years since graphs can represent all kinds of data and provide a suitable framework for studying continuum limits, e.g., of differential operators. A popular strategy here is $p$-Laplacian learning, which poses a smoothness condition on the sought inference function on the set of unlabeled data. For…

    Submitted 29 November, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: 39 pages, added acknowledgement, corrected typos

    MSC Class: 35J20; 35R02; 65N12; 68T05

    Journal ref: Found Comput Math (2022)