Skip to main content

Showing 1–8 of 8 results for author: Gerken, J E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17720  [pdf, ps, other

    cs.LG physics.ao-ph

    PEAR: Equal Area Weather Forecasting on the Sphere

    Authors: Hampus Linander, Christoffer Petersson, Daniel Persson, Jan E. Gerken

    Abstract: Machine learning methods for global medium-range weather forecasting have recently received immense attention. Following the publication of the Pangu Weather model, the first deep learning model to outperform traditional numerical simulations of the atmosphere, numerous models have been published in this domain, building on Pangu's success. However, all of these models operate on input data and pr… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.15376  [pdf, other

    cs.LG cond-mat.mes-hall

    Learning Chern Numbers of Topological Insulators with Gauge Equivariant Neural Networks

    Authors: Longde Huang, Oleksandr Balabanov, Hampus Linander, Mats Granath, Daniel Persson, Jan E. Gerken

    Abstract: Equivariant network architectures are a well-established tool for predicting invariant or equivariant quantities. However, almost all learning problems considered in this context feature a global symmetry, i.e. each point of the underlying space is transformed with the same group element, as opposed to a local ``gauge'' symmetry, where each point is transformed with a different group element, expo… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

  3. arXiv:2406.06504  [pdf, other

    cs.LG

    Equivariant Neural Tangent Kernels

    Authors: Philipp Misof, Pan Kessel, Jan E. Gerken

    Abstract: Little is known about the training dynamics of equivariant neural networks, in particular how it compares to data augmented training of their non-equivariant counterparts. Recently, neural tangent kernels (NTKs) have emerged as a powerful tool to analytically study the training dynamics of wide neural networks. In this work, we take an important step towards a theoretical understanding of training… ▽ More

    Submitted 31 January, 2025; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 16 pages + 20 pages appendices

  4. arXiv:2403.03103  [pdf, other

    cs.LG

    Emergent Equivariance in Deep Ensembles

    Authors: Jan E. Gerken, Pan Kessel

    Abstract: We show that deep ensembles become equivariant for all inputs and at all training times by simply using data augmentation. Crucially, equivariance holds off-manifold and for any architecture in the infinite width limit. The equivariance is emergent in the sense that predictions of individual ensemble members are not equivariant but their collective prediction is. Neural tangent kernel theory is us… ▽ More

    Submitted 15 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 11 pages + 17 pages appendices

  5. arXiv:2307.07313  [pdf, other

    cs.CV cs.LG

    HEAL-SWIN: A Vision Transformer On The Sphere

    Authors: Oscar Carlsson, Jan E. Gerken, Hampus Linander, Heiner Spieß, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: High-resolution wide-angle fisheye images are becoming more and more important for robotics applications such as autonomous driving. However, using ordinary convolutional neural networks or vision transformers on this data is problematic due to projection and distortion losses introduced when projecting to a rectangular grid on the plane. We introduce the HEAL-SWIN transformer, which combines the… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted as poster to CVPR 2024. Main body: 10 pages, 7 figures. Appendices: 9 pages, 6 figures

  6. arXiv:2206.05075  [pdf, other

    cs.LG cs.AI

    Diffeomorphic Counterfactuals with Generative Models

    Authors: Ann-Kathrin Dombrowski, Jan E. Gerken, Klaus-Robert Müller, Pan Kessel

    Abstract: Counterfactuals can explain classification decisions of neural networks in a human interpretable way. We propose a simple but effective method to generate such counterfactuals. More specifically, we perform a suitable diffeomorphic coordinate transformation and then perform gradient ascent in these coordinates to find counterfactuals which are classified with great confidence as a specified target… ▽ More

    Submitted 16 June, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  7. arXiv:2202.03990  [pdf, other

    cs.LG cs.CV

    Equivariance versus Augmentation for Spherical Images

    Authors: Jan E. Gerken, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are tr… ▽ More

    Submitted 12 July, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted to ICML2022, updated according to ICML-reviewer comments, 18 pages of which 9 in main body, 16 figures,

  8. arXiv:2105.13926  [pdf, other

    cs.LG cs.CV hep-th

    Geometric Deep Learning and Equivariant Neural Networks

    Authors: Jan E. Gerken, Jimmy Aronsson, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We survey the mathematical foundations of geometric deep learning, focusing on group equivariant and gauge equivariant neural networks. We develop gauge equivariant convolutional neural networks on arbitrary manifolds $\mathcal{M}$ using principal bundles with structure group $K$ and equivariant maps between sections of associated vector bundles. We also discuss group equivariant neural networks f… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: 57 pages