Skip to main content

Showing 1–15 of 15 results for author: Persson, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17720  [pdf, ps, other

    cs.LG physics.ao-ph

    PEAR: Equal Area Weather Forecasting on the Sphere

    Authors: Hampus Linander, Christoffer Petersson, Daniel Persson, Jan E. Gerken

    Abstract: Machine learning methods for global medium-range weather forecasting have recently received immense attention. Following the publication of the Pangu Weather model, the first deep learning model to outperform traditional numerical simulations of the atmosphere, numerous models have been published in this domain, building on Pangu's success. However, all of these models operate on input data and pr… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2505.16937  [pdf, ps, other

    math.NA cs.DS

    Quasi-optimal hierarchically semi-separable matrix approximation

    Authors: Noah Amsel, Tyler Chen, Feyza Duman Keles, Diana Halikias, Cameron Musco, Christopher Musco, David Persson

    Abstract: We present a randomized algorithm for producing a quasi-optimal hierarchically semi-separable (HSS) approximation to an $N\times N$ matrix $A$ using only matrix-vector products with $A$ and $A^T$. We prove that, using $O(k \log(N/k))$ matrix-vector products and ${O}(N k^2 \log(N/k))$ additional runtime, the algorithm returns an HSS matrix $B$ with rank-$k$ blocks whose expected Frobenius norm erro… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    MSC Class: 65F55; 68W20; 68W25

  3. arXiv:2505.16932  [pdf, ps, other

    cs.LG cs.AI cs.CL math.NA math.OC

    The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm

    Authors: Noah Amsel, David Persson, Christopher Musco, Robert M. Gower

    Abstract: Computing the polar decomposition and the related matrix sign function, has been a well-studied problem in numerical analysis for decades. More recently, it has emerged as an important subroutine in deep learning, particularly within the Muon optimization framework. However, the requirements in this setting differ significantly from those of traditional numerical analysis. In deep learning, method… ▽ More

    Submitted 3 June, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 34 pages, 8 figures, 4 algorithms

    MSC Class: 65F30; 68T07; 68N19 ACM Class: G.1.3; I.2.6; F.2.1; G.1.6

  4. arXiv:2504.20974  [pdf, ps, other

    cs.LG math.RT stat.ML

    Equivariant non-linear maps for neural networks on homogeneous spaces

    Authors: Elias Nyholm, Oscar Carlsson, Maurice Weiler, Daniel Persson

    Abstract: This paper presents a novel framework for non-linear equivariant neural network layers on homogeneous spaces. The seminal work of Cohen et al. on equivariant $G$-CNNs on homogeneous spaces characterized the representation theory of such layers in the linear setting, finding that they are given by convolutions with kernels satisfying so-called steerability constraints. Motivated by the empirical su… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: 45 pages,10 figures

  5. arXiv:2502.15376  [pdf, other

    cs.LG cond-mat.mes-hall

    Learning Chern Numbers of Topological Insulators with Gauge Equivariant Neural Networks

    Authors: Longde Huang, Oleksandr Balabanov, Hampus Linander, Mats Granath, Daniel Persson, Jan E. Gerken

    Abstract: Equivariant network architectures are a well-established tool for predicting invariant or equivariant quantities. However, almost all learning problems considered in this context feature a global symmetry, i.e. each point of the underlying space is transformed with the same group element, as opposed to a local ``gauge'' symmetry, where each point is transformed with a different group element, expo… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

  6. arXiv:2407.04686  [pdf, other

    cs.DS math.NA

    Near-optimal hierarchical matrix approximation from matrix-vector products

    Authors: Tyler Chen, Feyza Duman Keles, Diana Halikias, Cameron Musco, Christopher Musco, David Persson

    Abstract: We describe a randomized algorithm for producing a near-optimal hierarchical off-diagonal low-rank (HODLR) approximation to an $n\times n$ matrix $\mathbf{A}$, accessible only though matrix-vector products with $\mathbf{A}$ and $\mathbf{A}^{\mathsf{T}}$. We prove that, for the rank-$k$ HODLR approximation problem, our method achieves a $(1+β)^{\log(n)}$-optimal approximation in expected Frobenius… ▽ More

    Submitted 24 October, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Journal ref: SIAM Symposium on Discrete Algorithms (SODA 2025)

  7. arXiv:2401.14131  [pdf, other

    cs.LG math.DS

    Equivariant Manifold Neural ODEs and Differential Invariants

    Authors: Emma Andersdotter, Daniel Persson, Fredrik Ohlsson

    Abstract: In this paper, we develop a manifestly geometric framework for equivariant manifold neural ordinary differential equations (NODEs) and use it to analyse their modelling capabilities for symmetric data. First, we consider the action of a Lie group $G$ on a smooth manifold $M$ and establish the equivalence between equivariance of vector fields, symmetries of the corresponding Cauchy problems, and eq… ▽ More

    Submitted 10 October, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Additional co-author added. Substantially revised version. Added mathematical preliminary, numerical examples and discussion on practical use. Extended related work section. 29 pages, 8 figures

  8. arXiv:2311.14023  [pdf, ps, other

    math.NA cs.DS

    Algorithm-agnostic low-rank approximation of operator monotone matrix functions

    Authors: David Persson, Raphael A. Meyer, Christopher Musco

    Abstract: Low-rank approximation of a matrix function, $f(A)$, is an important task in computational mathematics. Most methods require direct access to $f(A)$, which is often considerably more expensive than accessing $A$. Persson and Kressner (SIMAX 2023) avoid this issue for symmetric positive semidefinite matrices by proposing funNyström, which first constructs a Nyström approximation to $A$ using subspa… ▽ More

    Submitted 4 July, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    MSC Class: 65F15; 65F55; 65F60; 68W25

  9. arXiv:2307.07313  [pdf, other

    cs.CV cs.LG

    HEAL-SWIN: A Vision Transformer On The Sphere

    Authors: Oscar Carlsson, Jan E. Gerken, Hampus Linander, Heiner Spieß, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: High-resolution wide-angle fisheye images are becoming more and more important for robotics applications such as autonomous driving. However, using ordinary convolutional neural networks or vision transformers on this data is problematic due to projection and distortion losses introduced when projecting to a rectangular grid on the plane. We introduce the HEAL-SWIN transformer, which combines the… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted as poster to CVPR 2024. Main body: 10 pages, 7 figures. Appendices: 9 pages, 6 figures

  10. arXiv:2202.11962  [pdf, other

    cs.LG cs.HC

    Large Scale Passenger Detection with Smartphone/Bus Implicit Interaction and Multisensory Unsupervised Cause-effect Learning

    Authors: Valentino Servizi, Dan R. Persson, Francisco C. Pereira, Hannah Villadsen, Per Bækgaard, Jeppe Rich, Otto A. Nielsen

    Abstract: Intelligent Transportation Systems (ITS) underpin the concept of Mobility as a Service (MaaS), which requires universal and seamless users' access across multiple public and private transportation systems while allowing operators' proportional revenue sharing. Current user sensing technologies such as Walk-in/Walk-out (WIWO) and Check-in/Check-out (CICO) have limited scalability for large-scale de… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 20 pages, 13 figures, 3 tables

  11. "Is not the truth the truth?": Analyzing the Impact of User Validations for Bus In/Out Detection in Smartphone-based Surveys

    Authors: Valentino Servizi., Dan R. Persson, Francisco C. Pereira, Hannah Villadsen, Per Bækgaard, Inon Peled, Otto A. Nielsen

    Abstract: Passenger flow allows the study of users' behavior through the public network and assists in designing new facilities and services. This flow is observed through interactions between passengers and infrastructure. For this task, Bluetooth technology and smartphones represent the ideal solution. The latter component allows users' identification, authentication, and billing, while the former allows… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 22 pages, 11 figures, 4 tables, 3 algorithms

  12. arXiv:2202.03990  [pdf, other

    cs.LG cs.CV

    Equivariance versus Augmentation for Spherical Images

    Authors: Jan E. Gerken, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are tr… ▽ More

    Submitted 12 July, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted to ICML2022, updated according to ICML-reviewer comments, 18 pages of which 9 in main body, 16 figures,

  13. arXiv:2105.13926  [pdf, other

    cs.LG cs.CV hep-th

    Geometric Deep Learning and Equivariant Neural Networks

    Authors: Jan E. Gerken, Jimmy Aronsson, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We survey the mathematical foundations of geometric deep learning, focusing on group equivariant and gauge equivariant neural networks. We develop gauge equivariant convolutional neural networks on arbitrary manifolds $\mathcal{M}$ using principal bundles with structure group $K$ and equivariant maps between sections of associated vector bundles. We also discuss group equivariant neural networks f… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: 57 pages

  14. arXiv:1404.7736  [pdf, other

    cs.IT

    Massive MIMO with 1-bit ADC

    Authors: Chiara Risi, Daniel Persson, Erik G. Larsson

    Abstract: We investigate massive multiple-input-multiple output (MIMO) uplink systems with 1-bit analog-to-digital converters (ADCs) on each receiver antenna. Receivers that rely on 1-bit ADC do not need energy-consuming interfaces such as automatic gain control (AGC). This decreases both ADC building and operational costs. Our design is based on maximal ratio combining (MRC), zero-forcing (ZF), and least s… ▽ More

    Submitted 30 April, 2014; originally announced April 2014.

  15. Scaling up MIMO: Opportunities and Challenges with Very Large Arrays

    Authors: Fredrik Rusek, Daniel Persson, Buon Kiong Lau, Erik G. Larsson, Thomas L. Marzetta, Ove Edfors, Fredrik Tufvesson

    Abstract: This paper surveys recent advances in the area of very large MIMO systems. With very large MIMO, we think of systems that use antenna arrays with an order of magnitude more elements than in systems being built today, say a hundred antennas or more. Very large MIMO entails an unprecedented number of antennas simultaneously serving a much smaller number of terminals. The disparity in number emerge… ▽ More

    Submitted 16 January, 2012; originally announced January 2012.

    Comments: Accepted for publication in the IEEE Signal Processing Magazine, October 2011