Showing 1–50 of 93 results for author: Kutyniok, G

Searching in archive cs.
  1. arXiv:2506.22007  [pdf, ps, other]

    cs.CV

    RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation

    Authors: Liudi Yang, Yang Bai, George Eskandar, Fengyi Shen, Mohammad Altillawi, Dong Chen, Soumajit Majumder, Ziyuan Liu, Gitta Kutyniok, Abhinav Valada

    Abstract: We address the problem of generating long-horizon videos for robotic manipulation tasks. Text-to-video diffusion models have made significant progress in photorealism, language understanding, and motion generation but struggle with long-horizon robotic tasks. Recent works use video diffusion models for high-quality simulation data and predictive rollouts in robot planning. However, these works pre…

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 8 pages, 6 figures

  2. arXiv:2506.08632  [pdf, other]

    cs.CV

    RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping

    Authors: Yang Bai, Liudi Yang, George Eskandar, Fengyi Shen, Dong Chen, Mohammad Altillawi, Ziyuan Liu, Gitta Kutyniok

    Abstract: Recent advancements in generative models have revolutionized video synthesis and editing. However, the scarcity of diverse, high-quality datasets continues to hinder video-conditioned robotic learning, limiting cross-platform generalization. In this work, we address the challenge of swapping a robotic arm in one video with another: a key step for cross-embodiment learning. Unlike previous methods t…

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2505.21423  [pdf, other]

    cs.LG stat.ML

    Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization

    Authors: Vit Fojtik, Maria Matveev, Hung-Hsu Chou, Gitta Kutyniok, Johannes Maly

    Abstract: A widely believed explanation for the remarkable generalization capacities of overparameterized neural networks is that the optimization algorithms used for training induce an implicit bias towards benign solutions. To grasp this theoretically, recent works examine gradient descent and its variants in simplified training settings, often assuming vanishing learning rates. These studies reveal vario…

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.19827  [pdf, other]

    cs.LG cs.AI

    Revisiting Glorot Initialization for Long-Range Linear Recurrences

    Authors: Noga Bar, Mariia Seleznova, Yotam Alexander, Gitta Kutyniok, Raja Giryes

    Abstract: Proper initialization is critical for Recurrent Neural Networks (RNNs), particularly in long-range reasoning tasks, where repeated application of the same weight matrix can cause vanishing or exploding signals. A common baseline for linear recurrences is Glorot initialization, designed to ensure stable signal propagation--but derived under the infinite-width, fixed-length regime--an unrealistic se…

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2505.18023  [pdf, ps, other]

    cs.LG cs.NE

    Time to Spike? Understanding the Representational Power of Spiking Neural Networks in Discrete Time

    Authors: Duc Anh Nguyen, Ernesto Araya, Adalbert Fono, Gitta Kutyniok

    Abstract: Recent years have seen significant progress in developing spiking neural networks (SNNs) as a potential solution to the energy challenges posed by conventional artificial neural networks (ANNs). However, our theoretical understanding of SNNs remains relatively limited compared to the ever-growing body of literature on ANNs. In this paper, we study a discrete-time model of SNNs based on leaky integ…

    Submitted 13 June, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  6. arXiv:2505.16017  [pdf, ps, other]

    cs.LG cs.CV

    GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection

    Authors: Mariia Seleznova, Hung-Hsu Chou, Claudio Mayrink Verdun, Gitta Kutyniok

    Abstract: We introduce GradPCA, an Out-of-Distribution (OOD) detection method that exploits the low-rank structure of neural network gradients induced by Neural Tangent Kernel (NTK) alignment. GradPCA applies Principal Component Analysis (PCA) to gradient class-means, achieving more consistent performance than existing methods across standard image classification benchmarks. We provide a theoretical perspec…

    Submitted 21 May, 2025; originally announced May 2025.

  7. arXiv:2505.13186  [pdf, other]

    cs.RO cs.LG

    Interpretable Robotic Friction Learning via Symbolic Regression

    Authors: Philipp Scholl, Alexander Dietrich, Sebastian Wolf, Jinoh Lee, Alin Albu-Schäffer, Gitta Kutyniok, Maged Iskandar

    Abstract: Accurately modeling the friction torque in robotic joints has long been challenging due to the request for a robust mathematical description. Traditional model-based approaches are often labor-intensive, requiring extensive experiments and expert knowledge, and they are difficult to adapt to new scenarios and dependencies. On the other hand, data-driven methods based on neural networks are easier…

    Submitted 19 May, 2025; originally announced May 2025.

  8. arXiv:2505.11298  [pdf, ps, other]

    cs.LG

    Graph Representational Learning: When Does More Expressivity Hurt Generalization?

    Authors: Sohir Maskey, Raffaele Paolino, Fabian Jogl, Gitta Kutyniok, Johannes F. Lutzeyer

    Abstract: Graph Neural Networks (GNNs) are powerful tools for learning on structured data, yet the relationship between their expressivity and predictive performance remains unclear. We introduce a family of premetrics that capture different degrees of structural similarity between graphs and relate these similarities to generalization, and consequently, the performance of expressive GNNs. By considering a…

    Submitted 16 May, 2025; originally announced May 2025.

  9. arXiv:2504.18433  [pdf, other]

    cs.LG stat.ML

    An Axiomatic Assessment of Entropy- and Variance-based Uncertainty Quantification in Regression

    Authors: Christopher Bülte, Yusuf Sale, Timo Löhr, Paul Hofman, Gitta Kutyniok, Eyke Hüllermeier

    Abstract: Uncertainty quantification (UQ) is crucial in machine learning, yet most (axiomatic) studies of uncertainty measures focus on classification, leaving a gap in regression settings with limited formal justification and evaluations. In this work, we introduce a set of axioms to rigorously assess measures of aleatoric, epistemic, and total uncertainty in supervised regression. By utilizing a predictiv…

    Submitted 16 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

  10. arXiv:2504.05471  [pdf, other]

    cs.LG

    Graph Neural Networks for Enhancing Ensemble Forecasts of Extreme Rainfall

    Authors: Christopher Bülte, Sohir Maskey, Philipp Scholl, Jonas von Berg, Gitta Kutyniok

    Abstract: Climate change is increasing the occurrence of extreme precipitation events, threatening infrastructure, agriculture, and public safety. Ensemble prediction systems provide probabilistic forecasts but exhibit biases and difficulties in capturing extreme weather. While post-processing techniques aim to enhance forecast accuracy, they rarely focus on precipitation, which exhibits complex spatial dep…

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Accepted paper at ICLR 2025 - Tackling Climate Change with Machine Learning Workshop (https://www.climatechange.ai/events/iclr2025)

  11. arXiv:2503.02013  [pdf, ps, other]

    cs.NE

    Sustainable AI: Mathematical Foundations of Spiking Neural Networks

    Authors: Adalbert Fono, Manjot Singh, Ernesto Araya, Philipp C. Petersen, Holger Boche, Gitta Kutyniok

    Abstract: Deep learning's success comes with growing energy demands, raising concerns about the long-term sustainability of the field. Spiking neural networks, inspired by biological neurons, offer a promising alternative with potential computational and energy-efficiency gains. This article examines the computational properties of spiking networks through the lens of learning theory, focusing on expressivi…

    Submitted 3 March, 2025; originally announced March 2025.

  12. arXiv:2502.12902  [pdf, other]

    cs.LG

    Probabilistic neural operators for functional uncertainty quantification

    Authors: Christopher Bülte, Philipp Scholl, Gitta Kutyniok

    Abstract: Neural operators aim to approximate the solution operator of a system of differential equations purely from data. They have shown immense success in modeling complex dynamical systems across various domains. However, the occurrence of uncertainties inherent in both model and data has so far rarely been taken into account -- a critical limitation in complex, chaotic systems such as weather…

    Submitted 27 March, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Journal ref: Transactions on Machine Learning Research, 2025. ISSN 2835-8856

  13. arXiv:2410.09938  [pdf, other]

    cs.LG math.NA

    Robust identifiability for symbolic recovery of differential equations

    Authors: Hillary Hauger, Philipp Scholl, Gitta Kutyniok

    Abstract: Recent advancements in machine learning have transformed the discovery of physical laws, moving from manual derivation to data-driven methods that simultaneously learn both the structure and parameters of governing equations. This shift introduces new challenges regarding the validity of the discovered equations, particularly concerning their uniqueness and, hence, identifiability. While the issue…

    Submitted 13 October, 2024; originally announced October 2024.

  14. arXiv:2408.06212  [pdf, other]

    cs.LG cs.CC

    Computability of Classification and Deep Learning: From Theoretical Limits to Practical Feasibility through Quantization

    Authors: Holger Boche, Vit Fojtik, Adalbert Fono, Gitta Kutyniok

    Abstract: The unwavering success of deep learning in the past decade led to the increasing prevalence of deep learning methods in various application fields. However, the downsides of deep learning, most prominently its lack of trustworthiness, may not be compatible with safety-critical or high-responsibility applications requiring stricter performance guarantees. Recently, several instances of deep learnin…

    Submitted 12 August, 2024; originally announced August 2024.

    MSC Class: 68T07; 68T05; 03D80; 65D15

  15. arXiv:2404.03473  [pdf, ps, other]

    cs.LG

    Generalization Bounds for Message Passing Networks on Mixture of Graphons

    Authors: Sohir Maskey, Gitta Kutyniok, Ron Levie

    Abstract: We study the generalization capabilities of Message Passing Neural Networks (MPNNs), a prevalent class of Graph Neural Networks (GNN). We derive generalization bounds specifically for MPNNs with normalized sum aggregation and mean aggregation. Our analysis is based on a data generation model incorporating a finite set of template graphons. Each graph within this framework is generated by sampling…

    Submitted 4 April, 2024; originally announced April 2024.

  16. arXiv:2403.13749  [pdf, other]

    cs.LG

    Weisfeiler and Leman Go Loopy: A New Hierarchy for Graph Representational Learning

    Authors: Raffaele Paolino, Sohir Maskey, Pascal Welke, Gitta Kutyniok

    Abstract: We introduce $r$-loopy Weisfeiler-Leman ($r$-$\ell{}$WL), a novel hierarchy of graph isomorphism tests and a corresponding GNN framework, $r$-$\ell{}$MPNN, that can count cycles up to length $r + 2$. Most notably, we show that $r$-$\ell{}$WL can count homomorphisms of cactus graphs. This strictly extends classical 1-WL, which can only count homomorphisms of trees and, in fact, is incomparable to…

    Submitted 6 November, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: NeurIPS 2024 (Oral). The first two authors contributed equally

  17. arXiv:2402.07153  [pdf, other]

    math.NA cs.AI

    Error Estimation for Physics-informed Neural Networks Approximating Semilinear Wave Equations

    Authors: Beatrice Lorenz, Aras Bacho, Gitta Kutyniok

    Abstract: This paper provides rigorous error bounds for physics-informed neural networks approximating the semilinear wave equation. We provide bounds for the generalization and training error in terms of the width of the network's layers and the number of training points for a tanh neural network with two hidden layers. Our main result is a bound of the total error in the $H^1([0,T];L^2(Ω))$-norm in terms…

    Submitted 5 March, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    MSC Class: 35L05; 68T07; 65M15; 35G50; 35A35

  18. arXiv:2401.10310  [pdf, other]

    cs.LG cs.AI cs.CC

    Mathematical Algorithm Design for Deep Learning under Societal and Judicial Constraints: The Algorithmic Transparency Requirement

    Authors: Holger Boche, Adalbert Fono, Gitta Kutyniok

    Abstract: Deep learning still has drawbacks in terms of trustworthiness, which describes a comprehensible, fair, safe, and reliable method. To mitigate the potential risk of AI, clear obligations associated to trustworthiness have been proposed via regulatory guidelines, e.g., in the European AI Act. Therefore, a central question is to what extent trustworthy deep learning can be realized. Establishing the…

    Submitted 18 January, 2024; originally announced January 2024.

  19. arXiv:2312.11548  [pdf, other]

    cs.CV

    Learning Interpretable Queries for Explainable Image Classification with Information Pursuit

    Authors: Stefan Kolek, Aditya Chattopadhyay, Kwan Ho Ryan Chan, Hector Andrade-Loarca, Gitta Kutyniok, René Vidal

    Abstract: Information Pursuit (IP) is an explainable prediction algorithm that greedily selects a sequence of interpretable queries about the data in order of information gain, updating its posterior at each step based on observed query-answer pairs. The standard paradigm uses hand-crafted dictionaries of potential data queries curated by a domain expert or a large language model after a human prompt. Howev…

    Submitted 16 December, 2023; originally announced December 2023.

  20. arXiv:2310.16763  [pdf, other]

    cs.CL cs.AI cs.LG

    SuperHF: Supervised Iterative Learning from Human Feedback

    Authors: Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

    Abstract: While large language models demonstrate remarkable capabilities, they often present challenges in terms of safety, alignment with human values, and stability during training. Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). SFT is simple and robust, powering a host of open-source models, while RL…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to the Socially Responsible Language Modelling Research (SoLaR) workshop at NeurIPS 2023

  21. Learning-based adaption of robotic friction models

    Authors: Philipp Scholl, Maged Iskandar, Sebastian Wolf, Jinoh Lee, Aras Bacho, Alexander Dietrich, Alin Albu-Schäffer, Gitta Kutyniok

    Abstract: In the Fourth Industrial Revolution, wherein artificial intelligence and the automation of machines occupy a central role, the deployment of robots is indispensable. However, the manufacturing process using robots, especially in collaboration with humans, is highly intricate. In particular, modeling the friction torque in robotic joints is a longstanding problem due to the lack of a good mathemati…

    Submitted 9 May, 2025; v1 submitted 25 October, 2023; originally announced October 2023.

    Journal ref: Robotics and Computer-Integrated Manufacturing 2024

  22. arXiv:2310.07658  [pdf, other]

    eess.SP cs.LG cs.NI

    The First Pathloss Radio Map Prediction Challenge

    Authors: Çağkan Yapar, Fabian Jaensch, Ron Levie, Gitta Kutyniok, Giuseppe Caire

    Abstract: To foster research and facilitate fair comparisons among recently proposed pathloss radio map prediction methods, we have launched the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this short overview paper, we briefly describe the pathloss prediction problem, the provided datasets, the challenge task and the challenge evaluation methodology. Finally, we present the results of the…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: ICASSP 2023

  23. arXiv:2310.05537  [pdf, other]

    cs.AI cs.LG

    ParFam -- (Neural Guided) Symbolic Regression Based on Continuous Global Optimization

    Authors: Philipp Scholl, Katharina Bieker, Hillary Hauger, Gitta Kutyniok

    Abstract: The problem of symbolic regression (SR) arises in many different applications, such as identifying physical laws or deriving mathematical equations describing the behavior of financial markets from given data. Various methods exist to address the problem of SR, often based on genetic programming. However, these methods are usually complicated and involve various hyperparameters. In this paper, we…

    Submitted 6 May, 2025; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Code: https://github.com/Philipp238/parfam

  24. arXiv:2308.08218  [pdf, other]

    cs.NE

    Expressivity of Spiking Neural Networks

    Authors: Manjot Singh, Adalbert Fono, Gitta Kutyniok

    Abstract: The synergy between spiking neural networks and neuromorphic hardware holds promise for the development of energy-efficient AI applications. Inspired by this potential, we revisit the foundational aspects to study the capabilities of spiking neural networks where information is encoded in the firing time of neurons. Under the Spike Response Model as a mathematical model of a spiking neuron with a…

    Submitted 15 March, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

  25. arXiv:2308.01766  [pdf, other]

    cs.CV math.AP

    Neural Poisson Surface Reconstruction: Resolution-Agnostic Shape Reconstruction from Point Clouds

    Authors: Hector Andrade-Loarca, Julius Hege, Daniel Cremers, Gitta Kutyniok

    Abstract: We introduce Neural Poisson Surface Reconstruction (nPSR), an architecture for shape reconstruction that addresses the challenge of recovering 3D shapes from points. Traditional deep neural networks face challenges with common 3D shape discretization techniques due to their computational complexity at higher resolutions. To overcome this, we leverage Fourier Neural Operators to solve the Poisson e…

    Submitted 28 November, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

  26. arXiv:2307.02301  [pdf, other]

    cs.LG cs.CL stat.ML

    Sumformer: Universal Approximation for Efficient Transformers

    Authors: Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok

    Abstract: Natural language processing (NLP) made an impressive jump with the introduction of Transformers. ChatGPT is one of the most famous examples, changing the perception of the possibilities of AI even outside the research community. However, besides the impressive performance, the quadratic time and space complexity of Transformers with respect to sequence length pose significant limitations for handl…

    Submitted 5 July, 2023; originally announced July 2023.

  27. arXiv:2307.01301  [pdf, ps, other]

    cs.AI quant-ph

    Reliable AI: Does the Next Generation Require Quantum Computing?

    Authors: Aras Bacho, Holger Boche, Gitta Kutyniok

    Abstract: In this survey, we aim to explore the fundamental question of whether the next generation of artificial intelligence requires quantum computing. Artificial intelligence is increasingly playing a crucial role in many aspects of our daily lives and is central to the fourth industrial revolution. It is therefore imperative that artificial intelligence is reliable and trustworthy. However, there are s…

    Submitted 6 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    MSC Class: 15A29; 35J05; 46N10; 68Q04; 68Q12; 68Q17; 68Q25

  28. arXiv:2306.10066  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    On the Interplay of Subset Selection and Informed Graph Neural Networks

    Authors: Niklas Breustedt, Paolo Climaco, Jochen Garcke, Jan Hamaekers, Gitta Kutyniok, Dirk A. Lorenz, Rick Oerder, Chirag Varun Shukla

    Abstract: Machine learning techniques paired with the availability of massive datasets dramatically enhance our ability to explore the chemical compound space by providing fast and accurate predictions of molecular properties. However, learning on large datasets is strongly limited by the availability of computational resources and can be infeasible in some scenarios. Moreover, the instances in the datasets…

    Submitted 15 June, 2023; originally announced June 2023.

  29. Learning optimal controllers: a dynamical motion primitive approach

    Authors: Hugo T. M. Kussaba, Abdalla Swikir, Fan Wu, Anastasija Demerdjieva, Gitta Kutyniok, Sami Haddadin

    Abstract: Real-time computation of optimal control is a challenging problem and, to solve this difficulty, many frameworks proposed to use learning techniques to learn (possibly sub-optimal) controllers and enable their usage in an online fashion. Among these techniques, the optimal motion framework is a simple, yet powerful technique, that obtained success in many complex real-world applications. The main…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: This work has been accepted to the 22nd IFAC World Congress

  30. arXiv:2305.16427  [pdf, other]

    cs.LG cs.AI

    Neural (Tangent Kernel) Collapse

    Authors: Mariia Seleznova, Dana Weitzner, Raja Giryes, Gitta Kutyniok, Hung-Hsu Chou

    Abstract: This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and structure in the last-layer features of well-trained classification DNNs. We adopt the natural assumption that the empirical NTK develops a block structure align…

    Submitted 26 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the 37th Conference on Neural Information Processing Systems, 2023

  31. arXiv:2305.13084  [pdf, ps, other]

    cs.LG

    A Fractional Graph Laplacian Approach to Oversmoothing

    Authors: Sohir Maskey, Raffaele Paolino, Aras Bacho, Gitta Kutyniok

    Abstract: Graph neural networks (GNNs) have shown state-of-the-art performances in various applications. However, GNNs often struggle to capture long-range dependencies in graphs due to oversmoothing. In this paper, we generalize the concept of oversmoothing from undirected to directed graphs. To this aim, we extend the notion of Dirichlet energy by considering a directed symmetrically normalized Laplacian.…

    Submitted 31 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: First two authors contributed equally. 37 pages, 8 images

  32. arXiv:2301.11456  [pdf, other]

    cs.LG

    Graph Scattering beyond Wavelet Shackles

    Authors: Christian Koke, Gitta Kutyniok

    Abstract: This work develops a flexible and mathematically sound framework for the design and analysis of graph scattering networks with variable branching ratios and generic functional calculus filters. Spectrally-agnostic stability guarantees for node- and graph-level perturbations are derived; the vertex-set non-preserving case is treated by utilizing recently developed mathematical-physics based tools.…

    Submitted 26 January, 2023; originally announced January 2023.

  33. arXiv:2301.06148  [pdf, ps, other]

    math.OC cs.AI cs.IT

    Computability of Optimizers

    Authors: Yunseok Lee, Holger Boche, Gitta Kutyniok

    Abstract: Optimization problems are a staple of today's scientific and technical landscape. However, at present, solvers of such problems are almost exclusively run on digital hardware. Using Turing machines as a mathematical model for any type of digital hardware, in this paper, we analyze fundamental limitations of this conceptual approach of solving optimization problems. Since in most applications, the…

    Submitted 15 January, 2023; originally announced January 2023.

  34. arXiv:2212.11777  [pdf, other]

    cs.NI cs.LG eess.SP

    Dataset of Pathloss and ToA Radio Maps With Localization Application

    Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

    Abstract: In this article, we present a collection of radio map datasets in dense urban setting, which we generated and made publicly available. The datasets include simulated pathloss/received signal strength (RSS) and time of arrival (ToA) radio maps over a large collection of realistic dense urban setting in real city maps. The two main applications of the presented dataset are 1) learning methods that p…

    Submitted 16 September, 2024; v1 submitted 18 November, 2022; originally announced December 2022.

  35. arXiv:2212.00693  [pdf, ps, other]

    cs.CC math.AP

    Complexity Blowup for Solutions of the Laplace and the Diffusion Equation

    Authors: Aras Bacho, Holger Boche, Gitta Kutyniok

    Abstract: In this paper, we investigate the computational complexity of solutions to the Laplace and the diffusion equation. We show that for a certain class of initial-boundary value problems of the Laplace and the diffusion equation, the solution operator is $\# P_1/ \#P$-complete in the sense that it maps polynomial-time computable functions to the set of $\#P_1/ \#P$-complete functions. Consequently, th…

    Submitted 12 September, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: The results of this paper on simulating physical theories on digital computers influenced the article of Holger Boche and Frank Fitzek "Metaverse at the campfire of the future" in Germany's major newspaper, the Frankfurter Allgemeine Zeitung (FAZ) (URL: https://www.faz.net/-ikh-aqw8x). Technological challenges for the design of the Metaverse are discussed in this article

    MSC Class: 68Q15; 68Q04; 68Q17; 68Q17; 68Q25; 35K05; 35J05

  36. arXiv:2211.12857  [pdf, other]

    cs.CV

    Explaining Image Classifiers with Multiscale Directional Image Representation

    Authors: Stefan Kolek, Robert Windesheim, Hector Andrade Loarca, Gitta Kutyniok, Ron Levie

    Abstract: Image classifiers are known to be difficult to interpret and therefore require explanation methods to understand their decisions. We present ShearletX, a novel mask explanation method for image classifiers based on the shearlet transform -- a multiscale directional image representation. Current mask explanation methods are regularized by smoothness constraints that protect against undesirable fine…

    Submitted 28 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Journal ref: CVPR 2023

  37. arXiv:2210.08342  [pdf, ps, other]

    cs.LG math-ph

    Symbolic Recovery of Differential Equations: The Identifiability Problem

    Authors: Philipp Scholl, Aras Bacho, Holger Boche, Gitta Kutyniok

    Abstract: Symbolic recovery of differential equations is the ambitious attempt at automating the derivation of governing equations with the use of machine learning techniques. In contrast to classical methods which assume the structure of the equation to be known and focus on the estimation of specific parameters, these algorithms aim to learn the structure and the parameters simultaneously. While the uniqu…

    Submitted 9 October, 2024; v1 submitted 15 October, 2022; originally announced October 2022.

  38. arXiv:2210.08219  [pdf, other]

    cs.LG

    Unveiling the Sampling Density in Non-Uniform Geometric Graphs

    Authors: Raffaele Paolino, Aleksandar Bojchevski, Stephan Günnemann, Gitta Kutyniok, Ron Levie

    Abstract: A powerful framework for studying graphs is to consider them as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius. Currently, the literature mostly focuses on uniform sampling and constant neighborhood radius. However, real-world graphs are likely to be better represented b…

    Submitted 25 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: updated affiliations; improved references; more experiments; streamlined the paper; added justification for the geometric graph with hubs model

  39. arXiv:2206.05530  [pdf, other]

    cs.LG

    Memorization-Dilation: Modeling Neural Collapse Under Label Noise

    Authors: Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier

    Abstract: The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of all examples of the same class tend to collapse to a single representation, and the features of different classes tend to separate as much as possible. Neural co…

    Submitted 4 April, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: to be published at ICLR 2023

  40. arXiv:2205.15117  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    OOD Link Prediction Generalization Capabilities of Message-Passing GNNs in Larger Test Graphs

    Authors: Yangze Zhou, Gitta Kutyniok, Bruno Ribeiro

    Abstract: This work provides the first theoretical study on the ability of graph Message Passing Neural Networks (gMPNNs) -- such as Graph Neural Networks (GNNs) -- to perform inductive out-of-distribution (OOD) link prediction tasks, where deployment (test) graph sizes are larger than training graphs. We first prove non-asymptotic bounds showing that link predictors based on permutation-equivariant (struct…

    Submitted 9 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted at NeurIPS 2022

  41. arXiv:2203.08890  [pdf, other]

    cs.LG math.HO stat.ML

    The Mathematics of Artificial Intelligence

    Authors: Gitta Kutyniok

    Abstract: We currently witness the spectacular success of artificial intelligence in both science and public life. However, the development of a rigorous mathematical foundation is still at an early stage. In this survey article, which is based on an invited lecture at the International Congress of Mathematicians 2022, we will in particular focus on the current "workhorse" of artificial intelligence, namely…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: 16 pages, 7 figures

    MSC Class: Primary 68T07; Secondary 41A25; 42C15; 35C20; 65D18

  42. arXiv:2202.13490  [pdf, ps, other]

    cs.LG cs.AI eess.SP

    Limitations of Deep Learning for Inverse Problems on Digital Hardware

    Authors: Holger Boche, Adalbert Fono, Gitta Kutyniok

    Abstract: Deep neural networks have seen tremendous success over the last years. Since the training is performed on digital hardware, in this paper, we analyze what actually can be computed on current hardware platforms modeled as Turing machines, which would lead to inherent restrictions of deep learning. For this, we focus on the class of inverse problems, which, in particular, encompasses any task to rec…

    Submitted 25 October, 2023; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: To be published in IEEE Transactions on Information Theory

  43. arXiv:2202.00738  [pdf, ps, other]

    cs.LG cs.NI eess.SP

    LocUNet: Fast Urban Positioning Using Radio Maps and Deep Learning

    Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

    Abstract: This paper deals with the problem of localization in a cellular network in a dense urban scenario. Global Navigation Satellite Systems (GNSS) typically perform poorly in urban environments, where the likelihood of line-of-sight conditions is low, and thus alternative localization methods are required for good accuracy. We present LocUNet: A deep learning method for localization, based merely on Re…

    Submitted 2 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: To appear in ICASSP 2022. arXiv admin note: substantial text overlap with arXiv:2106.12556

  44. arXiv:2202.00645  [pdf, other]

    cs.LG cs.AI math.NA math.PR

    Generalization Analysis of Message Passing Neural Networks on Large Random Graphs

    Authors: Sohir Maskey, Ron Levie, Yunseok Lee, Gitta Kutyniok

    Abstract: Message passing neural networks (MPNN) have seen a steep rise in popularity since their introduction as generalizations of convolutional neural networks to graph-structured data, and are now considered state-of-the-art tools for solving a large variety of graph-focused problems. We study the generalization error of MPNNs in graph classification and regression. We assume that graphs of different cl…

    Submitted 4 August, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Preprint in Review

    MSC Class: 68T07; 68R10

  45. arXiv:2202.00553  [pdf, other]

    cs.LG cs.AI

    Neural Tangent Kernel Beyond the Infinite-Width Limit: Effects of Depth and Initialization

    Authors: Mariia Seleznova, Gitta Kutyniok

    Abstract: Neural Tangent Kernel (NTK) is widely used to analyze overparametrized neural networks due to the famous result by Jacot et al. (2018): in the infinite-width limit, the NTK is deterministic and constant during training. However, this result cannot explain the behavior of deep networks, since it generally does not hold if depth and width tend to infinity simultaneously. In this paper, we study the…

    Submitted 21 July, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:19522-19560, 2022

  46. arXiv:2110.08252  [pdf, other]

    cs.LG cs.AI cs.IT

    A Rate-Distortion Framework for Explaining Black-box Model Decisions

    Authors: Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok

    Abstract: We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model such as neural networks. Our experiments demonstrate the framework's adaptability to diverse data modalities, particularly images, audio, and…

    Submitted 12 October, 2021; originally announced October 2021.

  47. arXiv:2110.03485  [pdf, other]

    cs.AI cs.CV

    Cartoon Explanations of Image Classifiers

    Authors: Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok

    Abstract: We present CartoonX (Cartoon Explanation), a novel model-agnostic explanation method tailored towards image classifiers and based on the rate-distortion explanation (RDE) framework. Natural images are roughly piece-wise smooth signals -- also called cartoon-like images -- and tend to be sparse in the wavelet domain. CartoonX is the first explanation method to exploit this by requiring its explanat…

    Submitted 20 October, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: ECCV 2022 (oral)

  48. arXiv:2109.10096  [pdf, ps, other]

    cs.LG math.NA

    Transferability of Graph Neural Networks: an Extended Graphon Approach

    Authors: Sohir Maskey, Ron Levie, Gitta Kutyniok

    Abstract: We study spectral graph convolutional neural networks (GCNNs), where filters are defined as continuous functions of the graph shift operator (GSO) through functional calculus. A spectral GCNN is not tailored to one specific graph and can be transferred between different graphs. It is hence important to study the GCNN transferability: the capacity of the network to have approximately the same reper…

    Submitted 27 June, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: Preprint in Review

    MSC Class: 68T07; 68R10; 47A60

  49. arXiv:2108.05732  [pdf, other]

    cs.LG cs.CV math.FA math.NA

    Deep Microlocal Reconstruction for Limited-Angle Tomography

    Authors: Héctor Andrade-Loarca, Gitta Kutyniok, Ozan Öktem, Philipp Petersen

    Abstract: We present a deep learning-based algorithm to jointly solve a reconstruction problem and a wavefront set extraction problem in tomographic imaging. The algorithm is based on a recently developed digital wavefront set extractor as well as the well-known microlocal canonical relation for the Radon transform. We use the wavefront set information about x-ray data to improve the reconstruction by requi…

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 43 pages, 8 figures

    MSC Class: 35A18; 65T60; 68T10

  50. arXiv:2106.12556  [pdf, other]

    cs.LG cs.NI eess.SP

    Real-time Outdoor Localization Using Radio Maps: A Deep Learning Approach

    Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

    Abstract: Global Navigation Satellite Systems typically perform poorly in urban environments, where the likelihood of line-of-sight conditions between devices and satellites is low. Therefore, alternative location methods are required to achieve good accuracy. We present LocUNet: A convolutional, end-to-end trained neural network (NN) for the localization task, which is able to estimate the position of a us…

    Submitted 9 April, 2023; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Submitted to IEEE Transactions on Wireless Communications