
Showing 1–8 of 8 results for author: Fischer, K

Searching in archive stat.
  1. arXiv:2502.03210  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    From Kernels to Features: A Multi-Scale Adaptive Theory of Feature Learning

    Authors: Noa Rubin, Kirsten Fischer, Javed Lindner, David Dahmen, Inbar Seroussi, Zohar Ringel, Michael Krämer, Moritz Helias

    Abstract: Feature learning in neural networks is crucial for their expressive power and inductive biases, motivating various theoretical approaches. Some approaches describe network behavior after training through a change in kernel scale from initialization, resulting in a generalization power comparable to a Gaussian process. Conversely, in other approaches training results in the adaptation of the kernel…

    Submitted 28 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: 33 pages, 12 figures, accepted at International Conference on Machine Learning 2025

  2. arXiv:2305.07715  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    Field theory for optimal signal propagation in ResNets

    Authors: Kirsten Fischer, David Dahmen, Moritz Helias

    Abstract: Residual networks have significantly better trainability and thus performance than feed-forward networks at large depth. Introducing skip connections facilitates signal propagation to deeper layers. In addition, previous works found that adding a scaling parameter for the residual branch further improves generalization performance. While they empirically identified a particularly beneficial range…

    Submitted 26 August, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, under review

  3. arXiv:2202.04925  [pdf, other]

    cond-mat.dis-nn stat.ML

    Decomposing neural networks as mappings of correlation functions

    Authors: Kirsten Fischer, Alexandre René, Christian Keup, Moritz Layer, David Dahmen, Moritz Helias

    Abstract: Understanding the functional principles of information processing in deep neural networks continues to be a challenge, in particular for networks with trained and thus non-random weights. To address this issue, we study the mapping between probability distributions implemented by a deep feed-forward network. We characterize this mapping as an iterated transformation of distributions, where the non…

    Submitted 1 December, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: Published in Physical Review Research. Changes with respect to the previous version: added results with CIFAR-10; added sections to the supplementary (derivation of an analogous result to the depth scale of untrained deep networks, expanded discussion of the applicability of the Gaussian assumption when variables are weakly correlated); clarified main text in some areas; fixed typos.

    Journal ref: Phys. Rev. Research 4, 043143 (2022)

  4. On Feature Relevance Uncertainty: A Monte Carlo Dropout Sampling Approach

    Authors: Kai Fischer, Jonas Schneider

    Abstract: Understanding decisions made by neural networks is key for the deployment of intelligent systems in real world applications. However, the opaque decision making process of these systems is a disadvantage where interpretability is essential. Many feature-based explanation techniques have been introduced over the last few years in the field of machine learning to better understand decisions made by…

    Submitted 11 April, 2023; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 18 pages, 15 figures

    ACM Class: I.2.10; I.4

  5. arXiv:1910.13444  [pdf, other]

    physics.comp-ph cs.LG stat.ML

    Highly-scalable, physics-informed GANs for learning solutions of stochastic PDEs

    Authors: Liu Yang, Sean Treichler, Thorsten Kurth, Keno Fischer, David Barajas-Solano, Josh Romero, Valentin Churavy, Alexandre Tartakovsky, Michael Houston, Prabhat, George Karniadakis

    Abstract: Uncertainty quantification for forward and inverse problems is a central challenge across physical and biomedical disciplines. We address this challenge for the problem of modeling subsurface flow at the Hanford Site by combining stochastic computational models with observational data using physics-informed GAN models. The geographic extent, spatial heterogeneity, and multiple correlation length s…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 3rd Deep Learning on Supercomputers Workshop (DLS) at SC19

  6. arXiv:1904.02642  [pdf, other]

    stat.ML cs.AI cs.LG

    Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

    Authors: Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

    Abstract: Transferring knowledge across tasks to improve data-efficiency is one of the open key challenges in the field of global black-box optimization. Readily available algorithms are typically designed to be universal optimizers and, therefore, often suboptimal for specific tasks. We propose a novel transfer learning method to obtain customized optimizers within the well-established framework of Bayesia…

    Submitted 14 February, 2020; v1 submitted 4 April, 2019; originally announced April 2019.

  7. arXiv:1810.09868  [pdf, other]

    cs.PL cs.LG stat.ML

    Automatic Full Compilation of Julia Programs and ML Models to Cloud TPUs

    Authors: Keno Fischer, Elliot Saba

    Abstract: Google's Cloud TPUs are a promising new hardware architecture for machine learning workloads. They have powered many of Google's milestone machine learning achievements in recent years. Google has now made TPUs available for general use on their cloud platform and as of very recently has opened them up further to allow use by non-TensorFlow frontends. We describe a method and implementation for of…

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: Submitted to SysML 2019

  8. arXiv:1802.09455  [pdf]

    stat.OT

    Assessing the association between pre-course metrics of student preparation and student performance in introductory statistics: Results from early data on simulation-based inference vs. nonsimulation based inference

    Authors: Nathan Tintle, Jake Clark, Karen Fischer, Beth Chance, George Cobb, Soma Roy, Todd Swanson, Jill VanderStoep

    Abstract: The recent simulation-based inference (SBI) movement in algebra-based introductory statistics courses (Stat 101) has provided preliminary evidence of improved student conceptual understanding and retention. However, little is known about whether these positive effects are preferentially distributed across types of students entering the course. We consider how two metrics of Stat 101 student prepar…

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: 16 pages