Skip to main content

Showing 1–50 of 63 results for author: Köthe, U

.
  1. arXiv:2506.01522  [pdf, ps, other

    cs.LG stat.ML

    Beyond Diagonal Covariance: Flexible Posterior VAEs via Free-Form Injective Flows

    Authors: Peter Sorrenson, Lukas Lührs, Hans Olischläger, Ullrich Köthe

    Abstract: Variational Autoencoders (VAEs) are powerful generative models widely used for learning interpretable latent spaces, quantifying uncertainty, and compressing data for downstream generative tasks. VAEs typically rely on diagonal Gaussian posteriors due to computational constraints. Using arguments grounded in differential geometry, we demonstrate inherent limitations in the representational capacit… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  2. arXiv:2502.00820  [pdf, other

    cs.LG cs.CV

    OOD Detection with immature Models

    Authors: Behrooz Montazeran, Ullrich Köthe

    Abstract: Likelihood-based deep generative models (DGMs) have gained significant attention for their ability to approximate the distributions of high-dimensional data. However, these models lack a performance guarantee in assigning higher likelihood values to in-distribution (ID) inputs, data the models are trained on, compared to out-of-distribution (OOD) inputs. This counter-intuitive behaviour is particu… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: 17 pages, 2 Tables, 9 Figures

    MSC Class: 53A45 ACM Class: I.4.7; I.4.9

  3. arXiv:2410.19492  [pdf, other

    cs.LG

    TRADE: Transfer of Distributions between External Conditions with Normalizing Flows

    Authors: Stefan Wahl, Armand Rousselot, Felix Draxler, Henrik Schopmans, Ullrich Köthe

    Abstract: Modeling distributions that depend on external control parameters is a common scenario in diverse applications like molecular simulations, where system properties like temperature affect molecular configurations. Despite the relevance of these applications, existing solutions are unsatisfactory as they require severely restricted model architectures or rely on energy-based training, which is prone… ▽ More

    Submitted 7 March, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: Accepted as Poster at AISTATS 2025

  4. arXiv:2410.19426  [pdf, other

    cs.LG stat.ML

    Analyzing Generative Models by Manifold Entropic Metrics

    Authors: Daniel Galperin, Ullrich Köthe

    Abstract: Good generative models should not only synthesize high quality data, but also utilize interpretable representations that aid human understanding of their behavior. However, it is difficult to measure objectively if and to what degree desirable properties of disentangled representations have been achieved. Inspired by the principle of independent mechanisms, we address this difficulty by introducin… ▽ More

    Submitted 7 April, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: Camera-ready version: accepted at AISTATS 2025

  5. arXiv:2407.09297  [pdf, ps, other

    cs.LG stat.ML

    Learning Distances from Data with Normalizing Flows and Score Matching

    Authors: Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr, Ullrich Köthe

    Abstract: Density-based distances (DBDs) provide a principled approach to metric learning by defining distances in terms of the underlying data distribution. By employing a Riemannian metric that increases in regions of low probability density, shortest paths naturally follow the data manifold. Fermat distances, a specific type of DBD, have attractive properties, but existing estimators based on nearest nei… ▽ More

    Submitted 30 May, 2025; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: ICML 2025

  6. arXiv:2406.15104  [pdf, other

    cs.CR cs.CV

    Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors

    Authors: Peter Lorenz, Mario Fernandez, Jens Müller, Ullrich Köthe

    Abstract: Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast. They are showing an option to protect a pre-trained classifier against natural distribution shifts and claim… ▽ More

    Submitted 28 January, 2025; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: accepted at ICML workshop 2024

  7. arXiv:2406.03154  [pdf, other

    cs.LG cs.AI

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Recent advances in probabilistic deep learning enable efficient amortized Bayesian inference in settings where the likelihood function is only implicitly defined by a simulation program (simulation-based inference; SBI). But how faithful is such inference if the simulation represents reality somewhat inaccurately, that is, if the true system behavior at test time deviates from the one seen during… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Extended version of the conference paper https://doi.org/10.1007/978-3-031-54605-1_35. arXiv admin note: text overlap with arXiv:2112.08866

  8. DALSA: Domain Adaptation for Supervised Learning From Sparsely Annotated MR Images

    Authors: Michael Götz, Christian Weber, Franciszek Binczyk, Joanna Polanska, Rafal Tarnawski, Barbara Bobek-Billewicz, Ullrich Köthe, Jens Kleesiek, Bram Stieltjes, Klaus H. Maier-Hein

    Abstract: We propose a new method that employs transfer learning techniques to effectively correct sampling selection errors introduced by sparse annotations during supervised learning for automated tumor segmentation. The practicality of current learning-based automated tissue classification approaches is severely impeded by their dependency on manually segmented training databases that need to be recreate… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Journal ref: IEEE Transactions on Medical Imaging ( Volume: 35, Issue: 1, January 2016)

  9. arXiv:2402.06578  [pdf, other

    cs.LG stat.ML

    On the Universality of Volume-Preserving and Coupling-Based Normalizing Flows

    Authors: Felix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe

    Abstract: We present a novel theoretical framework for understanding the expressive power of normalizing flows. Despite their prevalence in scientific applications, a comprehensive understanding of flows remains elusive due to their restricted architectures. Existing theorems fall short as they require the use of arbitrarily ill-conditioned neural networks, limiting practical applicability. We propose a dis… ▽ More

    Submitted 29 January, 2025; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  10. arXiv:2312.10107  [pdf, other

    cs.LG cs.AI

    Towards Context-Aware Domain Generalization: Understanding the Benefits and Limits of Marginal Transfer Learning

    Authors: Jens Müller, Lars Kühmichel, Martin Rohbeck, Stefan T. Radev, Ullrich Köthe

    Abstract: In this work, we analyze the conditions under which information about the context of an input $X$ can improve the predictions of deep learning models in new domains. Following work in marginal transfer learning in Domain Generalization (DG), we formalize the notion of context as a permutation-invariant representation of a set of data points that originate from the same domain as the input itself.… ▽ More

    Submitted 21 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2312.09852  [pdf, other

    cs.LG stat.ML

    Learning Distributions on Manifolds with Free-Form Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Ullrich Köthe

    Abstract: We propose Manifold Free-Form Flows (M-FFF), a simple new generative model for data on manifolds. The existing approaches to learning a distribution on arbitrary manifolds are expensive at inference time, since sampling requires solving a differential equation. Our method overcomes this limitation by sampling in a single function evaluation. The key innovation is to optimize a neural network via m… ▽ More

    Submitted 25 November, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2024

  12. arXiv:2312.05440  [pdf, other

    cs.LG cs.AI stat.ML

    Consistency Models for Scalable and Fast Simulation-Based Inference

    Authors: Marvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T Radev

    Abstract: Simulation-based inference (SBI) is constantly in search of more expressive and efficient algorithms to accurately infer the parameters of complex simulation models. In line with this goal, we present consistency models for posterior estimation (CMPE), a new conditional sampler for SBI that inherits the advantages of recent unconstrained architectures and overcomes their sampling inefficiency at i… ▽ More

    Submitted 4 November, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Journal ref: Neural Information Processing Systems (NeurIPS 2024)

  13. arXiv:2310.16624  [pdf, other

    cs.LG stat.ML

    Free-form Flows: Make Any Architecture a Normalizing Flow

    Authors: Felix Draxler, Peter Sorrenson, Lea Zimmermann, Armand Rousselot, Ullrich Köthe

    Abstract: Normalizing Flows are generative models that directly maximize the likelihood. Previously, the design of normalizing flows was largely constrained by the need for analytical invertibility. We overcome this constraint by a training procedure that uses an efficient estimator for the gradient of the change of variables formula. This enables any dimension-preserving neural network to serve as a genera… ▽ More

    Submitted 24 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Camera-ready version: accepted at AISTATS 2024

  14. arXiv:2310.11122  [pdf, other

    stat.ML cs.LG stat.ME

    Sensitivity-Aware Amortized Bayesian Inference

    Authors: Lasse Elsemüller, Hans Olischläger, Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose sensitivity-aware amortized Bayesian inference (SA-ABI), a multifaceted approach to efficiently integrate sensitivity analyses into simulation-based inference with neur… ▽ More

    Submitted 28 August, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Published in TMLR (2024)

    Journal ref: Transactions on Machine Learning Research (08/2024)

  15. arXiv:2310.04395  [pdf, other

    cs.LG cs.AI

    Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference

    Authors: Marvin Schmitt, Desi R. Ivanova, Daniel Habermann, Ullrich Köthe, Paul-Christian Bürkner, Stefan T. Radev

    Abstract: We propose a method to improve the efficiency and accuracy of amortized Bayesian inference by leveraging universal symmetries in the joint probabilistic model of parameters and data. In a nutshell, we invert Bayes' theorem and estimate the marginal likelihood based on approximate representations of the joint model. Upon perfect approximation, the marginal likelihood is constant across all paramete… ▽ More

    Submitted 23 July, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria. PMLR 235, 2024

    Journal ref: ICML 2024: PMLR 235, 2024

  16. arXiv:2309.09764  [pdf, other

    cs.CV cs.LG eess.IV

    Application-driven Validation of Posteriors in Inverse Problems

    Authors: Tim J. Adler, Jan-Hinrich Nölke, Annika Reinke, Minu Dietlinde Tizabi, Sebastian Gruber, Dasha Trofimova, Lynton Ardizzone, Paul F. Jaeger, Florian Buettner, Ullrich Köthe, Lena Maier-Hein

    Abstract: Current deep learning-based solutions for image analysis tasks are commonly incapable of handling problems to which multiple different plausible solutions exist. In response, posterior-based methods such as conditional Diffusion Models and Invertible Neural Networks have emerged; however, their translation is hampered by a lack of research on adequate validation. In other words, the way progress i… ▽ More

    Submitted 21 January, 2025; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted at Medical Image Analysis. Shared first authors: Tim J. Adler and Jan-Hinrich Nölke. 24 pages, 9 figures, 1 table

    Journal ref: Medical Image Analysis, Volume 101, 2025, 103474, ISSN 1361-8415

  17. arXiv:2308.02652  [pdf, other

    cs.LG

    A Review of Change of Variable Formulas for Generative Modeling

    Authors: Ullrich Köthe

    Abstract: Change-of-variables (CoV) formulas allow to reduce complicated probability densities to simpler ones by a learned transformation with tractable Jacobian determinant. They are thus powerful tools for maximum-likelihood learning, Bayesian inference, outlier detection, model selection, etc. CoV formulas have been derived for a large variety of model types, but this information is scattered over many… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  18. arXiv:2306.16015  [pdf, other

    cs.LG cs.AI stat.ML

    BayesFlow: Amortized Bayesian Workflows With Neural Networks

    Authors: Stefan T Radev, Marvin Schmitt, Lukas Schumacher, Lasse Elsemüller, Valentin Pratz, Yannik Schälte, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows are the approximation of intractable posterior distributions for diverse model types and the comparison of competing models of the same process in terms of the… ▽ More

    Submitted 10 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  19. arXiv:2306.13520  [pdf, other

    cs.LG stat.ML

    On the Convergence Rate of Gaussianization with Random Rotations

    Authors: Felix Draxler, Lars Kühmichel, Armand Rousselot, Jens Müller, Christoph Schnörr, Ullrich Köthe

    Abstract: Gaussianization is a simple generative model that can be trained without backpropagation. It has shown compelling performance on low dimensional data. As the dimension increases, however, it has been observed that the convergence speed slows down. We show analytically that the number of required layers scales linearly with the dimension for Gaussian input. We argue that this is because the model i… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  20. arXiv:2306.01843  [pdf, other

    cs.LG

    Lifting Architectural Constraints of Injective Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Lea Zimmermann, Ullrich Köthe

    Abstract: Normalizing Flows explicitly maximize a full-dimensional likelihood on the training data. However, real data is typically only supported on a lower-dimensional manifold leading the model to expend significant compute on modeling noise. Injective Flows fix this by jointly learning a manifold and the distribution on it. So far, they have been limited by restrictive architectures and/or high computat… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Camera-ready version: accepted to ICLR 2024

  21. Training Invertible Neural Networks as Autoencoders

    Authors: The-Gia Leo Nguyen, Lynton Ardizzone, Ullrich Köthe

    Abstract: Autoencoders are able to learn useful data representations in an unsupervised matter and have been widely used in various machine learning and computer vision tasks. In this work, we present methods to train Invertible Neural Networks (INNs) as (variational) autoencoders which we call INN (variational) autoencoders. Our experiments on MNIST, CIFAR and CelebA show that for low bottleneck sizes our… ▽ More

    Submitted 21 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Conference Paper at GCPR2019

    ACM Class: I.5.1; I.4.10; I.4.2; I.4.5

    Journal ref: In: Fink, G., Frintrop, S., Jiang, X. (eds) Pattern Recognition. DAGM GCPR 2019. Lecture Notes in Computer Science, vol 11824. Springer, Cham

  22. Unsupervised Domain Transfer with Conditional Invertible Neural Networks

    Authors: Kris K. Dreher, Leonardo Ayala, Melanie Schellenberg, Marco Hübner, Jan-Hinrich Nölke, Tim J. Adler, Silvia Seidlitz, Jan Sellner, Alexander Studier-Fischer, Janek Gröhl, Felix Nickel, Ullrich Köthe, Alexander Seitel, Lena Maier-Hein

    Abstract: Synthetic medical image generation has evolved as a key technique for neural network training and validation. A core challenge, however, remains in the domain gap between simulations and real data. While deep learning-based domain transfer using Cycle Generative Adversarial Networks and similar architectures has led to substantial progress in the field, there are use cases in which state-of-the-ar… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  23. arXiv:2303.09989  [pdf, other

    cs.LG stat.ML

    Finding Competence Regions in Domain Generalization

    Authors: Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Köthe

    Abstract: We investigate a "learning to reject" framework to address the problem of silent failures in Domain Generalization (DG), where the test distribution differs from the training distribution. Assuming a mild distribution shift, we wish to accept out-of-distribution (OOD) data from a new domain whenever a model's estimated competence foresees trustworthy responses, instead of rejecting OOD data outrig… ▽ More

    Submitted 21 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: The paper has been published at TMLR (see https://openreview.net/forum?id=TSy0vuwQFN)

    Journal ref: Transactions on Machine Learning Research (06/2023)

  24. arXiv:2302.09125  [pdf, other

    cs.LG stat.ML

    JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models

    Authors: Stefan T. Radev, Marvin Schmitt, Valentin Pratz, Umberto Picchini, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: This work proposes ``jointly amortized neural approximation'' (JANA) of intractable likelihood functions and posterior densities arising in Bayesian surrogate modeling and simulation-based inference. We train three complementary networks in an end-to-end fashion: 1) a summary network to compress individual data points, sets, or time series into informative embedding vectors; 2) a posterior network… ▽ More

    Submitted 20 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  25. arXiv:2301.13462  [pdf, other

    physics.ao-ph cs.LG

    Towards Learned Emulation of Interannual Water Isotopologue Variations in General Circulation Models

    Authors: Jonathan Wider, Jakob Kruse, Nils Weitzel, Janica C. Bühler, Ullrich Köthe, Kira Rehfeld

    Abstract: Simulating abundances of stable water isotopologues, i.e. molecules differing in their isotopic composition, within climate models allows for comparisons with proxy data and, thus, for testing hypotheses about past climate and validating climate models under varying climatic conditions. However, many models are run without explicitly simulating water isotopologues. We investigate the possibility t… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Journal ref: Environmental Data Science, Volume 2 (2023), e35

  26. Noise-Net: Determining physical properties of HII regions reflecting observational uncertainties

    Authors: Da Eun Kang, Ralf S. Klessen, Victor F. Ksoll, Lynton Ardizzone, Ullrich Koethe, Simon C. O. Glover

    Abstract: Stellar feedback, the energetic interaction between young stars and their birthplace, plays an important role in the star formation history of the universe and the evolution of the interstellar medium (ISM). Correctly interpreting the observations of star-forming regions is essential to understand stellar feedback, but it is a non-trivial task due to the complexity of the feedback processes and de… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: 22 pages, 14 figures, Accepted for publication by MNRAS on 04. January

  27. arXiv:2211.13165  [pdf, other

    stat.ME stat.ML

    Neural Superstatistics for Bayesian Estimation of Dynamic Cognitive Models

    Authors: Lukas Schumacher, Paul-Christian Bürkner, Andreas Voss, Ullrich Köthe, Stefan T. Radev

    Abstract: Mathematical models of cognition are often memoryless and ignore potential fluctuations of their parameters. However, human cognition is inherently dynamic. Thus, we propose to augment mechanistic cognitive models with a temporal dimension and estimate the resulting dynamics from a superstatistics perspective. Such a model entails a hierarchy between a low-level observation model and a high-level… ▽ More

    Submitted 20 September, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  28. arXiv:2210.14032  [pdf, other

    cs.LG stat.ML

    Whitening Convergence Rate of Coupling-based Normalizing Flows

    Authors: Felix Draxler, Christoph Schnörr, Ullrich Köthe

    Abstract: Coupling-based normalizing flows (e.g. RealNVP) are a popular family of normalizing flow architectures that work surprisingly well in practice. This calls for theoretical understanding. Existing work shows that such flows weakly converge to arbitrary data distributions. However, they make no statement about the stricter convergence criterion used in practice, the maximum likelihood loss. For the f… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Proceedings of 36th Conference on Neural Information Processing System (NeurIPS 2022)

  29. arXiv:2208.14024  [pdf, other

    cs.LG

    Positive Difference Distribution for Image Outlier Detection using Normalizing Flows and Contrastive Data

    Authors: Robert Schmier, Ullrich Köthe, Christoph-Nikolas Straehle

    Abstract: Detecting test data deviating from training data is a central problem for safe and robust machine learning. Likelihoods learned by a generative model, e.g., a normalizing flow via standard log-likelihood training, perform poorly as an outlier score. We propose to use an unlabelled auxiliary dataset and a probabilistic outlier score for outlier detection. We use a self-supervised feature extractor… ▽ More

    Submitted 26 April, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research (04/2023)

  30. arXiv:2207.14625  [pdf, other

    cs.CR cs.CV cs.LG

    Content-Aware Differential Privacy with Conditional Invertible Neural Networks

    Authors: Malte Tölle, Ullrich Köthe, Florian André, Benjamin Meder, Sandy Engelhardt

    Abstract: Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: Accepted at 3rd DeCaF Workshop (MICCAI22)

    MSC Class: J.3 I.4.0 J.3 I.2.6

  31. arXiv:2203.16542  [pdf, other

    cs.CV

    Towards Multimodal Depth Estimation from Light Fields

    Authors: Titus Leistner, Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: Light field applications, especially light field rendering and depth estimation, developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or only deliver a weak performance. We argue that this is due current methods only considering a single "true" depth… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  32. arXiv:2202.00027  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Exoplanet Characterization using Conditional Invertible Neural Networks

    Authors: Jonas Haldemann, Victor Ksoll, Daniel Walter, Yann Alibert, Ralf S. Klessen, Willy Benz, Ullrich Koethe, Lynton Ardizzone, Carsten Rother

    Abstract: The characterization of an exoplanet's interior is an inverse problem, which requires statistical methods such as Bayesian inference in order to be solved. Current methods employ Markov Chain Monte Carlo (MCMC) sampling to infer the posterior probability of planetary structure parameters for a given exoplanet. These methods are time consuming since they require the calculation of a large number of… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 15 pages, 13 figures, submitted to Astronomy & Astrophysics

    Journal ref: A&A 672, A180 (2023)

  33. Emission-line diagnostics of HII regions using conditional Invertible Neural Networks

    Authors: Da Eun Kang, Eric W. Pellegrini, Lynton Ardizzone, Ralf S. Klessen, Ullrich Koethe, Simon C. O. Glover, Victor F. Ksoll

    Abstract: Young massive stars play an important role in the evolution of the interstellar medium (ISM) and the self-regulation of star formation in giant molecular clouds (GMCs) by injecting energy, momentum, and radiation (stellar feedback) into surrounding environments, disrupting the parental clouds, and regulating further star formation. Information of the stellar feedback inheres in the emission we obs… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 32 pages, 23 figures, Accepted for publication by MNRAS on 21. January

  34. arXiv:2112.08866  [pdf, other

    stat.ME cs.LG stat.ML

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Neural density estimators have proven remarkably powerful in performing efficient simulation-based Bayesian inference in various research domains. In particular, the BayesFlow framework uses a two-step approach to enable amortized parameter estimation in settings where the likelihood function is implicitly defined by a simulation program. But how faithful is such inference when simulations are poo… ▽ More

    Submitted 8 November, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

  35. Inference of cosmic-ray source properties by conditional invertible neural networks

    Authors: Teresa Bister, Martin Erdmann, Ullrich Köthe, Josina Schulte

    Abstract: The inference of physical parameters from measured distributions constitutes a core task in physics data analyses. Among recent deep learning methods, so-called conditional invertible neural networks provide an elegant approach owing to their probability-preserving bijective mapping properties. They enable training the parameter-observation correspondence in one mapping direction and evaluating th… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 10 pages, 8 figures

  36. arXiv:2105.02104  [pdf, other

    cs.CV cs.AI

    Conditional Invertible Neural Networks for Diverse Image-to-Image Translation

    Authors: Lynton Ardizzone, Jakob Kruse, Carsten Lüth, Niels Bracher, Carsten Rother, Ullrich Köthe

    Abstract: We introduce a new architecture called a conditional invertible neural network (cINN), and use it to address the task of diverse image-to-image translation for natural images. This is not easily possible with existing INN models due to some fundamental limitations. The cINN combines the purely generative INN model with an unconstrained feed-forward network, which efficiently preprocesses the condi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1907.02392

    MSC Class: 68T01

  37. arXiv:2101.10763  [pdf, other

    cs.LG

    Benchmarking Invertible Architectures on Inverse Problems

    Authors: Jakob Kruse, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Recent work demonstrated that flow-based invertible neural networks are promising tools for solving ambiguous inverse problems. Following up on this, we investigate how ten invertible architectures and related models fare on two intuitive, low-dimensional benchmark problems, obtaining the best results with coupling layers and simple autoencoders. We hope that our initial efforts inspire other rese… ▽ More

    Submitted 22 June, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    MSC Class: 68T01

    Journal ref: Workshop on Invertible Neural Networks and Normalizing Flows (ICML 2019)

  38. Measuring QCD Splittings with Invertible Networks

    Authors: Sebastian Bieringer, Anja Butter, Theo Heimel, Stefan Höche, Ullrich Köthe, Tilman Plehn, Stefan T. Radev

    Abstract: QCD splittings are among the most fundamental theory concepts at the LHC. We show how they can be studied systematically with the help of invertible neural networks. These networks work with sub-jet information to extract fundamental parameters from jet samples. Our approach expands the LEP measurements of QCD Casimirs to a systematic test of QCD properties based on low-level jet observables. Star… ▽ More

    Submitted 9 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 25 pages, 11 figures

    Report number: FERMILAB-PUB-20-665-T

    Journal ref: SciPost Phys. 10, 126 (2021)

  39. arXiv:2012.08195  [pdf, other

    cs.CV cs.LG

    Representing Ambiguity in Registration Problems with Conditional Invertible Neural Networks

    Authors: Darya Trofimova, Tim Adler, Lisa Kausch, Lynton Ardizzone, Klaus Maier-Hein, Ulrich Köthe, Carsten Rother, Lena Maier-Hein

    Abstract: Image registration is the basis for many applications in the fields of medical image computing and computer assisted interventions. One example is the registration of 2D X-ray images with preoperative three-dimensional computed tomography (CT) images in intraoperative surgical guidance systems. Due to the high safety requirements in medical applications, estimating registration uncertainty is of a… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: The paper got accepted at Medical Imaging Meets NeurIPS Workshop at Neural Information Processing Systems 2020

  40. arXiv:2012.00524  [pdf, other

    astro-ph.GA astro-ph.SR

    Measuring Young Stars in Space and Time -- II. The Pre-Main-Sequence Stellar Content of N44

    Authors: Victor F. Ksoll, Dimitrios Gouliermis, Elena Sabbi, Jenna E. Ryon, Massimo Robberto, Mario Gennaro, Ralf S. Klessen, Ullrich Koethe, Guido de Marchi, C. -H. Rosie Chen, Michele Cignoni, Andrew E. Dolphin

    Abstract: The Hubble Space Telescope (HST) survey Measuring Young Stars in Space and Time (MYSST) entails some of the deepest photometric observations of extragalactic star formation, capturing even the lowest mass stars of the active star-forming complex N44 in the Large Magellanic Cloud. We employ the new MYSST stellar catalog to identify and characterize the content of young pre-main-sequence (PMS) stars… ▽ More

    Submitted 15 March, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 29 pages, 21 figures, accepted for publication in AJ

  41. arXiv:2012.00521  [pdf, other

    astro-ph.GA astro-ph.SR

    Measuring Young Stars in Space and Time -- I. The Photometric Catalog and Extinction Properties of N44

    Authors: Victor F. Ksoll, Dimitrios Gouliermis, Elena Sabbi, Jenna E. Ryon, Massimo Robberto, Mario Gennaro, Ralf S. Klessen, Ullrich Koethe, Guido de Marchi, C. -H. Rosie Chen, Michele Cignoni, Andrew E. Dolphin

    Abstract: In order to better understand the role of high-mass stellar feedback in regulating star formation in giant molecular clouds, we carried out a Hubble Space Telescope (HST) Treasury Program "Measuring Young Stars in Space and Time" (MYSST) targeting the star-forming complex N44 in the Large Magellanic Cloud (LMC). Using the F555W and F814W broadband filters of both the ACS and WFC3/UVIS, we built a… ▽ More

    Submitted 15 March, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 29 pages, 15 figures, accepted for publication in AJ

  42. arXiv:2011.05110  [pdf, other

    physics.med-ph cs.AI cs.LG

    Invertible Neural Networks for Uncertainty Quantification in Photoacoustic Imaging

    Authors: Jan-Hinrich Nölke, Tim Adler, Janek Gröhl, Thomas Kirchner, Lynton Ardizzone, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Multispectral photoacoustic imaging (PAI) is an emerging imaging modality which enables the recovery of functional tissue parameters such as blood oxygenation. However, the underlying inverse problems are potentially ill-posed, meaning that radically different tissue properties may - in theory - yield comparable measurements. In this work, we present a new approach for handling this specific type… ▽ More

    Submitted 23 November, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures, submitted to "Bildverarbeitung für die Medizin (BVM) 2021"

  43. arXiv:2010.07167  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Robust Models Using The Principle of Independent Causal Mechanisms

    Authors: Jens Müller, Robert Schmier, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Standard supervised learning breaks down under data distribution shift. However, the principle of independent causal mechanisms (ICM, Peters et al. (2017)) can turn this weakness into an opportunity: one can take advantage of distribution shift between different environments during training in order to obtain more robust models. We propose a new gradient-based learning framework whose objective fu… ▽ More

    Submitted 8 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  44. arXiv:2010.00300  [pdf, other

    stat.AP cs.LG q-bio.PE

    OutbreakFlow: Model-based Bayesian inference of disease outbreak dynamics with invertible neural networks and its application to the COVID-19 pandemics in Germany

    Authors: Stefan T. Radev, Frederik Graw, Simiao Chen, Nico T. Mutters, Vanessa M. Eichel, Till Bärnighausen, Ullrich Köthe

    Abstract: Mathematical models in epidemiology are an indispensable tool to determine the dynamics and important characteristics of infectious diseases. Apart from their scientific merit, these models are often used to inform political decisions and intervention measures during an ongoing outbreak. However, reliably inferring the dynamics of ongoing outbreaks by connecting complex models to real data is stil… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  45. arXiv:2007.15036  [pdf, other

    cs.CV cs.LG

    Generative Classifiers as a Basis for Trustworthy Image Classification

    Authors: Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: With the maturing of deep learning systems, trustworthiness is becoming increasingly important for model assessment. We understand trustworthiness as the combination of explainability and robustness. Generative classifiers (GCs) are a promising class of models that are said to naturally accomplish these qualities. However, this has mostly been demonstrated on simple datasets such as MNIST and CIFA… ▽ More

    Submitted 2 December, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

  46. Stellar Parameter Determination from Photometry using Invertible Neural Networks

    Authors: Victor F. Ksoll, Lynton Ardizzone, Ralf Klessen, Ullrich Koethe, Elena Sabbi, Massimo Robberto, Dimitrios Gouliermis, Carsten Rother, Peter Zeidler, Mario Gennaro

    Abstract: Photometric surveys with the Hubble Space Telescope (HST) allow us to study stellar populations with high resolution and deep coverage, with estimates of the physical parameters of the constituent stars being typically obtained by comparing the survey data with adequate stellar evolutionary models. This is a highly non-trivial task due to effects such as differential extinction, photometric errors… ▽ More

    Submitted 21 September, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted for Publication by MNRAS on 19. September, 41 pages, 48 figures, 2 tables

  47. Invertible Networks or Partons to Detector and Back Again

    Authors: Marco Bellagente, Anja Butter, Gregor Kasieczka, Tilman Plehn, Armand Rousselot, Ramon Winterhalder, Lynton Ardizzone, Ullrich Köthe

    Abstract: For simulations where the forward and the inverse directions have a physics meaning, invertible neural networks are especially useful. A conditional INN can invert a detector simulation in terms of high-level observables, specifically for ZW production at the LHC. It allows for a per-event statistical interpretation. Next, we allow for a variable number of QCD jets. We unfold detector effects and… ▽ More

    Submitted 1 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 25 pages, 10 figures

    Journal ref: SciPost Phys. 9, 074 (2020)

  48. arXiv:2006.06085  [pdf, other

    physics.med-ph

    Long short-term memory networks for proton dose calculation in highly heterogeneous tissues

    Authors: Ahmad Neishabouri, Niklas Wahl, Ulrich Köthe, Mark Bangert

    Abstract: A novel dose calculation approach was designed based on the application of LSTM network that processes the 3D patient/phantom geometry as a sequence of 2D computed tomography input slices yielding a corresponding sequence of 2D slices that forms the respective 3D dose distribution. LSTM networks can propagate information effectively in one direction, resulting in a model that can properly imitate… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: 21 Pages, 15 figures, 4 tables. To appear in the Proceedings of the ESTRO 2020 coference, 28 November - 1 December 2020, Vienna, Austria

  49. arXiv:2004.10629  [pdf, other

    stat.ML cs.LG

    Amortized Bayesian model comparison with evidential deep learning

    Authors: Stefan T. Radev, Marco D'Alessandro, Ulf K. Mertens, Andreas Voss, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Comparing competing mathematical models of complex natural processes is a shared goal among many branches of science. The Bayesian probabilistic framework offers a principled way to perform model comparison and extract useful metrics for guiding decisions. However, many interesting models are intractable with standard Bayesian methods, as they lack a closed-form likelihood function or the likeliho… ▽ More

    Submitted 2 March, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

  50. arXiv:2003.06281  [pdf, other

    stat.ML cs.LG

    BayesFlow: Learning complex stochastic models with invertible neural networks

    Authors: Stefan T. Radev, Ulf K. Mertens, Andreas Voss, Lynton Ardizzone, Ullrich Köthe

    Abstract: Estimating the parameters of mathematical models is a common problem in almost all branches of science. However, this problem can prove notably difficult when processes and model descriptions become increasingly complex and an explicit likelihood function is not available. With this work, we propose a novel method for globally amortized Bayesian inference based on invertible neural networks which… ▽ More

    Submitted 1 December, 2020; v1 submitted 13 March, 2020; originally announced March 2020.