Skip to main content

Showing 1–15 of 15 results for author: Dürr, O

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.16206  [pdf, other

    stat.ML cs.LG

    Interpretable Neural Causal Models with TRAM-DAGs

    Authors: Beate Sick, Oliver Dürr

    Abstract: The ultimate goal of most scientific studies is to understand the underlying causal mechanism between the involved variables. Structural causal models (SCMs) are widely used to represent such causal mechanisms. Given an SCM, causal queries on all three levels of Pearl's causal hierarchy can be answered: $L_1$ observational, $L_2$ interventional, and $L_3$ counterfactual. An essential aspect of mod… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: Accepted at the CLeaR 2025 Conference

  2. arXiv:2503.03382  [pdf, other

    cs.LG stat.ML

    Paths and Ambient Spaces in Neural Loss Landscapes

    Authors: Daniel Dold, Julius Kobialka, Nicolai Palm, Emanuel Sommer, David Rügamer, Oliver Dürr

    Abstract: Understanding the structure of neural network loss surfaces, particularly the emergence of low-loss tunnels, is critical for advancing neural network theory and practice. In this paper, we propose a novel approach to directly embed loss tunnels into the loss landscape of neural networks. Exploring the properties of these loss tunnels offers new insights into their length and structure and sheds li… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 9 pages, Accepted at AISTATS 2025

  3. arXiv:2401.12950  [pdf, other

    cs.LG stat.ML

    Bayesian Semi-structured Subspace Inference

    Authors: Daniel Dold, David Rügamer, Beate Sick, Oliver Dürr

    Abstract: Semi-structured regression models enable the joint modeling of interpretable structured and complex unstructured feature effects. The structured model part is inspired by statistical models and can be used to infer the input-output relationship for features of particular importance. The complex unstructured part defines an arbitrary deep neural network and thereby provides enough flexibility to ac… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted at AISTATS 2024

  4. arXiv:2308.12785  [pdf, ps, other

    cs.LG stat.ML

    Single-shot Bayesian approximation for neural networks

    Authors: Kai Brach, Beate Sick, Oliver Dürr

    Abstract: Deep neural networks (NNs) are known for their high-prediction performances. However, NNs are prone to yield unreliable predictions when encountering completely new situations without indicating their uncertainty. Bayesian variants of NNs (BNNs), such as Monte Carlo (MC) dropout BNNs, do provide uncertainty measures and simultaneously increase the prediction performance. The only disadvantage of B… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2007.03293

  5. arXiv:2306.06144  [pdf, other

    eess.SP cs.LG stat.AP

    Bayesian Calibration of MEMS Accelerometers

    Authors: Oliver Dürr, Po-Yu Fan, Zong-Xian Yin

    Abstract: This study aims to investigate the utilization of Bayesian techniques for the calibration of micro-electro-mechanical systems (MEMS) accelerometers. These devices have garnered substantial interest in various practical applications and typically require calibration through error-correcting functions. The parameters of these error-correcting functions are determined during a calibration process. Ho… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted in IEEE Sensors

  6. arXiv:2211.13665  [pdf, other

    stat.CO

    Estimating Conditional Distributions with Neural Networks using R package deeptrafo

    Authors: Lucas Kook, Philipp FM Baumann, Oliver Dürr, Beate Sick, David Rügamer

    Abstract: Contemporary empirical applications frequently require flexible regression models for complex response types and large tabular or non-tabular, including image or text, data. Classical regression models either break down under the computational load of processing such data or require additional manual feature extraction to make these problems tractable. Here, we present deeptrafo, a package for fit… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at the Journal of Statistical Software

  7. Deep transformation models for functional outcome prediction after acute ischemic stroke

    Authors: Lisa Herzog, Lucas Kook, Andrea Götschi, Katrin Petermann, Martin Hänsel, Janne Hamann, Oliver Dürr, Susanne Wegener, Beate Sick

    Abstract: In many medical applications, interpretable models with high prediction performance are sought. Often, those models are required to handle semi-structured data like tabular and image data. We show how to apply deep transformation models (DTMs) for distributional regression which fulfill these requirements. DTMs allow the data analyst to specify (deep) neural networks for different input modalities… ▽ More

    Submitted 13 September, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Preprint under review

  8. arXiv:2204.13939  [pdf, other

    cs.LG stat.AP stat.ME stat.ML

    Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing Flows

    Authors: Marcel Arpogaus, Marcus Voss, Beate Sick, Mark Nigge-Uricher, Oliver Dürr

    Abstract: The transition to a fully renewable energy grid requires better forecasting of demand at the low-voltage level to increase efficiency and ensure reliable control. However, high fluctuations and increasing electrification cause huge forecast variability, not reflected in traditional point estimates. Probabilistic load forecasts take future uncertainties into account and thus allow more informed dec… ▽ More

    Submitted 15 June, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

  9. arXiv:2202.05650  [pdf, other

    stat.ML cs.LG

    Bernstein Flows for Flexible Posteriors in Variational Bayes

    Authors: Oliver Dürr, Stephan Hörling, Daniel Dold, Ivonne Kovylov, Beate Sick

    Abstract: Variational inference (VI) is a technique to approximate difficult to compute posteriors by optimization. In contrast to MCMC, VI scales to many observations. In the case of complex posteriors, however, state-of-the-art VI approaches often yield unsatisfactory posterior approximations. This paper presents Bernstein flow variational inference (BF-VI), a robust and easy-to-use method, flexible enoug… ▽ More

    Submitted 23 February, 2024; v1 submitted 11 February, 2022; originally announced February 2022.

  10. arXiv:2106.00528  [pdf, other

    stat.ML cs.LG

    Transformation Models for Flexible Posteriors in Variational Bayes

    Authors: Sefan Hörtling, Daniel Dold, Oliver Dürr, Beate Sick

    Abstract: The main challenge in Bayesian models is to determine the posterior for the model parameters. Already, in models with only one or few parameters, the analytical posterior can only be determined in special settings. In Bayesian neural networks, variational inference is widely used to approximate difficult-to-compute posteriors by variational distributions. Usually, Gaussians are used as variational… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 5 pages, 4 figures

  11. arXiv:2010.08376  [pdf, other

    stat.ML cs.LG

    Deep and interpretable regression models for ordinal outcomes

    Authors: Lucas Kook, Lisa Herzog, Torsten Hothorn, Oliver Dürr, Beate Sick

    Abstract: Outcomes with a natural order commonly occur in prediction tasks and often the available input data are a mixture of complex data like images and tabular predictors. Deep Learning (DL) models are state-of-the-art for image classification tasks but frequently treat ordinal outcomes as unordered and lack interpretability. In contrast, classical ordinal regression models consider the outcome's order… ▽ More

    Submitted 20 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 41 pages (incl. appendix, figures and literature), 11 figures in main text, 4 figures in appendix

  12. arXiv:2008.06332  [pdf

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    Integrating uncertainty in deep neural networks for MRI based stroke analysis

    Authors: Lisa Herzog, Elvis Murina, Oliver Dürr, Susanne Wegener, Beate Sick

    Abstract: At present, the majority of the proposed Deep Learning (DL) methods provide point predictions without quantifying the models uncertainty. However, a quantification of the reliability of automated image analysis is essential, in particular in medicine when physicians rely on the results for making critical treatment decisions. In this work, we provide an entire framework to diagnose ischemic stroke… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 21 pages, 13 figures

    Journal ref: Medical Image Analysis (2020): 101790

  13. arXiv:2007.03293  [pdf, other

    cs.LG stat.ML

    Single Shot MC Dropout Approximation

    Authors: Kai Brach, Beate Sick, Oliver Dürr

    Abstract: Deep neural networks (DNNs) are known for their high prediction performance, especially in perceptual tasks such as object recognition or autonomous driving. Still, DNNs are prone to yield unreliable predictions when encountering completely new situations without indicating their uncertainty. Bayesian variants of DNNs (BDNNs), such as MC dropout BDNNs, do provide uncertainty measures. However, BDN… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  14. arXiv:2004.00464  [pdf, other

    stat.ML cs.LG

    Deep transformation models: Tackling complex regression problems with neural network based transformation models

    Authors: Beate Sick, Torsten Hothorn, Oliver Dürr

    Abstract: We present a deep transformation model for probabilistic regression. Deep learning is known for outstandingly accurate predictions on complex data but in regression tasks, it is predominantly used to just predict a single number. This ignores the non-deterministic character of most tasks. Especially if crucial decisions are based on the predictions, like in medical applications, it is essential to… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

  15. arXiv:1807.04001  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Learning Neural Models for End-to-End Clustering

    Authors: Benjamin Bruno Meier, Ismail Elezi, Mohammadreza Amirian, Oliver Durr, Thilo Stadelmann

    Abstract: We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters $k$, and for each $1 \leq k \leq k_\mathrm{max}$, a distribution over the individual cluster assignment for each data point. The network is trained in advance in a supervised fashi… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for publication on ANNPR 2018