Skip to main content

Showing 1–50 of 51 results for author: Rügamer, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.14329  [pdf, ps, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Adjustment for Confounding using Pre-Trained Representations

    Authors: Rickmer Schulte, David Rügamer, Thomas Nagler

    Abstract: There is growing interest in extending average treatment effect (ATE) estimation to incorporate non-tabular data, such as images and text, which may act as sources of confounding. Neglecting these effects risks biased results and flawed scientific conclusions. However, incorporating non-tabular data necessitates sophisticated feature extractors, often in combination with ideas of transfer learning… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Accepted at ICML 2025

  2. arXiv:2506.05088  [pdf, other

    cs.LG stat.CO

    Semi-Implicit Variational Inference via Kernelized Path Gradient Descent

    Authors: Tobias Pielok, Bernd Bischl, David Rügamer

    Abstract: Semi-implicit variational inference (SIVI) is a powerful framework for approximating complex posterior distributions, but training with the Kullback-Leibler (KL) divergence can be challenging due to high variance and bias in high-dimensional settings. While current state-of-the-art semi-implicit variational inference methods, particularly Kernel Semi-Implicit Variational Inference (KSIVI), have be… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Preliminary version

    MSC Class: 62F15; 68T07 ACM Class: I.2.6; G.3

  3. arXiv:2506.03839  [pdf, other

    cs.LG stat.ML

    Revisiting Unbiased Implicit Variational Inference

    Authors: Tobias Pielok, Bernd Bischl, David Rügamer

    Abstract: Recent years have witnessed growing interest in semi-implicit variational inference (SIVI) methods due to their ability to rapidly generate samples from complex distributions. However, since the likelihood of these samples is non-trivial to estimate in high dimensions, current research focuses on finding effective SIVI training routines. Although unbiased implicit variational inference (UIVI) has… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to ICML 2025

    MSC Class: 62F15; 68T07 ACM Class: I.2.6; G.3

  4. arXiv:2505.14164  [pdf, ps, other

    stat.ML cs.LG stat.AP stat.ME

    Hybrid Bernstein Normalizing Flows for Flexible Multivariate Density Regression with Interpretable Marginals

    Authors: Marcel Arpogaus, Thomas Kneib, Thomas Nagler, David Rügamer

    Abstract: Density regression models allow a comprehensive understanding of data by modeling the complete conditional probability distribution. While flexible estimation approaches such as normalizing flows (NF) work particularly well in multiple dimensions, interpreting the input-output relationship of such models is often difficult, due to the black-box character of deep learning models. In contrast, exist… ▽ More

    Submitted 12 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  5. arXiv:2505.11325  [pdf, other

    stat.ME cs.AI cs.LG stat.CO stat.ML

    Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

    Authors: Thomas Nagler, David Rügamer

    Abstract: Prior-data fitted networks (PFNs) have emerged as promising foundation models for prediction from tabular data sets, achieving state-of-the-art performance on small to moderate data sizes without tuning. While PFNs are motivated by Bayesian ideas, they do not provide any uncertainty quantification for predictive means, quantiles, or similar quantities. We propose a principled and efficient samplin… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  6. arXiv:2503.09244  [pdf, other

    cs.CV q-bio.QM stat.AP

    How To Make Your Cell Tracker Say "I dunno!"

    Authors: Richard D. Paul, Johannes Seiffarth, David Rügamer, Hanno Scharr, Katharina Nöh

    Abstract: Cell tracking is a key computational task in live-cell microscopy, but fully automated analysis of high-throughput imaging requires reliable and, thus, uncertainty-aware data analysis tools, as the amount of data recorded within a single experiment exceeds what humans are able to overlook. We here propose and benchmark various methods to reason about and quantify uncertainty in linear assignment-b… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  7. arXiv:2503.05538  [pdf, other

    cs.LG stat.ML

    Additive Model Boosting: New Insights and Path(ologie)s

    Authors: Rickmer Schulte, David Rügamer

    Abstract: Additive models (AMs) have sparked a lot of interest in machine learning recently, allowing the incorporation of interpretable structures into a wide range of model classes. Many commonly used approaches to fit a wide variety of potentially complex additive models build on the idea of boosting additive models. While boosted additive models (BAMs) work well in practice, certain theoretical aspects… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  8. arXiv:2503.03382  [pdf, other

    cs.LG stat.ML

    Paths and Ambient Spaces in Neural Loss Landscapes

    Authors: Daniel Dold, Julius Kobialka, Nicolai Palm, Emanuel Sommer, David Rügamer, Oliver Dürr

    Abstract: Understanding the structure of neural network loss surfaces, particularly the emergence of low-loss tunnels, is critical for advancing neural network theory and practice. In this paper, we propose a novel approach to directly embed loss tunnels into the loss landscape of neural networks. Exploring the properties of these loss tunnels offers new insights into their length and structure and sheds li… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 9 pages, Accepted at AISTATS 2025

  9. arXiv:2502.02496  [pdf, other

    cs.LG stat.ML

    Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries

    Authors: Chris Kolb, Tobias Weber, Bernd Bischl, David Rügamer

    Abstract: Sparse regularization techniques are well-established in machine learning, yet their application in neural networks remains challenging due to the non-differentiability of penalties like the $L_1$ norm, which is incompatible with stochastic gradient descent. A promising alternative is shallow weight factorization, where weights are decomposed into two factors, allowing for smooth optimization of… ▽ More

    Submitted 7 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: accepted at ICLR 2025

  10. arXiv:2410.05430  [pdf, other

    cs.LG stat.AP stat.CO stat.ML

    A Functional Extension of Semi-Structured Networks

    Authors: David Rügamer, Bernard X. W. Liew, Zainab Altai, Almond Stöcker

    Abstract: Semi-structured networks (SSNs) merge the structures familiar from additive models with deep neural networks, allowing the modeling of interpretable partial feature effects while capturing higher-order non-linearities at the same time. A significant challenge in this integration is maintaining the interpretability of the additive model component. Inspired by large-scale biomechanics datasets, this… ▽ More

    Submitted 13 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS 2024

  11. arXiv:2407.18650  [pdf, other

    stat.ML cs.LG

    Achieving interpretable machine learning by functional decomposition of black-box models into explainable predictor effects

    Authors: David Köhler, David Rügamer, Matthias Schmid

    Abstract: Machine learning (ML) has seen significant growth in both popularity and importance. The high prediction accuracy of ML models is often achieved through complex black-box architectures that are difficult to interpret. This interpretability problem has been hindering the use of ML in fields like medicine, ecology and insurance, where an understanding of the inner workings of the model is paramount… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  12. arXiv:2405.05429  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    How Inverse Conditional Flows Can Serve as a Substitute for Distributional Regression

    Authors: Lucas Kook, Chris Kolb, Philipp Schiele, Daniel Dold, Marcel Arpogaus, Cornelius Fritz, Philipp F. Baumann, Philipp Kopper, Tobias Pielok, Emilio Dorigatti, David Rügamer

    Abstract: Neural network representations of simple models, such as linear regression, are being studied increasingly to better understand the underlying principles of deep learning algorithms. However, neural representations of distributional regression models, such as the Cox model, have received little attention so far. We close this gap by proposing a framework for distributional regression using inverse… ▽ More

    Submitted 10 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at UAI 2024 https://www.auai.org/uai2024/accepted_papers

  13. arXiv:2405.02475  [pdf, other

    cs.LG cs.AI stat.CO stat.ME

    Generalizing Orthogonalization for Models with Non-Linearities

    Authors: David Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler

    Abstract: The complexity of black-box algorithms can lead to various challenges, including the introduction of biases. These biases present immediate risks in the algorithms' application. It was, for instance, shown that neural networks can deduce racial information solely from a patient's X-ray scan, a task beyond the capability of medical experts. If this fact is not known to the medical expert, automatic… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  14. arXiv:2405.02200  [pdf, other

    cs.LG stat.ML

    Position: Why We Must Rethink Empirical Research in Machine Learning

    Authors: Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger, Giuseppe Casalicchio, Marcel Wever, Matthias Feurer, David Rügamer, Eyke Hüllermeier, Anne-Laure Boulesteix, Bernd Bischl

    Abstract: We warn against a common but incomplete understanding of empirical research in machine learning that leads to non-replicable results, makes findings unreliable, and threatens to undermine progress in the field. To overcome this alarming situation, we call for more awareness of the plurality of ways of gaining knowledge experimentally but also of some epistemic limitations. In particular, we argue… ▽ More

    Submitted 25 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 20 pages, accepted for publication at ICML 2024, camera-ready version

  15. arXiv:2403.13150  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    On Training Survival Models with Scoring Rules

    Authors: Philipp Kopper, David Rügamer, Raphael Sonabend, Bernd Bischl, Andreas Bender

    Abstract: Scoring rules are an established way of comparing predictive performances across model classes. In the context of survival analysis, they require adaptation in order to accommodate censoring. This work investigates using scoring rules for model training rather than evaluation. Doing so, we establish a general framework for training survival models that is model agnostic and can learn event time di… ▽ More

    Submitted 13 November, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  16. arXiv:2403.10923  [pdf, other

    cs.LG cs.AI stat.CO stat.ML

    Interpretable Machine Learning for TabPFN

    Authors: David Rundel, Julius Kobialka, Constantin von Crailsheim, Matthias Feurer, Thomas Nagler, David Rügamer

    Abstract: The recently developed Prior-Data Fitted Networks (PFNs) have shown very promising results for applications in low-data regimes. The TabPFN model, a special case of PFNs for tabular data, is able to achieve state-of-the-art performance on a variety of classification tasks while producing posterior predictive distributions in mere seconds by in-context learning without the need for learning paramet… ▽ More

    Submitted 23 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Explainable Artificial Intelligence, and is available online at https://doi.org/10.1007/978-3-031-63797-1_23

  17. arXiv:2402.01484  [pdf, other

    cs.LG stat.CO stat.ML

    Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?

    Authors: Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer

    Abstract: A major challenge in sample-based inference (SBI) for Bayesian neural networks is the size and structure of the networks' parameter space. Our work shows that successful SBI is possible by embracing the characteristic relationship between weight and function space, uncovering a systematic link between overparameterization and the difficulty of the sampling problem. Through extensive experiments, w… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  18. arXiv:2402.01090  [pdf, other

    stat.ML cs.LG stat.CO

    Scalable Higher-Order Tensor Product Spline Models

    Authors: David Rügamer

    Abstract: In the current era of vast data and transparent machine learning, it is essential for techniques to operate at a large scale while providing a clear mathematical comprehension of the internal workings of the method. Although there already exist interpretable semi-parametric regression methods for large-scale applications that take into account non-linearity in the data, the complexity of the model… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024. arXiv admin note: substantial text overlap with arXiv:2205.14515

  19. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  20. arXiv:2401.12950  [pdf, other

    cs.LG stat.ML

    Bayesian Semi-structured Subspace Inference

    Authors: Daniel Dold, David Rügamer, Beate Sick, Oliver Dürr

    Abstract: Semi-structured regression models enable the joint modeling of interpretable structured and complex unstructured feature effects. The structured model part is inspired by statistical models and can be used to infer the input-output relationship for features of particular importance. The complex unstructured part defines an arbitrary deep neural network and thereby provides enough flexibility to ac… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted at AISTATS 2024

  21. arXiv:2312.05523  [pdf, other

    stat.ME stat.AP

    Functional Data Analysis: An Introduction and Recent Developments

    Authors: Jan Gertheiss, David Rügamer, Bernard X. W. Liew, Sonja Greven

    Abstract: Functional data analysis (FDA) is a statistical framework that allows for the analysis of curves, images, or functions on higher dimensional domains. The goals of FDA, such as descriptive analyses, classification, and regression, are generally the same as for statistical analyses of scalar-valued or multivariate data, but FDA brings additional challenges due to the high- and infinite dimensionalit… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  22. arXiv:2311.01349  [pdf, other

    cs.LG cs.CY stat.ML

    Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Purpose: To analyze and remove protected feature effects in chest radiograph embeddings of deep learning models. Methods: An orthogonalization is utilized to remove the influence of protected features (e.g., age, sex, race) in CXR embeddings, ensuring feature-independent results. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trai… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  23. arXiv:2307.03571  [pdf, other

    cs.LG math.OC stat.ML

    Smoothing the Edges: Smooth Optimization for Sparse Regularization using Hadamard Overparametrization

    Authors: Chris Kolb, Christian L. Müller, Bernd Bischl, David Rügamer

    Abstract: We present a framework for smooth optimization of explicitly regularized objectives for (structured) sparsity. These non-smooth and possibly non-convex problems typically rely on solvers tailored to specific models and regularizers. In contrast, our method enables fully differentiable and approximation-free optimization and is thus compatible with the ubiquitous gradient descent paradigm in deep l… ▽ More

    Submitted 26 April, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  24. arXiv:2306.00522  [pdf, other

    cs.LG stat.ML

    A New PHO-rmula for Improved Performance of Semi-Structured Networks

    Authors: David Rügamer

    Abstract: Recent advances to combine structured regression models and deep neural networks for better interpretability, more expressiveness, and statistically valid uncertainty quantification demonstrate the versatility of semi-structured neural networks (SSNs). We show that techniques to properly identify the contributions of the different model components in SSNs, however, lead to suboptimal network estim… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  25. arXiv:2304.02902  [pdf, other

    stat.ML cs.LG

    Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry

    Authors: Jonas Gregor Wiese, Lisa Wimmer, Theodore Papamarkou, Bernd Bischl, Stephan Günnemann, David Rügamer

    Abstract: Bayesian inference in deep neural networks is challenging due to the high-dimensional, strongly multi-modal parameter posterior density landscape. Markov chain Monte Carlo approaches asymptotically recover the true posterior but are considered prohibitively expensive for large modern architectures. Local methods, which have emerged as a popular alternative, focus on specific parameter regions that… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  26. arXiv:2302.02043  [pdf, other

    stat.CO

    mixdistreg: An R Package for Fitting Mixture of Experts Distributional Regression with Adaptive First-order Methods

    Authors: David Rügamer

    Abstract: This paper presents a high-level description of the R software package mixdistreg to fit mixture of experts distributional regression models. The proposed framework is implemented in R using the deepregression software template, which is based on TensorFlow and follows the neural structured additive learning principle. The software comprises various approaches as special cases, including mixture d… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  27. arXiv:2211.13665  [pdf, other

    stat.CO

    Estimating Conditional Distributions with Neural Networks using R package deeptrafo

    Authors: Lucas Kook, Philipp FM Baumann, Oliver Dürr, Beate Sick, David Rügamer

    Abstract: Contemporary empirical applications frequently require flexible regression models for complex response types and large tabular or non-tabular, including image or text, data. Classical regression models either break down under the computational load of processing such data or require additional manual feature extraction to make these problems tractable. Here, we present deeptrafo, a package for fit… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at the Journal of Statistical Software

  28. arXiv:2211.09875  [pdf, other

    stat.CO

    Mixture of Experts Distributional Regression: Implementation Using Robust Estimation with Adaptive First-order Methods

    Authors: David Rügamer, Florian Pfisterer, Bernd Bischl, Bettina Grün

    Abstract: In this work, we propose an efficient implementation of mixtures of experts distributional regression models which exploits robust estimation by using stochastic first-order optimization techniques with adaptive learning rate schedulers. We take advantage of the flexibility and scalability of neural network software and implement the proposed framework in mixdistreg, an R software package that all… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2010.06889

  29. arXiv:2211.02730  [pdf, other

    stat.ML cs.LG

    Uncertainty-aware predictive modeling for fair data-driven decisions

    Authors: Patrick Kaiser, Christoph Kern, David Rügamer

    Abstract: Both industry and academia have made considerable progress in developing trustworthy and responsible machine learning (ML) systems. While critical concepts like fairness and explainability are often addressed, the safety of systems is typically not sufficiently taken into account. By viewing data-driven decision systems as socio-technical systems, we draw on the uncertainty in ML literature to sho… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  30. arXiv:2210.07723  [pdf, other

    stat.ML cs.CR cs.LG

    Privacy-Preserving and Lossless Distributed Estimation of High-Dimensional Generalized Additive Mixed Models

    Authors: Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: Various privacy-preserving frameworks that respect the individual's privacy in the analysis of data have been developed in recent years. However, available model classes such as simple statistics or generalized linear models lack the flexibility required for a good approximation of the underlying data-generating process in practice. In this paper, we propose an algorithm for a distributed, privacy… ▽ More

    Submitted 10 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  31. arXiv:2208.14919  [pdf, other

    cs.LG cs.NE stat.ML

    ARMA Cell: A Modular and Effective Approach for Neural Autoregressive Modeling

    Authors: Philipp Schiele, Christoph Berninger, David Rügamer

    Abstract: The autoregressive moving average (ARMA) model is a classical, and arguably one of the most studied approaches to model time series data. It has compelling theoretical properties and is widely used among practitioners. More recent deep learning approaches popularize recurrent neural networks (RNNs) and, in particular, Long Short-Term Memory (LSTM) cells that have become one of the best performing… ▽ More

    Submitted 11 January, 2024; v1 submitted 31 August, 2022; originally announced August 2022.

    ACM Class: G.3

  32. arXiv:2205.14515  [pdf, other

    stat.CO cs.LG stat.ML

    Additive Higher-Order Factorization Machines

    Authors: David Rügamer

    Abstract: In the age of big data and interpretable machine learning, approaches need to work at scale and at the same time allow for a clear mathematical understanding of the method's inner workings. While there exist inherently interpretable semi-parametric regression techniques for large-scale applications to account for non-linearity in the data, their model complexity is still often restricted. One of t… ▽ More

    Submitted 1 February, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

  33. arXiv:2205.13080  [pdf, other

    stat.ML cs.LG stat.CO

    Factorized Structured Regression for Large-Scale Varying Coefficient Models

    Authors: David Rügamer, Andreas Bender, Simon Wiegrebe, Daniel Racek, Bernd Bischl, Christian L. Müller, Clemens Stachl

    Abstract: Recommender Systems (RS) pervade many aspects of our everyday digital life. Proposed to work at scale, state-of-the-art RS allow the modeling of thousands of interactions and facilitate highly individualized recommendations. Conceptually, many RS can be viewed as instances of statistical regression models that incorporate complex feature effects and potentially non-Gaussian outcomes. Such structur… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  34. arXiv:2202.07423  [pdf, other

    stat.ML cs.LG

    DeepPAMM: Deep Piecewise Exponential Additive Mixed Models for Complex Hazard Structures in Survival Analysis

    Authors: Philipp Kopper, Simon Wiegrebe, Bernd Bischl, Andreas Bender, David Rügamer

    Abstract: Survival analysis (SA) is an active field of research that is concerned with time-to-event outcomes and is prevalent in many domains, particularly biomedical applications. Despite its importance, SA remains challenging due to small-scale data sets and complex outcome distributions, concealed by truncation and censoring processes. The piecewise exponential additive mixed model (PAMM) is a model cla… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: 13 pages, 2 figures, This work has been accepted by the 26th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2022)

  35. arXiv:2111.05303  [pdf, other

    cs.LG physics.ao-ph stat.ML

    Identifying the atmospheric drivers of drought and heat using a smoothed deep learning approach

    Authors: Magdalena Mittermeier, Maximilian Weigert, David Rügamer

    Abstract: Europe was hit by several, disastrous heat and drought events in recent summers. Besides thermodynamic influences, such hot and dry extremes are driven by certain atmospheric situations including anticyclonic conditions. Effects of climate change on atmospheric circulations are complex and many open research questions remain in this context, e.g., on future trends of anticyclonic conditions. Based… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021: Tackling Climate Change with Machine Learning

  36. arXiv:2110.03513  [pdf, other

    stat.CO cs.LG

    Accelerated Componentwise Gradient Boosting using Efficient Data Representation and Momentum-based Optimization

    Authors: Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: Componentwise boosting (CWB), also known as model-based boosting, is a variant of gradient boosting that builds on additive models as base learners to ensure interpretability. CWB is thus often used in research areas where models are employed as tools to explain relationships in data. One downside of CWB is its computational complexity in terms of memory and runtime. In this paper, we propose two… ▽ More

    Submitted 29 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

  37. arXiv:2109.05583  [pdf, ps, other

    stat.ML cs.LG

    Automatic Componentwise Boosting: An Interpretable AutoML System

    Authors: Stefan Coors, Daniel Schalk, Bernd Bischl, David Rügamer

    Abstract: In practice, machine learning (ML) workflows require various different steps, from data preprocessing, missing value imputation, model selection, to model tuning as well as model evaluation. Many of these steps rely on human ML experts. AutoML - the field of automating these ML pipelines - tries to help practitioners to apply ML off-the-shelf without any expert knowledge. Most modern AutoML system… ▽ More

    Submitted 16 October, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 6 pages, 4 figures, ECML-PKDD Workshop on Automating Data Science 2021

  38. arXiv:2104.02705  [pdf, other

    stat.ML cs.LG stat.CO

    deepregression: a Flexible Neural Network Framework for Semi-Structured Deep Distributional Regression

    Authors: David Rügamer, Chris Kolb, Cornelius Fritz, Florian Pfisterer, Philipp Kopper, Bernd Bischl, Ruolin Shen, Christina Bukas, Lisa Barros de Andrade e Sousa, Dominik Thalmeier, Philipp Baumann, Lucas Kook, Nadja Klein, Christian L. Müller

    Abstract: In this paper we describe the implementation of semi-structured deep distributional regression, a flexible framework to learn conditional distributions based on the combination of additive regression models and deep networks. Our implementation encompasses (1) a modular neural network building system based on the deep learning library \pkg{TensorFlow} for the fusion of various statistical and deep… ▽ More

    Submitted 10 March, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

  39. arXiv:2101.00661  [pdf, other

    cs.LG stat.AP

    Combining Graph Neural Networks and Spatio-temporal Disease Models to Predict COVID-19 Cases in Germany

    Authors: Cornelius Fritz, Emilio Dorigatti, David Rügamer

    Abstract: During 2020, the infection rate of COVID-19 has been investigated by many scholars from different research fields. In this context, reliable and interpretable forecasts of disease incidents are a vital tool for policymakers to manage healthcare resources. Several experts have called for the necessity to account for human mobility to explain the spread of COVID-19. Existing approaches are often app… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

  40. arXiv:2011.05824  [pdf, other

    cs.LG cs.AI stat.ML

    Semi-Structured Deep Piecewise Exponential Models

    Authors: Philipp Kopper, Sebastian Pölsterl, Christian Wachinger, Bernd Bischl, Andreas Bender, David Rügamer

    Abstract: We propose a versatile framework for survival analysis that combines advanced concepts from statistics with deep learning. The presented framework is based on piecewise exponential models and thereby supports various survival tasks, such as competing risks and multi-state modeling, and further allows for estimation of time-varying effects and time-varying features. To also include multiple data so… ▽ More

    Submitted 1 March, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: 8 pages, 3 figures, Accepted at the AAAI spring symposium: Survival Prediction

  41. Deep Conditional Transformation Models

    Authors: Philipp F. M. Baumann, Torsten Hothorn, David Rügamer

    Abstract: Learning the cumulative distribution function (CDF) of an outcome variable conditional on a set of features remains challenging, especially in high-dimensional settings. Conditional transformation models provide a semi-parametric approach that allows to model a large class of conditional CDFs without an explicit parametric distribution assumption and with only a few parameters. Existing estimation… ▽ More

    Submitted 6 April, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Journal ref: Machine Learning and Knowledge Discovery in Databases. Research Track. ECML PKDD 2021

  42. arXiv:2010.06889  [pdf, other

    stat.CO cs.LG stat.ML

    Neural Mixture Distributional Regression

    Authors: David Rügamer, Florian Pfisterer, Bernd Bischl

    Abstract: We present neural mixture distributional regression (NMDR), a holistic framework to estimate complex finite mixtures of distributional regressions defined by flexible additive predictors. Our framework is able to handle a large number of mixtures of potentially different distributions in high-dimensional settings, allows for efficient and scalable optimization and can be applied to recent concepts… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  43. arXiv:2007.07930  [pdf, other

    stat.ME stat.CO

    Selective Inference for Additive and Linear Mixed Models

    Authors: David Rügamer, Philipp F. M. Baumann, Sonja Greven

    Abstract: This work addresses the problem of conducting valid inference for additive and linear mixed models after model selection. One possible solution to overcome overconfident inference results after model selection is selective inference, which constitutes a post-selection inference framework, yielding valid inference statements by conditioning on the selection event. We extend recent work on selective… ▽ More

    Submitted 20 December, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

  44. arXiv:2006.15442  [pdf, other

    stat.ML cs.LG stat.CO

    A General Machine Learning Framework for Survival Analysis

    Authors: Andreas Bender, David Rügamer, Fabian Scheipl, Bernd Bischl

    Abstract: The modeling of time-to-event data, also known as survival analysis, requires specialized methods that can deal with censoring and truncation, time-varying features and effects, and that extend to settings with multiple competing events. However, many machine learning methods for survival analysis only consider the standard setting with right-censored data and proportional hazards assumption. The… ▽ More

    Submitted 17 April, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

  45. arXiv:2006.05750  [pdf, other

    stat.ME q-fin.RM stat.AP

    A Bayesian Time-Varying Autoregressive Model for Improved Short- and Long-Term Prediction

    Authors: Christoph Berninger, Almond Stöcker, David Rügamer

    Abstract: Motivated by the application to German interest rates, we propose a timevarying autoregressive model for short and long term prediction of time series that exhibit a temporary non-stationary behavior but are assumed to mean revert in the long run. We use a Bayesian formulation to incorporate prior assumptions on the mean reverting process in the model and thereby regularize predictions in the far… ▽ More

    Submitted 21 February, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Revised Introduction, results unchanged

    MSC Class: 47N30; 62M10; 62P20 ACM Class: G.3

  46. arXiv:2002.05777  [pdf, other

    stat.ML cs.LG stat.ME

    Semi-Structured Distributional Regression -- Extending Structured Additive Models by Arbitrary Deep Neural Networks and Data Modalities

    Authors: David Rügamer, Chris Kolb, Nadja Klein

    Abstract: Combining additive models and neural networks allows to broaden the scope of statistical regression and extend deep learning-based approaches by interpretable structured additive predictors at the same time. Existing attempts uniting the two modeling approaches are, however, limited to very specific combinations and, more importantly, involve an identifiability issue. As a consequence, interpretab… ▽ More

    Submitted 9 July, 2022; v1 submitted 13 February, 2020; originally announced February 2020.

  47. arXiv:1805.01852  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Inference for $L_2$-Boosting

    Authors: David Rügamer, Sonja Greven

    Abstract: We propose a statistical inference framework for the component-wise functional gradient descent algorithm (CFGD) under normality assumption for model errors, also known as $L_2$-Boosting. The CFGD is one of the most versatile tools to analyze data, because it scales well to high-dimensional data sets, allows for a very flexible definition of additive regression models and incorporates inbuilt vari… ▽ More

    Submitted 4 June, 2019; v1 submitted 4 May, 2018; originally announced May 2018.

  48. arXiv:1803.05664  [pdf, other

    stat.CO stat.AP

    Conditional Model Selection in Mixed-Effects Models with cAIC4

    Authors: Benjamin Säfken, David Rügamer, Thomas Kneib, Sonja Greven

    Abstract: Model selection in mixed models based on the conditional distribution is appropriate for many practical applications and has been a focus of recent statistical research. In this paper we introduce the R-package cAIC4 that allows for the computation of the conditional Akaike Information Criterion (cAIC). Computation of the conditional AIC needs to take into account the uncertainty of the random eff… ▽ More

    Submitted 17 March, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

  49. arXiv:1706.09796  [pdf, other

    stat.ME stat.AP

    Selective inference after likelihood- or test-based model selection in linear models

    Authors: David Rügamer, Sonja Greven

    Abstract: Statistical inference after model selection requires an inference framework that takes the selection into account in order to be valid. Following recent work on selective inference, we derive analytical expressions for inference after likelihood- or test-based model selection for linear models.

    Submitted 23 September, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

  50. arXiv:1705.10662  [pdf, other

    stat.CO

    Boosting Functional Regression Models with FDboost

    Authors: Sarah Brockhaus, David Rügamer, Sonja Greven

    Abstract: The R add-on package FDboost is a flexible toolbox for the estimation of functional regression models by model-based boosting. It provides the possibility to fit regression models for scalar and functional response with effects of scalar as well as functional covariates, i.e., scalar-on-function, function-on-scalar and function-on-function regression models. In addition to mean regression, quantil… ▽ More

    Submitted 26 April, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: Revised version