Skip to main content

Showing 1–28 of 28 results for author: Mayr, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.15041  [pdf, other

    stat.ME

    Boosting Distributional Copula Regression for Bivariate Right-Censored Time-to-Event Data

    Authors: Guillermo Briseno-Sanchez, Nadja Klein, Andreas Groll, Andreas Mayr

    Abstract: We propose a highly flexible distributional copula regression model for bivariate time-to-event data in the presence of right-censoring. The joint survival function of the response is constructed using parametric copulas, allowing for a separate specification of the dependence structure between the time-to-event outcome variables and their respective marginal survival distributions. The latter are… ▽ More

    Submitted 20 December, 2024; v1 submitted 19 December, 2024; originally announced December 2024.

  2. arXiv:2406.03900  [pdf, other

    stat.ME stat.AP

    Enhanced variable selection for boosting sparser and less complex models in distributional copula regression

    Authors: Annika Strömer, Nadja Klein, Christian Staerk, Florian Faschingbauer, Hannah Klinkhammer, Andreas Mayr

    Abstract: Structured additive distributional copula regression allows to model the joint distribution of multivariate outcomes by relating all distribution parameters to covariates. Estimation via statistical boosting enables accounting for high-dimensional data and incorporating data-driven variable selection, both of which are useful given the complexity of the model class. However, as known from univaria… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2404.08331  [pdf, other

    stat.ME

    A Balanced Statistical Boosting Approach for GAMLSS via New Step Lengths

    Authors: Alexandra Daub, Andreas Mayr, Boyao Zhang, Elisabeth Bergherr

    Abstract: Component-wise gradient boosting algorithms are popular for their intrinsic variable selection and implicit regularization, which can be especially beneficial for very flexible model classes. When estimating generalized additive models for location, scale and shape (GAMLSS) by means of a component-wise gradient boosting algorithm, an important part of the estimation procedure is to determine the r… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 34 pages, 26 figures

  4. arXiv:2403.04747  [pdf, other

    cs.LG cs.AI stat.ML

    GNN-VPA: A Variance-Preserving Aggregation Strategy for Graph Neural Networks

    Authors: Lisa Schneckenreiter, Richard Freinschlag, Florian Sestak, Johannes Brandstetter, Günter Klambauer, Andreas Mayr

    Abstract: Graph neural networks (GNNs), and especially message-passing neural networks, excel in various domains such as physics, drug discovery, and molecular modeling. The expressivity of GNNs with respect to their ability to discriminate non-isomorphic graphs critically depends on the functions employed for message aggregation and graph-level readout. By applying signal propagation theory, we propose a v… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at ICLR 2024 (Tiny Papers Track)

  5. arXiv:2403.02194  [pdf, other

    stat.ME

    Boosting Distributional Copula Regression for Bivariate Binary, Discrete and Mixed Responses

    Authors: Guillermo Briseño Sanchez, Nadja Klein, Hannah Klinkhammer, Andreas Mayr

    Abstract: Motivated by challenges in the analysis of biomedical data and observational studies, we develop statistical boosting for the general class of bivariate distributional copula regression with arbitrary marginal distributions, which is suited to model binary, count, continuous or mixed outcomes. In our framework, the joint distribution of arbitrary, bivariate responses is modelled through a parametr… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2207.08470  [pdf, other

    stat.ME

    Boosting Multivariate Structured Additive Distributional Regression Models

    Authors: Annika Strömer, Nadja Klein, Christian Staerk, Hannah Klinkhammer, Andreas Mayr

    Abstract: We develop a model-based boosting approach for multivariate distributional regression within the framework of generalized additive models for location, scale, and shape. Our approach enables the simultaneous modeling of all distribution parameters of an arbitrary parametric distribution of a multivariate response conditional on explanatory variables, while being applicable to potentially high-dime… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  7. arXiv:2202.12851  [pdf, other

    stat.ME

    Boosting Distributional Copula Regression

    Authors: Nicolai Hans, Nadja Klein, Florian Faschingbauer, Michael Schneider, Andreas Mayr

    Abstract: Capturing complex dependence structures between outcome variables (e.g., study endpoints) is of high relevance in contemporary biomedical data problems and medical research. Distributional copula regression provides a flexible tool to model the joint distribution of multiple outcome variables by disentangling the marginal response distributions and their dependence structure. In a regression setup… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  8. arXiv:2202.01657  [pdf, other

    stat.ME stat.AP

    Deselection of Base-Learners for Statistical Boosting -- with an Application to Distributional Regression

    Authors: Annika Strömer, Christian Staerk, Nadja Klein, Leonie Weinhold, Stephanie Titze, Andreas Mayr

    Abstract: We present a new procedure for enhanced variable selection for component-wise gradient boosting. Statistical boosting is a computational approach that emerged from machine learning, which allows to fit regression models in the presence of high-dimensional data. Furthermore, the algorithm can lead to data-driven variable selection. In practice, however, the final models typically tend to include to… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  9. arXiv:2109.02599  [pdf, other

    q-bio.PE stat.AP

    Estimating the course of the COVID-19 pandemic in Germany via spline-based hierarchical modelling of death counts

    Authors: Tobias Wistuba, Andreas Mayr, Christian Staerk

    Abstract: The effective reproduction number is a key figure to monitor the course of the COVID-19 pandemic. In this study we consider a retrospective modelling approach for estimating the effective reproduction number based on death counts during the first year of the pandemic in Germany. The proposed Bayesian hierarchical model incorporates splines to estimate reproduction numbers flexibly over time while… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  10. arXiv:2106.11299  [pdf, other

    cs.LG cs.AI stat.ML

    Boundary Graph Neural Networks for 3D Simulations

    Authors: Andreas Mayr, Sebastian Lehner, Arno Mayrhofer, Christoph Kloss, Sepp Hochreiter, Johannes Brandstetter

    Abstract: The abundance of data has given machine learning considerable momentum in natural sciences and engineering, though modeling of physical processes is often difficult. A particularly tough problem is the efficient representation of geometric boundaries. Triangularized geometric boundaries are well understood and ubiquitous in engineering applications. However, it is notoriously difficult to integrat… ▽ More

    Submitted 20 April, 2023; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: accepted for presentation at the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23)

  11. arXiv:2105.01636  [pdf, other

    cs.LG stat.ML

    Learning 3D Granular Flow Simulations

    Authors: Andreas Mayr, Sebastian Lehner, Arno Mayrhofer, Christoph Kloss, Sepp Hochreiter, Johannes Brandstetter

    Abstract: Recently, the application of machine learning models has gained momentum in natural sciences and engineering, which is a natural fit due to the abundance of data in these fields. However, the modeling of physical processes from simulation data without first principle solutions remains difficult. Here, we present a Graph Neural Networks approach towards accurate modeling of complex 3D granular flow… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  12. arXiv:2011.02420  [pdf, other

    q-bio.PE stat.AP

    Estimating effective infection fatality rates during the course of the COVID-19 pandemic in Germany

    Authors: Christian Staerk, Tobias Wistuba, Andreas Mayr

    Abstract: The infection fatality rate (IFR) of the Coronavirus Disease 2019 (COVID-19) is one of the most discussed figures in the context of this pandemic. Using German COVID-19 surveillance data and age-group specific IFR estimates from multiple international studies, this work investigates time-dependent variations in effective IFR over the course of the pandemic. Three different methods for estimating (… ▽ More

    Submitted 21 January, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  13. arXiv:2004.00979  [pdf, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    Large-scale ligand-based virtual screening for SARS-CoV-2 inhibitors using deep neural networks

    Authors: Markus Hofmarcher, Andreas Mayr, Elisabeth Rumetshofer, Peter Ruch, Philipp Renz, Johannes Schimunek, Philipp Seidl, Andreu Vall, Michael Widrich, Sepp Hochreiter, Günter Klambauer

    Abstract: Due to the current severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic, there is an urgent need for novel therapies and drugs. We conducted a large-scale virtual screening for small molecules that are potential CoV-2 inhibitors. To this end, we utilized "ChemAI", a deep neural network trained on more than 220M data points across 3.6M molecules from three public drug-discovery dat… ▽ More

    Submitted 17 August, 2020; v1 submitted 25 March, 2020; originally announced April 2020.

    Comments: Additional results added. Various corrections to formulations and typos

  14. arXiv:1901.09775  [pdf, other

    stat.AP

    RefCurv: A Software for the Construction of Pediatric Reference Curves

    Authors: Christian Winkler, Katharina Linden, Andreas Mayr, Thomas Schultz, Thomas Welchowski, Johannes Breuer, Ulrike Herberg

    Abstract: In medicine, reference curves serve as an important tool for everyday clinical practice. Pediatricians assess the growth process of children with the help of percentile curves serving as norm references. The mathematical methods for the construction of these reference curves are sophisticated and often require technical knowledge beyond the scope of physicians. An easy-to-use software for life sci… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: You can find more information about the software (tutorials, link to the source code, etc.) on https://refcurv.com

  15. arXiv:1810.10239  [pdf, other

    stat.ME

    Extension of the Gradient Boosting Algorithm for Joint Modeling of Longitudinal and Time-to-Event data

    Authors: Colin Griesbach, Andreas Mayr, Elisabeth Waldmann

    Abstract: In various data situations joint models are an efficient tool to analyze relationships between time dependent covariates and event times or to correct for event-dependent dropout occurring in regression analysis. Joint modeling connects a longitudinal and a survival submodel within a single joint likelihood which then can be maximized by standard optimization methods. Main burdens of these convent… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

  16. arXiv:1710.02385  [pdf, ps, other

    stat.ME stat.CO

    Gradient boosting in Markov-switching generalized additive models for location, scale and shape

    Authors: Timo Adam, Andreas Mayr, Thomas Kneib

    Abstract: We propose a novel class of flexible latent-state time series regression models which we call Markov-switching generalized additive models for location, scale and shape. In contrast to conventional Markov-switching regression models, the presented methodology allows us to model different state-dependent parameters of the response distribution - not only the mean, but also variance, skewness and ku… ▽ More

    Submitted 17 May, 2018; v1 submitted 6 October, 2017; originally announced October 2017.

  17. arXiv:1706.02515  [pdf, other

    cs.LG stat.ML

    Self-Normalizing Neural Networks

    Authors: Günter Klambauer, Thomas Unterthiner, Andreas Mayr, Sepp Hochreiter

    Abstract: Deep Learning has revolutionized vision via convolutional neural networks (CNNs) and natural language processing via recurrent neural networks (RNNs). However, success stories of Deep Learning with standard feed-forward neural networks (FNNs) are rare. FNNs that perform well are typically shallow and, therefore cannot exploit many levels of abstract representations. We introduce self-normalizing n… ▽ More

    Submitted 7 September, 2017; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: 9 pages (+ 93 pages appendix)

    Journal ref: Advances in Neural Information Processing Systems 30 (NIPS 2017)

  18. arXiv:1702.08185  [pdf, ps, other

    stat.AP stat.CO stat.ML

    An update on statistical boosting in biomedicine

    Authors: Andreas Mayr, Benjamin Hofner, Elisabeth Waldmann, Tobias Hepp, Olaf Gefeller, Matthias Schmid

    Abstract: Statistical boosting algorithms have triggered a lot of research during the last decade. They combine a powerful machine-learning approach with classical statistical modelling, offering various practical advantages like automated variable selection and implicit regularization of effect estimates. They are extremely flexible, as the underlying base-learners (regression functions defining the type o… ▽ More

    Submitted 27 February, 2017; originally announced February 2017.

  19. arXiv:1702.04561  [pdf, other

    stat.ML stat.CO

    Probing for sparse and fast variable selection with model-based boosting

    Authors: Janek Thomas, Tobias Hepp, Andreas Mayr, Bernd Bischl

    Abstract: We present a new variable selection method based on model-based gradient boosting and randomly permuted variables. Model-based boosting is a tool to fit a statistical model while performing variable selection at the same time. A drawback of the fitting lies in the need of multiple model fits on slightly altered data (e.g. cross-validation or bootstrap) to find the optimal number of boosting iterat… ▽ More

    Submitted 15 February, 2017; originally announced February 2017.

    Comments: 14 pages, 2 figures

  20. Stability selection for component-wise gradient boosting in multiple dimensions

    Authors: Janek Thomas, Andreas Mayr, Bernd Bischl, Matthias Schmid, Adam Smith, Benjamin Hofner

    Abstract: We present a new algorithm for boosting generalized additive models for location, scale and shape (GAMLSS) that allows to incorporate stability selection, an increasingly popular way to obtain stable sets of covariates while controlling the per-family error rate (PFER). The model is fitted repeatedly to subsampled data and variables with high selection frequencies are extracted. To apply stability… ▽ More

    Submitted 30 November, 2016; originally announced November 2016.

    Comments: 16 pages

  21. arXiv:1609.02686  [pdf, other

    stat.ML stat.ME

    Boosting Joint Models for Longitudinal and Time-to-Event Data

    Authors: Elisabeth Waldmann, David Taylor-Robinson, Nadja Klein, Thomas Kneib, Tania Pressler, Matthias Schmid, Andreas Mayr

    Abstract: Joint Models for longitudinal and time-to-event data have gained a lot of attention in the last few years as they are a helpful technique to approach common a data structure in clinical studies where longitudinal outcomes are recorded alongside event times. Those two processes are often linked and the two outcomes should thus be modeled jointly in order to prevent the potential bias introduced by… ▽ More

    Submitted 22 December, 2016; v1 submitted 9 September, 2016; originally announced September 2016.

  22. arXiv:1605.04281  [pdf, other

    stat.CO

    Signal Regression Models for Location, Scale and Shape with an Application to Stock Returns

    Authors: Sarah Brockhaus, Andreas Fuest, Andreas Mayr, Sonja Greven

    Abstract: We discuss scalar-on-function regression models where all parameters of the assumed response distribution can be modeled depending on covariates. We thus combine signal regression models with generalized additive models for location, scale and shape (GAMLSS). We compare two fundamentally different methods for estimation, a gradient boosting and a penalized likelihood based approach, and address pr… ▽ More

    Submitted 13 May, 2016; originally announced May 2016.

  23. arXiv:1503.01445  [pdf, other

    stat.ML cs.LG cs.NE q-bio.BM

    Toxicity Prediction using Deep Learning

    Authors: Thomas Unterthiner, Andreas Mayr, Günter Klambauer, Sepp Hochreiter

    Abstract: Everyday we are exposed to various chemicals via food additives, cleaning and cosmetic products and medicines -- and some of them might be toxic. However testing the toxicity of all existing compounds by biological experiments is neither financially nor logistically feasible. Therefore the government agencies NIH, EPA and FDA launched the Tox21 Data Challenge within the "Toxicology in the 21st Cen… ▽ More

    Submitted 4 March, 2015; originally announced March 2015.

  24. arXiv:1502.06464  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Rectified Factor Networks

    Authors: Djork-Arné Clevert, Andreas Mayr, Thomas Unterthiner, Sepp Hochreiter

    Abstract: We propose rectified factor networks (RFNs) to efficiently construct very sparse, non-linear, high-dimensional representations of the input. RFN models identify rare and small events in the input, have a low interference between code units, have a small reconstruction error, and explain the data covariance structure. RFN learning is a generalized alternating minimization algorithm derived from the… ▽ More

    Submitted 11 June, 2015; v1 submitted 23 February, 2015; originally announced February 2015.

    Comments: 9 pages + 49 pages supplement

    Journal ref: Advances in Neural Information Processing Systems 28 (NIPS 2015)

  25. arXiv:1407.1774  [pdf, other

    stat.CO

    gamboostLSS: An R Package for Model Building and Variable Selection in the GAMLSS Framework

    Authors: Benjamin Hofner, Andreas Mayr, Matthias Schmid

    Abstract: Generalized additive models for location, scale and shape (GAMLSS) are a flexible class of regression models that allow to model multiple parameters of a distribution function, such as the mean and the standard deviation, simultaneously. With the R package gamboostLSS, we provide a boosting method to fit these models. Variable selection and model choice are naturally available within this regulari… ▽ More

    Submitted 7 July, 2014; originally announced July 2014.

  26. Extending Statistical Boosting - An Overview of Recent Methodological Developments

    Authors: Andreas Mayr, Harald Binder, Olaf Gefeller, Matthias Schmid

    Abstract: Boosting algorithms to simultaneously estimate and select predictor effects in statistical models have gained substantial interest during the last decade. This review article aims to highlight recent methodological developments regarding boosting algorithms for statistical modelling especially focusing on topics relevant for biomedical research. We suggest a unified framework for gradient boosting… ▽ More

    Submitted 18 November, 2014; v1 submitted 7 March, 2014; originally announced March 2014.

    Journal ref: Methods Inf Med 2014; 53(6): 428-435

  27. The Evolution of Boosting Algorithms - From Machine Learning to Statistical Modelling

    Authors: Andreas Mayr, Harald Binder, Olaf Gefeller, Matthias Schmid

    Abstract: The concept of boosting emerged from the field of machine learning. The basic idea is to boost the accuracy of a weak classifying tool by combining various instances into a more accurate prediction. This general concept was later adapted to the field of statistical modelling. This review article attempts to highlight this evolution of boosting algorithms from machine learning to statistical modell… ▽ More

    Submitted 18 November, 2014; v1 submitted 6 March, 2014; originally announced March 2014.

    Journal ref: Methods Inf Med 2014; 53(6): 419-427

  28. arXiv:1307.6417  [pdf, ps, other

    stat.AP stat.ME stat.ML

    Boosting the concordance index for survival data - a unified framework to derive and evaluate biomarker combinations

    Authors: Andreas Mayr, Matthias Schmid

    Abstract: The development of molecular signatures for the prediction of time-to-event outcomes is a methodologically challenging task in bioinformatics and biostatistics. Although there are numerous approaches for the derivation of marker combinations and their evaluation, the underlying methodology often suffers from the problem that different optimization criteria are mixed during the feature selection, e… ▽ More

    Submitted 25 October, 2013; v1 submitted 24 July, 2013; originally announced July 2013.

    Comments: revised manuscript - added simulation study, additional results

    Journal ref: PloS ONE 2014, 9(1): e84483