Showing 1–50 of 110 results for author: Pauly, M

  1. arXiv:2505.14435  [pdf]

    cs.CY cs.AI

    Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI

    Authors: Annika Bush, Meltem Aksoy, Markus Pauly, Greta Ontrup

    Abstract: As organizations increasingly rely on AI systems for decision support in sustainability contexts, it becomes critical to understand the inherent biases and perspectives embedded in Large Language Models (LLMs). This study systematically investigates how five state-of-the-art LLMs -- Claude, DeepSeek, GPT, LLaMA, and Mistral -- conceptualize sustainability and its relationship with AI. We administer…

    Submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2505.07117  [pdf, other]

    eess.SY eess.SP physics.med-ph

    OPTIKS: Optimized Gradient Properties Through Timing in K-Space

    Authors: Matthew A. McCready, Xiaozhi Cao, Kawin Setsompop, John M. Pauly, Adam B. Kerr

    Abstract: A customizable method (OPTIKS) for designing fast trajectory-constrained gradient waveforms with optimized time domain properties was developed. Given a specified multidimensional k-space trajectory, the method optimizes traversal speed (and therefore timing) with position along the trajectory. OPTIKS facilitates optimization of objectives dependent on the time domain gradient waveform and the arc…

    Submitted 11 May, 2025; originally announced May 2025.

  3. arXiv:2502.15568  [pdf, other]

    cs.LG cs.AI

    A Cautionary Tale About "Neutrally" Informative AI Tools Ahead of the 2025 Federal Elections in Germany

    Authors: Ina Dormuth, Sven Franke, Marlies Hafer, Tim Katzke, Alexander Marx, Emmanuel Müller, Daniel Neider, Markus Pauly, Jérôme Rutinowski

    Abstract: In this study, we examine the reliability of AI-based Voting Advice Applications (VAAs) and large language models (LLMs) in providing objective political information. Our analysis is based upon a comparison with party responses to 38 statements of the Wahl-O-Mat, a well-established German online tool that helps inform voters by comparing their views with political party positions. For the LLMs, we…

    Submitted 7 April, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  4. arXiv:2412.13570  [pdf, other]

    stat.AP stat.ML

    Which Imputation Fits Which Feature Selection Method? A Survey-Based Simulation Study

    Authors: Jakob Schwerter, Andrés Romero, Florian Dumpert, Markus Pauly

    Abstract: Tree-based learning methods such as Random Forest and XGBoost are still the gold-standard prediction methods for tabular data. Feature importance measures are usually considered for feature selection as well as to assess the effect of features on the outcome variables in the model. This also applies to survey data, which are frequently encountered in the social sciences and official statistics. Th…

    Submitted 18 December, 2024; originally announced December 2024.

  5. arXiv:2412.13020  [pdf, other]

    math.ST stat.ML

    A Central Limit Theorem for the permutation importance measure

    Authors: Nico Föge, Lena Schmid, Marc Ditzhaus, Markus Pauly

    Abstract: Random Forests have become a widely used tool in machine learning since their introduction in 2001, known for their strong performance in classification and regression tasks. One key feature of Random Forests is the Random Forest Permutation Importance Measure (RFPIM), an internal, non-parametric measure of variable importance. While widely used, theoretical work on RFPIM is sparse, and most resea…

    Submitted 17 December, 2024; originally announced December 2024.

  6. arXiv:2411.10121  [pdf, ps, other]

    stat.ME math.ST

    Quadratic Form based Multiple Contrast Tests for Comparison of Group Means

    Authors: Paavo Sattler, Markus Pauly, Merle Munko

    Abstract: Comparing the mean vectors across different groups is a cornerstone in the realm of multivariate statistics, with quadratic forms commonly serving as test statistics. However, when the overall hypothesis is rejected, identifying specific vector components or determining the groups among which differences exist requires additional investigations. Conversely, employing multiple contrast tests (MCT)…

    Submitted 3 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  7. arXiv:2410.21098  [pdf, other]

    stat.ME

    Single CASANOVA? Not in multiple comparisons

    Authors: Ina Dormuth, Carolin Herrmann, Frank Konietschke, Markus Pauly, Matthias Wirth, Marc Ditzhaus

    Abstract: When comparing multiple groups in clinical trials, we are interested not only in whether there is a difference between any groups but also in its location. Such research questions lead to testing multiple individual hypotheses. To control the familywise error rate (FWER), we must apply some corrections or introduce tests that control the FWER by design. In the case of time-to-event data, a Bonferro…

    Submitted 7 January, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  8. arXiv:2410.21008  [pdf, other]

    cs.CL

    Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases

    Authors: Erik Weber, Jérôme Rutinowski, Niklas Jost, Markus Pauly

    Abstract: This work investigates the political biases and personality traits of ChatGPT, specifically comparing GPT-3.5 to GPT-4. In addition, the ability of the models to emulate political viewpoints (e.g., liberal or conservative positions) is analyzed. The Political Compass Test and the Big Five Personality Test were employed 100 times for each scenario, providing statistically significant results and an…

    Submitted 28 October, 2024; originally announced October 2024.

  9. arXiv:2410.00942  [pdf, other]

    stat.ML cs.LG

    AR-Sieve Bootstrap for the Random Forest and a simulation-based comparison with rangerts time series prediction

    Authors: Cabrel Teguemne Fokam, Carsten Jentsch, Michel Lang, Markus Pauly

    Abstract: The Random Forest (RF) algorithm can be applied to a broad spectrum of problems, including time series prediction. However, neither the classical IID (Independent and Identically Distributed) bootstrap nor block bootstrapping strategies (as implemented in rangerts) completely account for the nature of the Data Generating Process (DGP) while resampling the observations. We propose the combination o…

    Submitted 1 October, 2024; originally announced October 2024.

  10. arXiv:2409.18550  [pdf, other]

    stat.ME

    Iterative Trace Minimization for the Reconciliation of Very Short Hierarchical Time Series

    Authors: Louis Steinmeister, Markus Pauly

    Abstract: Time series often appear in an additive hierarchical structure. In such cases, time series on higher levels are the sums of their subordinate time series. This hierarchical structure places a natural constraint on forecasts. However, univariate forecasting techniques are incapable of ensuring this forecast coherence. An obvious solution is to forecast only bottom time series and obtain higher leve…

    Submitted 19 March, 2025; v1 submitted 27 September, 2024; originally announced September 2024.

  11. arXiv:2409.14926  [pdf, other]

    stat.ME

    Early and Late Buzzards: Comparing Different Approaches for Quantile-based Multiple Testing in Heavy-Tailed Wildlife Research Data

    Authors: Marléne Baumeister, Merle Munko, Kai-Philipp Gladow, Marc Ditzhaus, Nayden Chakarov, Markus Pauly

    Abstract: In medical, ecological and psychological research, there is a need for methods to handle multiple testing, for example to consider group comparisons with more than two groups. Typical approaches that deal with multiple testing are mean- or variance-based, which can be less effective in the context of heavy-tailed and skewed data. Here, the median is the preferred measure of location and the interqua…

    Submitted 28 April, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: DOI for supplementary code: doi:10.17877/TUDODATA-2025-M6TDKFDE

  12. arXiv:2406.12531  [pdf, other]

    cs.LG stat.ML

    TREE: Tree Regularization for Efficient Execution

    Authors: Lena Schmid, Daniel Biebert, Christian Hakert, Kuan-Hsun Chen, Michel Lang, Markus Pauly, Jian-Jia Chen

    Abstract: The rise of machine learning methods on heavily resource constrained devices requires not only the choice of a suitable model architecture for the target platform, but also the optimization of the chosen model with regard to execution time consumption for inference in order to optimally utilize the available resources. Random forests and decision trees are shown to be a suitable model for such a s…

    Submitted 18 June, 2024; originally announced June 2024.

  13. arXiv:2406.01242  [pdf, other]

    stat.ME

    Multiple Comparison Procedures for Simultaneous Inference in Functional MANOVA

    Authors: Merle Munko, Marc Ditzhaus, Markus Pauly, Łukasz Smaga

    Abstract: Functional data analysis is becoming increasingly popular to study data from real-valued random functions. Nevertheless, there is a lack of multiple testing procedures for such data. These are particularly important in factorial designs to compare different groups or to infer factor effects. We propose a new class of testing procedures for arbitrary linear hypotheses in general factorial designs w…

    Submitted 3 June, 2024; originally announced June 2024.

  14. Human Vs. Machines: Who Wins In Semiconductor Market Forecasting?

    Authors: Louis Steinmeister, Markus Pauly

    Abstract: "If you ask ten experts, you will get ten different opinions." This common proverb illustrates the common association of expert forecasts with personal bias and lack of consistency. On the other hand, digitization promises consistency and explainability through data-driven forecasts employing machine learning (ML) and statistical models. In the following, we compare such forecasts to expert foreca…

    Submitted 14 April, 2024; originally announced April 2024.

  15. arXiv:2404.06850   

    math.ST

    Even naive trees are consistent

    Authors: Nico Föge, Markus Pauly, Lena Schmid, Marc Ditzhaus

    Abstract: The last decade has shed some light on theoretical properties such as their consistency for regression tasks. In the current paper, we propose a new class of very simple learners based on so-called naive trees. These naive trees partition the feature space completely at random and independent of the data. Although counter-intuitive, we prove these naive trees and ensembles are consistent under fai…

    Submitted 17 December, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Wrong proof

    MSC Class: Primary 62G05; secondary 62G20

  16. arXiv:2402.04110  [pdf, other]

    cs.CL

    Behind the Screen: Investigating ChatGPT's Dark Personality Traits and Conspiracy Beliefs

    Authors: Erik Weber, Jérôme Rutinowski, Markus Pauly

    Abstract: ChatGPT is notorious for its intransparent behavior. This paper tries to shed light on this, providing an in-depth analysis of the dark personality traits and conspiracy beliefs of GPT-3.5 and GPT-4. Different psychological tests and questionnaires were employed, including the Dark Factor Test, the Mach-IV Scale, the Generic Conspiracy Belief Scale, and the Conspiracy Mentality Scale. The response…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 15 pages, 5 figures

  17. arXiv:2401.14161  [pdf, other]

    stat.AP stat.ML

    Adapting tree-based multiple imputation methods for multi-level data? A simulation study

    Authors: Nico Föge, Jakob Schwerter, Ketevan Gurtskaia, Markus Pauly, Philipp Doebler

    Abstract: When data have a hierarchical structure, such as students nested within classrooms, ignoring dependencies between observations can compromise the validity of imputation procedures. Standard tree-based imputation methods implicitly assume independence between observations, limiting their applicability in multilevel data settings. Although Multivariate Imputation by Chained Equations (MICE) is widel…

    Submitted 19 March, 2025; v1 submitted 25 January, 2024; originally announced January 2024.

  18. arXiv:2401.09602  [pdf, other]

    stat.AP stat.ML

    Evaluating tree-based imputation methods as an alternative to MICE PMM for drawing inference in empirical studies

    Authors: Jakob Schwerter, Ketevan Gurtskaia, Andrés Romero, Birgit Zeyer-Gliozzo, Markus Pauly

    Abstract: Dealing with missing data is an important problem in statistical analysis that is often addressed with imputation procedures. The performance and validity of such methods are of great importance for their application in empirical studies. While the prevailing method of Multiple Imputation by Chained Equations (MICE) with Predictive Mean Matching (PMM) is considered standard in the social science l…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: The project "From Prediction to Agile Interventions in the Social Sciences (FAIR)" is receiving funding from the programme "Profilbildung 2020", an initiative of the Ministry of Culture and Science of the State of North Rhine-Westphalia. The sole responsibility for the content of this publication lies with the authors

  19. arXiv:2308.07842  [pdf, ps, other]

    stat.AP

    How to Simulate Realistic Survival Data? A Simulation Study to Compare Realistic Simulation Models

    Authors: Maria Thurow, Ina Dormuth, Christina Sauer, Marc Ditzhaus, Markus Pauly

    Abstract: In statistics, it is important to have realistic data sets available for a particular context to allow an appropriate and objective method comparison. For many use cases, benchmark data sets for method comparison are already available online. However, in most medical applications and especially for clinical trials in oncology, there is a lack of adequate benchmark data sets, as patient data can be…

    Submitted 29 May, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

  20. arXiv:2306.15259  [pdf, other]

    stat.ME

    General multiple tests for functional data

    Authors: Merle Munko, Marc Ditzhaus, Markus Pauly, Łukasz Smaga, Jin-Ting Zhang

    Abstract: While there exist several inferential methods for analyzing functional data in factorial designs, there is a lack of statistical tests that are valid (i) in general designs, (ii) under non-restrictive assumptions on the data generating process and (iii) allow for coherent post-hoc analyses. In particular, most existing methods assume Gaussianity or equal covariance functions across groups (homosc…

    Submitted 27 June, 2023; originally announced June 2023.

  21. AutoSamp: Autoencoding k-space Sampling via Variational Information Maximization for 3D MRI

    Authors: Cagan Alkan, Morteza Mardani, Congyu Liao, Zhitao Li, Shreyas S. Vasanawala, John M. Pauly

    Abstract: Accelerated MRI protocols routinely involve a predefined sampling pattern that undersamples the k-space. Finding an optimal pattern can enhance the reconstruction quality; however, this optimization is a challenging task. To address this challenge, we introduce a novel deep learning framework, AutoSamp, based on variational information maximization that enables joint optimization of sampling patter…

    Submitted 29 August, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in IEEE Transactions on Medical Imaging (TMI)

  22. arXiv:2305.01610  [pdf, other]

    cs.LG cs.AI

    Finding Neurons in a Haystack: Case Studies with Sparse Probing

    Authors: Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas

    Abstract: Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood. In this work, we seek to understand how high-level human-interpretable features are represented within the internal neuron activations of LLMs. We train $k$-sparse linear classifiers (probes) on these internal activations to predict the presence of f…

    Submitted 2 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

  23. arXiv:2304.07333  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC

    The Self-Perception and Political Biases of ChatGPT

    Authors: Jérôme Rutinowski, Sven Franke, Jan Endendyk, Ina Dormuth, Markus Pauly

    Abstract: This contribution analyzes the self-perception and political biases of OpenAI's Large Language Model ChatGPT. Taking into account the first small-scale reports and studies that have emerged, claiming that ChatGPT is politically biased towards progressive and libertarian points of view, this contribution aims to provide further clarity on this subject. For this purpose, ChatGPT was asked to answer…

    Submitted 14 April, 2023; originally announced April 2023.

  24. arXiv:2303.08193  [pdf, other]

    cs.DB cs.LG

    RODD: Robust Outlier Detection in Data Cubes

    Authors: Lara Kuhlmann, Daniel Wilmes, Emmanuel Müller, Markus Pauly, Daniel Horn

    Abstract: Data cubes are multidimensional databases, often built from several separate databases, that serve as a flexible basis for data analysis. Surprisingly, outlier detection on data cubes has not yet been treated extensively. In this work, we provide the first framework to evaluate robust outlier detection methods in data cubes (RODD). We introduce a novel random forest-based outlier detection approach…

    Submitted 14 March, 2023; originally announced March 2023.

  25. arXiv:2303.07139  [pdf, other]

    stat.ML cs.LG

    Comparing statistical and machine learning methods for time series forecasting in data-driven logistics -- A simulation study

    Authors: Lena Schmid, Moritz Roidl, Markus Pauly

    Abstract: Many planning and decision activities in logistics and supply chain management are based on forecasts of multiple time dependent factors. Therefore, the quality of planning depends on the quality of the forecasts. We compare various forecasting methods in terms of out of the box forecasting performance on a broad set of simulated time series. We simulate various linear and non-linear time series a…

    Submitted 6 June, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

  26. arXiv:2302.01807  [pdf, other]

    cond-mat.quant-gas cond-mat.stat-mech

    Ultracold plasmas from strongly anti-correlated Rydberg gases in the Kinetic Field Theory formalism

    Authors: Elena Kozlikin, Robert Lilow, Martin Pauly, Alexander Schuckert, Andre Salzinger, Matthias Bartelmann, Matthias Weidemüller

    Abstract: The dynamics of correlated systems is relevant in many fields ranging from cosmology to plasma physics. However, they are challenging to predict and understand even for classical systems due to the typically large numbers of particles involved. Here, we study the evolution of an ultracold, correlated many-body system with repulsive interactions and initial correlations set by the Rydberg blockade…

    Submitted 3 February, 2023; originally announced February 2023.

  27. arXiv:2301.10161  [pdf, other]

    eess.SP cs.AI cs.LG

    Dataset Bias in Human Activity Recognition

    Authors: Nilah Ravi Nair, Lena Schmid, Fernando Moya Rueda, Markus Pauly, Gernot A. Fink, Christopher Reining

    Abstract: When creating multi-channel time-series datasets for Human Activity Recognition (HAR), researchers are faced with the issue of subject selection criteria. It is unknown what physical characteristics and/or soft-biometrics, such as age, height, and weight, need to be taken into account to train a classifier to achieve robustness towards heterogeneous populations in the training and testing data. Th…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Submitted for review to the 32nd International Joint Conference on Artificial Intelligence (IJCAI-23)

  28. arXiv:2301.03244  [pdf, other]

    stat.ME stat.AP

    The impact of neglected confounding and interactions in mixed-effects meta-regression

    Authors: Eric S. Knop, Markus Pauly, Tim Friede, Thilo Welz

    Abstract: Analysts seldom include interaction terms in meta-regression models, which can introduce bias if an interaction is present. We illustrate this in the current paper by re-analyzing an example from research on acute heart failure, where neglecting an interaction might have led to erroneous inference and conclusions. Moreover, we perform a brief simulation study based on this example highlighting the e…

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: 10 pages, 2 figures. arXiv admin note: text overlap with arXiv:2201.05491

  29. arXiv:2212.01067  [pdf, other]

    stat.ME stat.AP

    Using meta-analytic priors to incorporate external information for study evaluation

    Authors: Thilo Welz, Eric Knop, Frank Konietschke, Jan-Hendrik B. Hardenberg, Markus Pauly, Christian Röver

    Abstract: Background: The COVID-19 pandemic has had a profound impact on health, everyday life and economics around the world. An important complication that can arise in connection with a COVID-19 infection is acute kidney injury. A recent observational cohort study of COVID-19 patients treated at multiple sites of a tertiary care center in Berlin, Germany identified risk factors for the development of (se…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 20 pages (including Appendix), 2 Tables, 8 figures

  30. arXiv:2211.15484  [pdf, other]

    math.ST

    Quantile-based MANOVA: A new tool for inferring multivariate data in factorial designs

    Authors: Marléne Baumeister, Marc Ditzhaus, Markus Pauly

    Abstract: Multivariate analysis-of-variance (MANOVA) is a well-established tool to examine multivariate endpoints. While classical approaches depend on restrictive assumptions like normality and homogeneity, there is a recent trend to more general and flexible procedures. In this paper, we proceed on this path, but do not follow the typical mean-focused perspective. Instead we consider general quantiles, i…

    Submitted 28 November, 2022; originally announced November 2022.

  31. arXiv:2211.04703  [pdf]

    eess.IV cs.CV

    Automated MRI Field of View Prescription from Region of Interest Prediction by Intra-stack Attention Neural Network

    Authors: Ke Lei, Ali B. Syed, Xucheng Zhu, John M. Pauly, Shreyas S. Vasanawala

    Abstract: Manual prescription of the field of view (FOV) by MRI technologists is variable and prolongs the scanning process. Often, the FOV is too large or crops critical anatomy. We propose a deep-learning framework, trained by radiologists' supervision, for automating FOV prescription. An intra-stack shared feature extraction network and an attention network are used to process a stack of 2D image inputs…

    Submitted 9 November, 2022; originally announced November 2022.

  32. arXiv:2210.14573  [pdf, other]

    stat.ML cs.AI cs.LG

    Learning Causal Graphs in Manufacturing Domains using Structural Equation Models

    Authors: Maximilian Kertel, Stefan Harmeling, Markus Pauly

    Abstract: Many production processes are characterized by numerous and complex cause-and-effect relationships. Since they are only partially known, they pose a challenge to effective process control. In this work we present how Structural Equation Models can be used for deriving cause-and-effect relationships from the combination of prior knowledge and process data in the manufacturing domain. Compared to exi…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: To be published in the Proceedings of IEEE AI4I 2022

  33. arXiv:2210.13258  [pdf, other]

    stat.ME stat.AP

    A comparative study to alternatives to the log-rank test

    Authors: Ina Dormuth, Tiantian Liu, Jin Xu, Markus Pauly, Marc Ditzhaus

    Abstract: Studies to compare the survival of two or more groups using time-to-event data are of high importance in medical research. The gold standard is the log-rank test, which is optimal under proportional hazards. As the latter is no simple regularity assumption, we are interested in evaluating the power of various statistical tests under different settings including proportional and non-proportional ha…

    Submitted 24 October, 2022; originally announced October 2022.

  34. arXiv:2209.04380  [pdf, other]

    math.ST

    Testing Hypotheses about Correlation Matrices in General MANOVA Designs

    Authors: Paavo Sattler, Markus Pauly

    Abstract: Correlation matrices are an essential tool for investigating the dependency structures of random vectors or comparing them. We introduce an approach for testing a variety of null hypotheses that can be formulated based upon the correlation matrix. Examples cover MANOVA-type hypotheses of equal correlation matrices as well as testing for special correlation structures such as sphericity. Apa…

    Submitted 11 July, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

  35. arXiv:2208.01231  [pdf, other]

    stat.ME

    The nonparametric Behrens-Fisher problem in small samples

    Authors: Claus P. Nowak, Markus Pauly, Edgar Brunner

    Abstract: While there appears to be a general consensus in the literature on the definition of the estimand and estimator associated with the Wilcoxon-Mann-Whitney test, it seems somewhat less clear as to how best to estimate the variance. In addition to the Wilcoxon-Mann-Whitney test, we review different proposals of variance estimators consistent under both the null hypothesis and the alternative. Moreove…

    Submitted 1 August, 2022; originally announced August 2022.

  36. arXiv:2207.09382  [pdf, other]

    math.ST

    Inference for high-dimensional split-plot designs with different dimensions between groups

    Authors: Paavo Sattler, Markus Pauly

    Abstract: In repeated measures designs with multiple groups, the primary purpose is to compare different groups in various aspects. For several reasons, the number of measurements and therefore the dimension of the observation vectors can depend on the group, making the usage of existing approaches impossible. We develop an approach which can be used not only for a possibly increasing number of groups $a$, b…

    Submitted 19 July, 2022; originally announced July 2022.

  37. arXiv:2207.08393  [pdf, other]

    eess.IV cs.CV

    GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction

    Authors: Batu Ozturkler, Arda Sahiner, Tolga Ergen, Arjun D Desai, Christopher M Sandino, Shreyas Vasanawala, John M Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction. These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization. However, they require several iterations of a large neural network to handle high-dimensional imaging tasks such as 3D MRI. This limits traditional training…

    Submitted 18 July, 2022; originally announced July 2022.

  38. arXiv:2203.02234  [pdf, other]

    stat.ME stat.CO

    Cluster-Robust Estimators for Bivariate Mixed-Effects Meta-Regression

    Authors: Thilo Welz, Wolfgang Viechtbauer, Markus Pauly

    Abstract: Meta-analyses frequently include trials that report multiple effect sizes based on a common set of study participants. These effect sizes will generally be correlated. Cluster-robust variance-covariance estimators are a fruitful approach for synthesizing dependent effects. However, when the number of studies is small, state-of-the-art robust estimators can yield inflated Type 1 errors. We present…

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 23 pages + 1 page supplement, 6 figures

    MSC Class: 62H12; 62H15; 62J05

  39. arXiv:2201.05565  [pdf, other]

    stat.ML cs.LG stat.ME

    Estimating Gaussian Copulas with Missing Data

    Authors: Maximilian Kertel, Markus Pauly

    Abstract: In this work we present a rigorous application of the Expectation Maximization algorithm to determine the marginal distributions and the dependence structure in a Gaussian copula model with missing data. We further show how to circumvent a priori assumptions on the marginals with semiparametric modelling. The joint distribution learned through this algorithm is considerably closer to the underlyin…

    Submitted 14 January, 2022; originally announced January 2022.

  40. arXiv:2201.05491  [pdf, other]

    stat.ME stat.AP stat.CO

    Robust Confidence Intervals for Meta-Regression with Interaction Effects

    Authors: Thilo Welz, Eric S. Knop, Tim Friede, Markus Pauly

    Abstract: Meta-analysis is an important statistical technique for synthesizing the results of multiple studies regarding the same or closely related research question. So-called meta-regression extends meta-analysis models by accounting for study-level covariates. Mixed-effects meta-regression models provide a powerful tool for evidence synthesis, by appropriately accounting for between-study heterogeneity.…

    Submitted 21 February, 2023; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: main paper: 23 pages, 6 figures; supplement: 148 pages, 131 figures

  41. arXiv:2201.05340  [pdf, other]

    stat.ML cs.LG

    Machine Learning for Multi-Output Regression: When should a holistic multivariate approach be preferred over separate univariate ones?

    Authors: Lena Schmid, Alexander Gerharz, Andreas Groll, Markus Pauly

    Abstract: Tree-based ensembles such as the Random Forest are modern classics among statistical learning methods. In particular, they are used for predicting univariate responses. In case of multiple outputs the question arises whether we separately fit univariate models or directly follow a multivariate approach. For the latter, several possibilities exist that are, e.g. based on modified splitting or stopp…

    Submitted 14 January, 2022; originally announced January 2022.

  42. Using Sequential Statistical Tests for Efficient Hyperparameter Tuning

    Authors: Philip Buczak, Andreas Groll, Markus Pauly, Jakob Rehof, Daniel Horn

    Abstract: Hyperparameter tuning is one of the most time-consuming parts in machine learning. Despite the existence of modern optimization algorithms that minimize the number of evaluations needed, evaluations of a single setting may still be expensive. Usually a resampling technique is used, where the machine learning method has to be fitted a fixed number of k times on different training datasets. The…

    Submitted 28 November, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  43. arXiv:2112.05248  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    On the Relation between Prediction and Imputation Accuracy under Missing Covariates

    Authors: Burim Ramosaj, Justus Tulowietzki, Markus Pauly

    Abstract: Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the usage of modern Machine Learning algorithms for imputation. It originates from their capability of showing favourable prediction accuracy in different learning problems. In this work, we analyze through simulati…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Includes supplementary material

  44. arXiv:2111.03780  [pdf, other]

    eess.IV cs.AI cs.CV

    Artifact- and content-specific quality assessment for MRI with image rulers

    Authors: Ke Lei, John M. Pauly, Shreyas S. Vasanawala

    Abstract: In clinical practice MR images are often first seen by radiologists long after the scan. If image quality is inadequate either patients have to return for an additional scan, or a suboptimal interpretation is rendered. An automatic image quality assessment (IQA) would enable real-time remediation. Existing IQA works for MRI give only a general quality score, agnostic to the cause of and solution t…

    Submitted 5 November, 2021; originally announced November 2021.

  45. arXiv:2111.02549  [pdf, other]

    eess.IV physics.med-ph

    VORTEX: Physics-Driven Data Augmentations Using Consistency Training for Robust Accelerated MRI Reconstruction

    Authors: Arjun D Desai, Beliz Gunel, Batu M Ozturkler, Harris Beg, Shreyas Vasanawala, Brian A Hargreaves, Christopher Ré, John M Pauly, Akshay S Chaudhari

    Abstract: Deep neural networks have enabled improved image quality and fast inference times for various inverse problems, including accelerated magnetic resonance imaging (MRI) reconstruction. However, such models require a large number of fully-sampled ground truth datasets, which are difficult to curate, and are sensitive to distribution drifts. In this work, we propose applying physics-driven data augmen…

    Submitted 17 June, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted to MIDL 2022

  46. arXiv:2110.00075  [pdf, other]

    eess.IV cs.CV

    Noise2Recon: Enabling Joint MRI Reconstruction and Denoising with Semi-Supervised and Self-Supervised Learning

    Authors: Arjun D Desai, Batu M Ozturkler, Christopher M Sandino, Robert Boutin, Marc Willis, Shreyas Vasanawala, Brian A Hargreaves, Christopher M Ré, John M Pauly, Akshay S Chaudhari

    Abstract: Deep learning (DL) has shown promise for faster, high quality accelerated MRI reconstruction. However, supervised DL methods depend on extensive amounts of fully-sampled (labeled) data and are sensitive to out-of-distribution (OOD) shifts, particularly low signal-to-noise ratio (SNR) acquisitions. To alleviate this challenge, we propose Noise2Recon, a model-agnostic, consistency training method fo…

    Submitted 7 October, 2022; v1 submitted 30 September, 2021; originally announced October 2021.

  47. arXiv:2108.04068  [pdf, other]

    stat.OT stat.AP

    On the role of data, statistics and decisions in a pandemic

    Authors: Beate Jahn, Sarah Friedrich, Joachim Behnke, Joachim Engel, Ursula Garczarek, Ralf Münnich, Markus Pauly, Adalbert Wilhelm, Olaf Wolkenhauer, Markus Zwick, Uwe Siebert, Tim Friede

    Abstract: A pandemic poses particular challenges to decision-making because of the need to continuously adapt decisions to rapidly changing evidence and available data. For example, which countermeasures are appropriate at a particular stage of the pandemic? How can the severity of the pandemic be measured? What is the effect of vaccination in the population and which groups should be vaccinated first? The…

    Submitted 8 March, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

  48. arXiv:2107.07949  [pdf, other]

    hep-ph gr-qc hep-th

    Towards a Higgs mass determination in asymptotically safe gravity with a dark portal

    Authors: Astrid Eichhorn, Martin Pauly, Shouryya Ray

    Abstract: There are indications that an asymptotically safe UV completion of the Standard Model with gravity could constrain the Higgs self-coupling, resulting in a prediction of the Higgs mass close to the vacuum stability bound in the Standard Model. The predicted value depends on the top quark mass and comes out somewhat higher than the experimental value if the current central value for the top quark ma…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 23 pages, 11 figures

  49. arXiv:2107.07325  [pdf, other]

    cond-mat.dis-nn gr-qc physics.soc-ph

    A sprinkling of hybrid-signature discrete spacetimes in real-world networks

    Authors: Astrid Eichhorn, Martin Pauly

    Abstract: Many real-world networks are embedded into a space or spacetime. The embedding space(time) constrains the properties of these real-world networks. We use the scale-dependent spectral dimension as a tool to probe whether real-world networks encode information on the dimensionality of the embedding space. We find that spacetime networks which are inspired by quantum gravity and based on a hybrid sig…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: 19 pages, 18 figures

  50. arXiv:2106.06660  [pdf, other]

    eess.IV physics.med-ph

    Least Squares Optimal Density Compensation for the Gridding Non-uniform Discrete Fourier Transform

    Authors: Nicholas Dwork, Daniel O'Connor, Ethan M. I. Johnson, Corey A. Baron, Jeremy W. Gordon, John M. Pauly, Peder E. Z. Larson

    Abstract: The Gridding algorithm has shown great utility for reconstructing images from non-uniformly spaced samples in the Fourier domain in several imaging modalities. Due to the non-uniform spacing, some correction for the variable density of the samples must be made. Existing methods for generating density compensation values are either sub-optimal or only consider a finite set of points (a set of measu…

    Submitted 16 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.