Showing 1–13 of 13 results for author: Gossmann, A

  1. arXiv:2506.00756  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    "Who experiences large model decay and why?" A Hierarchical Framework for Diagnosing Heterogeneous Performance Drift

    Authors: Harvineet Singh, Fan Xia, Alexej Gossmann, Andrew Chuang, Julian C. Hong, Jean Feng

    Abstract: Machine learning (ML) models frequently experience performance degradation when deployed in new contexts. Such degradation is rarely uniform: some subgroups may suffer large performance decay while others may not. Understanding where and how large differences in performance arise is critical for designing targeted corrective actions that mitigate decay for the most affected subgroups while minimiz…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 13 pages, 9 figures, 8 tables, 18 pages appendix. To be published in Proceedings of the 42nd International Conference on Machine Learning, Vancouver, Canada. PMLR 267, 2025

  2. arXiv:2402.14254  [pdf, other]

    cs.LG stat.ML

    A hierarchical decomposition for explaining ML performance discrepancies

    Authors: Jean Feng, Harvineet Singh, Fan Xia, Adarsh Subbaswamy, Alexej Gossmann

    Abstract: Machine learning (ML) algorithms can often differ in performance across domains. Understanding $\textit{why}$ their performance differs is crucial for determining what types of interventions (e.g., algorithmic or operational) are most effective at closing the performance gaps. Existing methods focus on $\textit{aggregate decompositions}$ of the total performance gap into the impact of a shift in t…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures in main body; 14 pages and 2 figures in appendices

  3. arXiv:2311.11463  [pdf, other]

    cs.LG stat.ML

    Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens

    Authors: Jean Feng, Adarsh Subbaswamy, Alexej Gossmann, Harvineet Singh, Berkman Sahiner, Mi-Ok Kim, Gene Pennello, Nicholas Petrick, Romain Pirracchio, Fan Xia

    Abstract: After a machine learning (ML)-based system is deployed, monitoring its performance is important to ensure the safety and effectiveness of the algorithm over time. When an ML algorithm interacts with its environment, the algorithm can affect the data-generating mechanism and be a major source of bias when evaluating its standalone performance, an issue known as performativity. Although prior work h…

    Submitted 26 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  4. arXiv:2307.15247  [pdf, other]

    cs.LG stat.ME stat.ML

    Is this model reliable for everyone? Testing for strong calibration

    Authors: Jean Feng, Alexej Gossmann, Romain Pirracchio, Nicholas Petrick, Gene Pennello, Berkman Sahiner

    Abstract: In a well-calibrated risk prediction model, the average predicted probability is close to the true event rate for any given subgroup. Such models are reliable across heterogeneous populations and satisfy strong notions of algorithmic fairness. However, the task of auditing a model for strong calibration is well-known to be difficult -- particularly for machine learning (ML) algorithms -- due to th…

    Submitted 27 July, 2023; originally announced July 2023.

  5. arXiv:2211.09781  [pdf, other]

    stat.ML cs.CY cs.LG

    Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions

    Authors: Jean Feng, Alexej Gossmann, Gene Pennello, Nicholas Petrick, Berkman Sahiner, Romain Pirracchio

    Abstract: Performance monitoring of machine learning (ML)-based risk prediction models in healthcare is complicated by the issue of confounding medical interventions (CMI): when an algorithm predicts a patient to be at high risk for an adverse event, clinicians are more likely to administer prophylactic treatment and alter the very target that the algorithm aims to predict. A simple approach is to ignore CM…

    Submitted 14 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  6. arXiv:2203.11377  [pdf, other]

    stat.ML cs.LG stat.ME

    Sequential algorithmic modification with test data reuse

    Authors: Jean Feng, Gene Pennello, Nicholas Petrick, Berkman Sahiner, Romain Pirracchio, Alexej Gossmann

    Abstract: After initial release of a machine learning algorithm, the model can be fine-tuned by retraining on subsequently gathered data, adding newly discovered features, or more. Each modification introduces a risk of deteriorating performance and must be validated on a test dataset. It may not always be practical to assemble a new dataset for testing each modification, especially when most modifications…

    Submitted 21 March, 2022; originally announced March 2022.

  7. arXiv:2110.06866  [pdf, other]

    stat.ML cs.LG stat.AP

    Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees

    Authors: Jean Feng, Alexej Gossmann, Berkman Sahiner, Romain Pirracchio

    Abstract: After deploying a clinical prediction model, subsequently collected data can be used to fine-tune its predictions and adapt to temporal shifts. Because model updating carries risks of over-updating/fitting, we study online methods with performance guarantees. We introduce two procedures for continual recalibration or revision of an underlying prediction model: Bayesian logistic regression (BLR) an…

    Submitted 13 October, 2021; originally announced October 2021.

  8. arXiv:2007.00479  [pdf, ps, other]

    stat.ML cs.LG math.NA

    The Restricted Isometry of ReLU Networks: Generalization through Norm Concentration

    Authors: Alex Goeßmann, Gitta Kutyniok

    Abstract: While regression tasks aim at interpolating a relation on the entire input space, they often have to be solved with a limited amount of training data. Still, if the hypothesis functions can be sketched well with the data, one can hope for identifying a generalizing model. In this work, we introduce with the Neural Restricted Isometry Property (NeuRIP) a uniform concentration event, in which all…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 27 pages, 5 figures

    MSC Class: G.3 ACM Class: F.2; G.3

  9. arXiv:2002.12388  [pdf, other]

    math.NA cs.LG math.DS quant-ph stat.ML

    Tensor network approaches for learning non-linear dynamical laws

    Authors: A. Goeßmann, M. Götte, I. Roth, R. Sweke, G. Kutyniok, J. Eisert

    Abstract: Given observations of a physical system, identifying the underlying non-linear governing equation is a fundamental task, necessary both for gaining understanding and generating deterministic future predictions. Of most practical relevance are automated approaches to theory building that scale efficiently for complex systems with many degrees of freedom. To date, available scalable methods aim at a…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 17 pages, 8 figures

  10. arXiv:1906.02972  [pdf, other]

    cs.LG stat.ML

    Variational Resampling Based Assessment of Deep Neural Networks under Distribution Shift

    Authors: Xudong Sun, Alexej Gossmann, Yu Wang, Bernd Bischl

    Abstract: A novel variational inference based resampling framework is proposed to evaluate the robustness and generalization capability of deep learning models with respect to distribution shift. We use Auto Encoding Variational Bayes to find a latent representation of the data, on which a Variational Gaussian Mixture Model is applied to deliberately create distribution shift by dividing the dataset into di…

    Submitted 27 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  11. arXiv:1904.01070  [pdf]

    cs.LG cs.NE q-bio.NC stat.ML

    Multimodal Sparse Classifier for Adolescent Brain Age Prediction

    Authors: Peyman Hosseinzadeh Kassani, Alexej Gossmann, Yu-Ping Wang

    Abstract: The study of healthy brain development helps to better understand the brain transformation and brain connectivity patterns which happen during childhood to adulthood. This study presents a sparse machine learning solution across whole-brain functional connectivity (FC) measures of three sets of data, derived from resting state functional magnetic resonance imaging (rs-fMRI) and task fMRI data, inc…

    Submitted 1 April, 2019; originally announced April 2019.

  12. arXiv:1705.04312  [pdf, other]

    stat.ME q-bio.QM stat.AP stat.ML

    FDR-Corrected Sparse Canonical Correlation Analysis with Applications to Imaging Genomics

    Authors: Alexej Gossmann, Pascal Zille, Vince Calhoun, Yu-Ping Wang

    Abstract: Reducing the number of false discoveries is presently one of the most pressing issues in the life sciences. It is of especially great importance for many applications in neuroimaging and genomics, where datasets are typically high-dimensional, which means that the number of explanatory variables exceeds the sample size. The false discovery rate (FDR) is a criterion that can be employed to address…

    Submitted 23 June, 2018; v1 submitted 11 May, 2017; originally announced May 2017.

    Comments: - Clarification of the definition of FDR for CCA in Section III; results unchanged. - Corrected typos. - Added IEEE copyright notice for the accepted article

  13. arXiv:1610.04960  [pdf, other]

    stat.ME

    Group SLOPE - adaptive selection of groups of predictors

    Authors: Damian Brzyski, Alexej Gossmann, Weijie Su, Malgorzata Bogdan

    Abstract: Sorted L-One Penalized Estimation (SLOPE) is a relatively new convex optimization procedure which allows for adaptive selection of regressors under sparse high dimensional designs. Here we extend the idea of SLOPE to deal with the situation when one aims at selecting whole groups of explanatory variables instead of single regressors. Such groups can be formed by clustering strongly correlated pred…

    Submitted 16 October, 2016; originally announced October 2016.

    Comments: 40 pages, 22 pages in Appendix, 5 figures included. arXiv admin note: text overlap with arXiv:1511.09078

    MSC Class: 46N10 ACM Class: G.1.6