Skip to main content

Showing 1–12 of 12 results for author: Oberst, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.20178  [pdf, ps, other

    stat.ML cs.LG

    No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference

    Authors: Pranav Mani, Peng Xu, Zachary C. Lipton, Michael Oberst

    Abstract: Prediction-Powered Inference (PPI) is a popular strategy for combining gold-standard and possibly noisy pseudo-labels to perform statistical estimation. Prior work has shown an asymptotic "free lunch" for PPI++, an adaptive form of PPI, showing that the *asymptotic* variance of PPI++ is always less than or equal to the variance obtained from using gold-standard labels alone. Notably, this result h… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2502.09467  [pdf, other

    stat.ME

    Just Trial Once: Ongoing Causal Validation of Machine Learning Models

    Authors: Jacob M. Chen, Michael Oberst

    Abstract: Machine learning (ML) models are increasingly used as decision-support tools in high-risk domains. Evaluating the causal impact of deploying such models can be done with a randomized controlled trial (RCT) that randomizes users to ML vs. control groups and assesses the effect on relevant outcomes. However, ML models are inevitably updated over time, and we often lack evidence for the causal impact… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 27 pages

  3. arXiv:2403.14713  [pdf, other

    cs.LG cs.CY stat.ME stat.ML

    Auditing Fairness under Unobserved Confounding

    Authors: Yewon Byun, Dylan Sam, Michael Oberst, Zachary C. Lipton, Bryan Wilder

    Abstract: Many definitions of fairness or inequity involve unobservable causal quantities that cannot be directly estimated without strong assumptions. For instance, it is particularly difficult to estimate notions of fairness that rely on hard-to-measure concepts such as risk (e.g., quantifying whether patients at the same risk level have equal probability of treatment, regardless of group membership). Suc… ▽ More

    Submitted 9 December, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: AISTATS 2024

  4. arXiv:2402.15137  [pdf, other

    stat.ME stat.ML

    Benchmarking Observational Studies with Experimental Data under Right-Censoring

    Authors: Ilker Demirel, Edward De Brouwer, Zeshan Hussain, Michael Oberst, Anthony Philippakis, David Sontag

    Abstract: Drawing causal inferences from observational studies (OS) requires unverifiable validity assumptions; however, one can falsify those assumptions by benchmarking the OS with experimental data from a randomized controlled trial (RCT). A major limitation of existing procedures is not accounting for censoring, despite the abundance of RCTs and OSes that report right-censored time-to-event outcomes. We… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Artificial Intelligence and Statistics (AISTATS) 2024

  5. arXiv:2301.13133  [pdf, other

    stat.ME cs.LG

    Falsification of Internal and External Validity in Observational Studies via Conditional Moment Restrictions

    Authors: Zeshan Hussain, Ming-Chieh Shih, Michael Oberst, Ilker Demirel, David Sontag

    Abstract: Randomized Controlled Trials (RCT)s are relied upon to assess new treatments, but suffer from limited power to guide personalized treatment decisions. On the other hand, observational (i.e., non-experimental) studies have large and diverse populations, but are prone to various biases (e.g. residual confounding). To safely leverage the strengths of observational studies, we focus on the problem of… ▽ More

    Submitted 6 March, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Artificial Intelligence and Statistics 2023

  6. arXiv:2205.15947  [pdf, other

    cs.LG stat.ML

    Evaluating Robustness to Dataset Shift via Parametric Robustness Sets

    Authors: Nikolaj Thams, Michael Oberst, David Sontag

    Abstract: We give a method for proactively identifying small, plausible shifts in distribution which lead to large differences in model performance. These shifts are defined via parametric changes in the causal mechanisms of observed variables, where constraints on parameters yield a "robustness set" of plausible distributions and a corresponding worst-case loss over the set. While the loss under an individ… ▽ More

    Submitted 15 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022; Equal Contribution by Nikolaj/Michael, order determined by coin flip

  7. arXiv:2205.10467  [pdf, other

    stat.ME

    Understanding the Risks and Rewards of Combining Unbiased and Possibly Biased Estimators, with Applications to Causal Inference

    Authors: Michael Oberst, Alexander D'Amour, Minmin Chen, Yuyan Wang, David Sontag, Steve Yadlowsky

    Abstract: Several problems in statistics involve the combination of high-variance unbiased estimators with low-variance estimators that are only unbiased under strong assumptions. A notable example is the estimation of causal effects while combining small experimental datasets with larger observational datasets. There exist a series of recent proposals on how to perform such a combination, even when the bia… ▽ More

    Submitted 24 May, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  8. arXiv:2103.02477  [pdf, other

    cs.LG stat.ML

    Regularizing towards Causal Invariance: Linear Models with Proxies

    Authors: Michael Oberst, Nikolaj Thams, Jonas Peters, David Sontag

    Abstract: We propose a method for learning linear models whose predictive performance is robust to causal interventions on unobserved variables, when noisy proxies of those variables are available. Our approach takes the form of a regularization term that trades off between in-distribution performance and robustness to interventions. Under the assumption of a linear structural causal model, we show that a s… ▽ More

    Submitted 27 June, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: ICML 2021 (to appear)

  9. arXiv:2006.00927  [pdf, other

    cs.LG stat.ML

    Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes

    Authors: Soorajnath Boominathan, Michael Oberst, Helen Zhou, Sanjat Kanjilal, David Sontag

    Abstract: In several medical decision-making problems, such as antibiotic prescription, laboratory testing can provide precise indications for how a patient will respond to different treatment options. This enables us to "fully observe" all potential treatment outcomes, but while present in historical data, these results are infeasible to produce in real-time at the point of the initial treatment decision.… ▽ More

    Submitted 12 August, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: To appear at KDD'20

  10. arXiv:2002.01584   

    cs.LG stat.ML

    ML4H Abstract Track 2019

    Authors: Matthew B. A. McDermott, Emily Alsentzer, Sam Finlayson, Michael Oberst, Fabian Falck, Tristan Naumann, Brett K. Beaulieu-Jones, Adrian V. Dalca

    Abstract: A collection of the accepted abstracts for the Machine Learning for Health (ML4H) workshop at NeurIPS 2019. This index is not complete, as some accepted abstracts chose to opt-out of inclusion.

    Submitted 4 February, 2020; originally announced February 2020.

  11. arXiv:1907.04138  [pdf, other

    cs.LG stat.ML

    Characterization of Overlap in Observational Studies

    Authors: Michael Oberst, Fredrik D. Johansson, Dennis Wei, Tian Gao, Gabriel Brat, David Sontag, Kush R. Varshney

    Abstract: Overlap between treatment groups is required for non-parametric estimation of causal effects. If a subgroup of subjects always receives the same intervention, we cannot estimate the effect of intervention changes on that subgroup without further assumptions. When overlap does not hold globally, characterizing local regions of overlap can inform the relevance of causal conclusions for new subjects,… ▽ More

    Submitted 3 June, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: To appear at AISTATS 2020

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:788-798, 2020

  12. arXiv:1905.05824  [pdf, other

    cs.LG stat.ML

    Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models

    Authors: Michael Oberst, David Sontag

    Abstract: We introduce an off-policy evaluation procedure for highlighting episodes where applying a reinforcement learned (RL) policy is likely to have produced a substantially different outcome than the observed policy. In particular, we introduce a class of structural causal models (SCMs) for generating counterfactual trajectories in finite partially observable Markov Decision Processes (POMDPs). We see… ▽ More

    Submitted 6 June, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: To appear in ICML 2019

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:4881-4890, 2019