Skip to main content

Showing 1–12 of 12 results for author: Pfohl, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.04193  [pdf, ps, other

    stat.ML cs.CY cs.LG

    Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness

    Authors: Stephen R. Pfohl, Natalie Harris, Chirag Nagpal, David Madras, Vishwali Mhasawade, Olawale Salaudeen, Awa Dieng, Shannon Sequeira, Santiago Arciniegas, Lillian Sung, Nnamdi Ezeanochie, Heather Cole-Lewis, Katherine Heller, Sanmi Koyejo, Alexander D'Amour

    Abstract: Disaggregated evaluation across subgroups is critical for assessing the fairness of machine learning models, but its uncritical use can mislead practitioners. We show that equal performance across subgroups is an unreliable measure of fairness when data are representative of the relevant populations but reflective of real-world disparities. Furthermore, when data are not representative due to sele… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2403.07442  [pdf, other

    cs.LG stat.ML

    Proxy Methods for Domain Adaptation

    Authors: Katherine Tsai, Stephen R. Pfohl, Olawale Salaudeen, Nicole Chiou, Matt J. Kusner, Alexander D'Amour, Sanmi Koyejo, Arthur Gretton

    Abstract: We study the problem of domain adaptation under distribution shift, where the shift is due to a change in the distribution of an unobserved, latent variable that confounds both the covariates and the labels. In this setting, neither the covariate shift nor the label shift assumptions apply. Our approach to adaptation employs proximal causal learning, a technique for estimating causal effects in se… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  3. arXiv:2212.11254  [pdf, other

    stat.ML cs.AI cs.LG

    Adapting to Latent Subgroup Shifts via Concepts and Proxies

    Authors: Ibrahim Alabdulmohsin, Nicole Chiou, Alexander D'Amour, Arthur Gretton, Sanmi Koyejo, Matt J. Kusner, Stephen R. Pfohl, Olawale Salaudeen, Jessica Schrouff, Katherine Tsai

    Abstract: We address the problem of unsupervised domain adaptation when the source domain differs from the target domain because of a shift in the distribution of a latent subgroup. When this subgroup confounds all observed data, neither covariate shift nor label shift assumptions apply. We show that the optimal target predictor can be non-parametrically identified with the help of concept and proxy variabl… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Authors listed in alphabetical order

  4. arXiv:2202.01906  [pdf, other

    stat.ML cs.CY cs.LG

    Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare

    Authors: Stephen R. Pfohl, Yizhe Xu, Agata Foryciarz, Nikolaos Ignatiadis, Julian Genkins, Nigam H. Shah

    Abstract: A growing body of work uses the paradigm of algorithmic fairness to frame the development of techniques to anticipate and proactively mitigate the introduction or exacerbation of health inequities that may follow from the use of model-guided decision-making. We evaluate the interplay between measures of model performance, fairness, and the expected utility of decision-making to offer practical rec… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  5. arXiv:2108.12250  [pdf, other

    stat.ML cs.CY cs.LG

    A comparison of approaches to improve worst-case predictive model performance over patient subpopulations

    Authors: Stephen R. Pfohl, Haoran Zhang, Yizhe Xu, Agata Foryciarz, Marzyeh Ghassemi, Nigam H. Shah

    Abstract: Predictive models for clinical outcomes that are accurate on average in a patient population may underperform drastically for some subpopulations, potentially introducing or reinforcing inequities in care access and quality. Model training approaches that aim to maximize worst-case model performance across subpopulations, such as distributionally robust optimization (DRO), attempt to address this… ▽ More

    Submitted 1 February, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

  6. arXiv:2007.10306  [pdf, other

    stat.ML cs.CY cs.LG stat.AP

    An Empirical Characterization of Fair Machine Learning For Clinical Risk Prediction

    Authors: Stephen R. Pfohl, Agata Foryciarz, Nigam H. Shah

    Abstract: The use of machine learning to guide clinical decision making has the potential to worsen existing health disparities. Several recent works frame the problem as that of algorithmic fairness, a framework that has attracted considerable attention and criticism. However, the appropriateness of this framework is unclear due to both ethical as well as technical considerations, the latter of which inclu… ▽ More

    Submitted 15 June, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: Published in the Journal of Biomedical Informatics (https://doi.org/10.1016/j.jbi.2020.103621). Version 3 updates acknowledgements and fixes typos

    Journal ref: Journal of Biomedical Informatics, Volume 113, January 2021, 103621

  7. arXiv:2001.05295  [pdf, other

    cs.CL cs.LG stat.ML

    Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data

    Authors: Ethan Steinberg, Ken Jung, Jason A. Fries, Conor K. Corbin, Stephen R. Pfohl, Nigam H. Shah

    Abstract: Widespread adoption of electronic health records (EHRs) has fueled the development of using machine learning to build prediction models for various clinical outcomes. This process is often constrained by having a relatively small number of patient records for training the model. We demonstrate that using patient representation schemes inspired from techniques in natural language processing can inc… ▽ More

    Submitted 12 May, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

  8. arXiv:1911.05861  [pdf, other

    cs.LG stat.ML

    Federated and Differentially Private Learning for Electronic Health Records

    Authors: Stephen R. Pfohl, Andrew M. Dai, Katherine Heller

    Abstract: The use of collaborative and decentralized machine learning techniques such as federated learning have the potential to enable the development and deployment of clinical risk predictions models in low-resource settings without requiring sensitive data be shared or stored in a central repository. This process necessitates communication of model weights or updates between collaborating entities, but… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  9. arXiv:1907.06260  [pdf, other

    cs.LG cs.CY stat.ML

    Counterfactual Reasoning for Fair Clinical Risk Prediction

    Authors: Stephen Pfohl, Tony Duan, Daisy Yi Ding, Nigam H. Shah

    Abstract: The use of machine learning systems to support decision making in healthcare raises questions as to what extent these systems may introduce or exacerbate disparities in care for historically underrepresented and mistreated groups, due to biases implicitly embedded in observational data in electronic health records. To address this problem in the context of clinical risk prediction models, we devel… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: Machine Learning for Healthcare 2019

  10. arXiv:1812.00371  [pdf, other

    cs.LG stat.ML

    Predicting Inpatient Discharge Prioritization With Electronic Health Records

    Authors: Anand Avati, Stephen Pfohl, Chris Lin, Thao Nguyen, Meng Zhang, Philip Hwang, Jessica Wetstone, Kenneth Jung, Andrew Ng, Nigam H. Shah

    Abstract: Identifying patients who will be discharged within 24 hours can improve hospital resource management and quality of care. We studied this problem using eight years of Electronic Health Records (EHR) data from Stanford Hospital. We fit models to predict 24 hour discharge across the entire inpatient population. The best performing models achieved an area under the receiver-operator characteristic cu… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

  11. arXiv:1809.04663  [pdf, other

    cs.LG stat.ML

    Creating Fair Models of Atherosclerotic Cardiovascular Disease Risk

    Authors: Stephen Pfohl, Ben Marafino, Adrien Coulet, Fatima Rodriguez, Latha Palaniappan, Nigam H. Shah

    Abstract: Guidelines for the management of atherosclerotic cardiovascular disease (ASCVD) recommend the use of risk stratification models to identify patients most likely to benefit from cholesterol-lowering and other therapies. These models have differential performance across race and gender groups with inconsistent behavior across studies, potentially resulting in an inequitable distribution of beneficia… ▽ More

    Submitted 14 June, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

  12. arXiv:1808.03331  [pdf, other

    stat.ML cs.LG

    The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data

    Authors: Daisy Yi Ding, ChloƩ Simpson, Stephen Pfohl, Dave C. Kale, Kenneth Jung, Nigam H. Shah

    Abstract: Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aim… ▽ More

    Submitted 5 January, 2019; v1 submitted 9 August, 2018; originally announced August 2018.

    Comments: Pacific Symposium on Biocomputing (PSB) 2019, Hawaii, https://psb.stanford.edu/psb-online/; 13 pages, 7 figures