Showing 1–26 of 26 results for author: Heller, K

  1. arXiv:2202.01034 [pdf, other]

    cs.LG cs.CY stat.ML

    Diagnosing failures of fairness transfer across distribution shift in real-world medical settings

    Authors: Jessica Schrouff, Natalie Harris, Oluwasanmi Koyejo, Ibrahim Alabdulmohsin, Eva Schnider, Krista Opsahl-Ong, Alex Brown, Subhrajit Roy, Diana Mincu, Christina Chen, Awa Dieng, Yuan Liu, Vivek Natarajan, Alan Karthikesalingam, Katherine Heller, Silvia Chiappa, Alexander D'Amour

    Abstract: Diagnosing and mitigating changes in model fairness under distribution shift is an important component of the safe deployment of machine learning in healthcare settings. Importantly, the success of any mitigation strategy strongly depends on the structure of the shift. Despite this, there has been little discussion of how to empirically assess the structure of a distribution shift that one is enco…

    Submitted 10 February, 2023; v1 submitted 2 February, 2022; originally announced February 2022.

    Journal ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

  2. arXiv:2106.15980 [pdf, other]

    stat.ML cs.LG stat.CO

    Variational Refinement for Importance Sampling Using the Forward Kullback-Leibler Divergence

    Authors: Ghassen Jerfel, Serena Wang, Clara Fannjiang, Katherine A. Heller, Yian Ma, Michael I. Jordan

    Abstract: Variational Inference (VI) is a popular alternative to asymptotically exact sampling in Bayesian inference. Its main workhorse is optimization over a reverse Kullback-Leibler divergence (RKL), which typically underestimates the tail of the posterior leading to miscalibration and potential degeneracy. Importance sampling (IS), on the other hand, is often used to fine-tune and de-bias the estimates…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)

  3. arXiv:2101.06536 [pdf, other]

    cs.LG stat.ME stat.ML

    Deep Cox Mixtures for Survival Regression

    Authors: Chirag Nagpal, Steve Yadlowsky, Negar Rostamzadeh, Katherine Heller

    Abstract: Survival analysis is a challenging variation of regression modeling because of the presence of censoring, where the outcome measurement is only partially known, due to, for example, loss to follow up. Such problems come up frequently in medical applications, making survival analysis a key endeavor in biostatistics and machine learning for healthcare, with Cox regression models being amongst the mo…

    Submitted 26 June, 2022; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: Machine Learning for Healthcare Conference, 2021

    Journal ref: Proceedings of the 6th Machine Learning for Healthcare Conference, PMLR 149:674-708, 2021

  4. arXiv:2011.03395 [pdf, other]

    cs.LG stat.ML

    Underspecification Presents Challenges for Credibility in Modern Machine Learning

    Authors: Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, et al. (15 additional authors not shown)

    Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predict…

    Submitted 24 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updates: Updated statistical analysis in Section 6; Additional citations

  5. arXiv:2005.07186 [pdf, other]

    cs.LG stat.ML

    Efficient and Scalable Bayesian Neural Nets with Rank-1 Factors

    Authors: Michael W. Dusenberry, Ghassen Jerfel, Yeming Wen, Yi-An Ma, Jasper Snoek, Katherine Heller, Balaji Lakshminarayanan, Dustin Tran

    Abstract: Bayesian neural networks (BNNs) demonstrate promising success in improving the robustness and uncertainty quantification of modern deep learning. However, they generally struggle with underfitting at scale and parameter efficiency. On the other hand, deep ensembles have emerged as alternatives for uncertainty quantification that, while outperforming BNNs on certain problems, also suffer from effic…

    Submitted 14 August, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: Published in the International Conference on Machine Learning (ICML) 2020. Code available at https://github.com/google/edward2

  6. arXiv:1911.05861 [pdf, other]

    cs.LG stat.ML

    Federated and Differentially Private Learning for Electronic Health Records

    Authors: Stephen R. Pfohl, Andrew M. Dai, Katherine Heller

    Abstract: The use of collaborative and decentralized machine learning techniques such as federated learning have the potential to enable the development and deployment of clinical risk predictions models in low-resource settings without requiring sensitive data be shared or stored in a central repository. This process necessitates communication of model weights or updates between collaborating entities, but…

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  7. Analyzing the Role of Model Uncertainty for Electronic Health Records

    Authors: Michael W. Dusenberry, Dustin Tran, Edward Choi, Jonas Kemp, Jeremy Nixon, Ghassen Jerfel, Katherine Heller, Andrew M. Dai

    Abstract: In medicine, both ethical and monetary costs of incorrect predictions can be significant, and the complexity of the problems often necessitates increasingly complex models. Recent work has shown that changing just the random seed is enough for otherwise well-tuned deep neural networks to vary in their individual predicted probabilities. In light of this, we investigate the role of model uncertaint…

    Submitted 25 March, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Published in the ACM Conference on Health, Inference, and Learning (CHIL) 2020. Code available at https://github.com/Google-Health/records-research

  8. arXiv:1812.06080 [pdf, other]

    cs.LG stat.ML

    Reconciling meta-learning and continual learning with online mixtures of tasks

    Authors: Ghassen Jerfel, Erin Grant, Thomas L. Griffiths, Katherine Heller

    Abstract: Learning-to-learn or meta-learning leverages data-driven inductive bias to increase the efficiency of learning on a novel task. This approach encounters difficulty when transfer is not advantageous, for instance, when tasks are considerably dissimilar or change over time. We use the connection between gradient-based meta-learning and hierarchical Bayes to propose a Dirichlet process mixture of hie…

    Submitted 19 June, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: updated experimental results

  9. arXiv:1807.09237 [pdf, other]

    stat.AP

    Hierarchical infinite factor model for improving the prediction of surgical complications for geriatric patients

    Authors: Elizabeth Lorenzi, Ricardo Henao, Katherine Heller

    Abstract: We develop a hierarchical infinite latent factor model (HIFM) to appropriately account for the covariance structure across subpopulations in data. We propose a novel Hierarchical Dirichlet Process shrinkage prior on the loadings matrix that flexibly captures the underlying structure of our data across subpopulations while sharing information to improve inference and prediction. The stick-breaking…

    Submitted 24 July, 2018; originally announced July 2018.

  10. arXiv:1806.06696 [pdf, other]

    stat.AP

    SMOGS: Social Network Metrics of Game Success

    Authors: Fan Bu, Sonia Xu, Katherine Heller, Alexander Volfovsky

    Abstract: This paper develops metrics from a social network perspective that are directly translatable to the outcome of a basketball game. We extend a state-of-the-art multi-resolution stochastic process approach to modeling basketball by modeling passes between teammates as directed dynamic relational links on a network and introduce multiplicative latent factors to study higher-order patterns in players'…

    Submitted 18 June, 2018; originally announced June 2018.

    Journal ref: PMLR 2019 89:2406-2414

  11. arXiv:1708.05894 [pdf, other]

    stat.ML stat.AP stat.ME

    An Improved Multi-Output Gaussian Process RNN with Real-Time Validation for Early Sepsis Detection

    Authors: Joseph Futoma, Sanjay Hariharan, Mark Sendak, Nathan Brajer, Meredith Clement, Armando Bedoya, Cara O'Brien, Katherine Heller

    Abstract: Sepsis is a poorly understood and potentially life-threatening complication that can occur as a result of infection. Early detection and treatment improves patient outcomes, and as such it poses an important challenge in medicine. In this work, we develop a flexible classifier that leverages streaming lab results, vitals, and medications to predict sepsis before it occurs. We model patient clinica…

    Submitted 19 August, 2017; originally announced August 2017.

    Comments: Presented at Machine Learning for Healthcare 2017, Boston, MA

  12. arXiv:1706.04152 [pdf, other]

    stat.ML stat.AP stat.ME

    Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier

    Authors: Joseph Futoma, Sanjay Hariharan, Katherine Heller

    Abstract: We present a scalable end-to-end classifier that uses streaming physiological and medication data to accurately predict the onset of sepsis, a life-threatening complication from infections that has high mortality and morbidity. Our proposed framework models the multivariate trajectories of continuous-valued physiological time series using multitask Gaussian processes, seamlessly accounting for the…

    Submitted 13 June, 2017; originally announced June 2017.

    Comments: Presented at 34th International Conference on Machine Learning (ICML 2017), Sydney, Australia

  13. arXiv:1612.00555 [pdf, other]

    stat.ML

    Transfer Learning via Latent Factor Modeling to Improve Prediction of Surgical Complications

    Authors: Elizabeth C Lorenzi, Zhifei Sun, Erich Huang, Ricardo Henao, Katherine A Heller

    Abstract: We aim to create a framework for transfer learning using latent factor models to learn the dependence structure between a larger source dataset and a target dataset. The methodology is motivated by our goal of building a risk-assessment model for surgery patients, using both institutional and national surgical outcomes data. The national surgical outcomes data is collected through NSQIP (National…

    Submitted 1 December, 2016; originally announced December 2016.

  14. arXiv:1608.04615 [pdf, other]

    stat.ML stat.AP stat.ME

    Scalable Modeling of Multivariate Longitudinal Data for Prediction of Chronic Kidney Disease Progression

    Authors: Joseph Futoma, Mark Sendak, C. Blake Cameron, Katherine Heller

    Abstract: Prediction of the future trajectory of a disease is an important challenge for personalized medicine and population health management. However, many complex chronic diseases exhibit large degrees of heterogeneity, and furthermore there is not always a single readily available biomarker to quantify disease severity. Even when such a clinical variable exists, there are often additional related bioma…

    Submitted 16 August, 2016; originally announced August 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

  15. arXiv:1604.07031 [pdf, other]

    stat.ME stat.AP

    Predictive Hierarchical Clustering: Learning clusters of CPT codes for improving surgical outcomes

    Authors: Elizabeth C. Lorenzi, Stephanie L. Brown, Zhifei Sun, Katherine Heller

    Abstract: We develop a novel algorithm, Predictive Hierarchical Clustering (PHC), for agglomerative hierarchical clustering of current procedural terminology (CPT) codes. Our predictive hierarchical clustering aims to cluster subgroups, not individual observations, found within our data, such that the clusters discovered result in optimal performance of a classification model. Therefore, merges are chosen b…

    Submitted 1 August, 2017; v1 submitted 24 April, 2016; originally announced April 2016.

    Comments: Accepted at MLHC 2017; to appear in JMLR

  16. arXiv:1511.04157 [pdf, other]

    stat.ML

    $k$-means: Fighting against Degeneracy in Sequential Monte Carlo with an Application to Tracking

    Authors: Kai Fan, Katherine Heller

    Abstract: For regular particle filter algorithm or Sequential Monte Carlo (SMC) methods, the initial weights are traditionally dependent on the proposed distribution, the posterior distribution at the current timestamp in the sampled sequence, and the target is the posterior distribution of the previous timestamp. This is technically correct, but leads to algorithms which usually have practical issues with…

    Submitted 12 November, 2015; originally announced November 2015.

  17. arXiv:1509.02866 [pdf, other]

    stat.ML

    Fast Second-Order Stochastic Backpropagation for Variational Inference

    Authors: Kai Fan, Ziteng Wang, Jeff Beck, James Kwok, Katherine Heller

    Abstract: We propose a second-order (Hessian or Hessian-free) based optimization method for variational inference inspired by Gaussian backpropagation, and argue that quasi-Newton optimization can be developed as well. This is accomplished by generalizing the gradient computation in stochastic backpropagation via a reparametrization trick with lower complexity. As an illustrative example, we apply this appr…

    Submitted 28 March, 2017; v1 submitted 9 September, 2015; originally announced September 2015.

    Comments: Accepted by NIPS 2015

  18. arXiv:1509.00110 [pdf, other]

    stat.AP

    Bayesian Models for Heterogeneous Personalized Health Data

    Authors: Kai Fan, Allison E. Aiello, Katherine A. Heller

    Abstract: The purpose of this study is to leverage modern technology (such as mobile or web apps in Beckman et al. (2014)) to enrich epidemiology data and infer the transmission of disease. Homogeneity related research on population level has been intensively studied in previous work. In contrast, we develop hierarchical Graph-Coupled Hidden Markov Models (hGCHMMs) to simultaneously track the spread of infe…

    Submitted 31 August, 2015; originally announced September 2015.

    Comments: 35 pages; Heterogeneous Flu Diffusion, Social Networks, Dynamic Bayesian Modeling

  19. arXiv:1506.03164 [pdf, other]

    stat.ML

    Parallelizing MCMC with Random Partition Trees

    Authors: Xiangyu Wang, Fangjian Guo, Katherine A. Heller, David B. Dunson

    Abstract: The modern scale of data has brought new challenges to Bayesian inference. In particular, conventional MCMC algorithms are computationally very expensive for large data sets. A promising approach to solve this problem is embarrassingly parallel MCMC (EP-MCMC), which first partitions the data into multiple subsets and runs independent sampling algorithms on each subset. The subset posterior draws a…

    Submitted 26 October, 2015; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: 25 pages, 9 figures

  20. arXiv:1411.2674 [pdf, other]

    stat.ML cs.CL cs.LG cs.SI

    The Bayesian Echo Chamber: Modeling Social Influence via Linguistic Accommodation

    Authors: Fangjian Guo, Charles Blundell, Hanna Wallach, Katherine Heller

    Abstract: We present the Bayesian Echo Chamber, a new Bayesian generative model for social interaction data. By modeling the evolution of people's language usage over time, this model discovers latent influence relationships between them. Unlike previous work on inferring influence, which has primarily focused on simple temporal dynamics evidenced via turn-taking behavior, our model captures more nuanced in…

    Submitted 27 January, 2015; v1 submitted 10 November, 2014; originally announced November 2014.

    Comments: 14 pages, 7 figures, to appear in AISTATS 2015. Fixed minor formatting issues

  21. arXiv:1210.4864 [pdf]

    cs.SI physics.soc-ph stat.AP

    Graph-Coupled HMMs for Modeling the Spread of Infection

    Authors: Wen Dong, Alex Pentland, Katherine A. Heller

    Abstract: We develop Graph-Coupled Hidden Markov Models (GCHMMs) for modeling the spread of infectious disease locally within a social network. Unlike most previous research in epidemiology, which typically models the spread of infection at the level of entire populations, we successfully leverage mobile phone data collected from 84 people over an extended period of time to model the spread of infection on…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-227-236

  22. arXiv:1204.0168 [pdf]

    stat.AP cs.MA cs.SI physics.soc-ph

    Modeling Infection with Multi-agent Dynamics

    Authors: Wen Dong, Katherine A. Heller, Alex Sandy Pentland

    Abstract: Developing the ability to comprehensively study infections in small populations enables us to improve epidemic models and better advise individuals about potential risks to their health. We currently have a limited understanding of how infections spread within a small population because it has been difficult to closely track an infection within a complete community. The paper presents data closely…

    Submitted 11 October, 2014; v1 submitted 1 April, 2012; originally announced April 2012.

  23. arXiv:1203.3468 [pdf]

    cs.LG stat.ML

    Bayesian Rose Trees

    Authors: Charles Blundell, Yee Whye Teh, Katherine A. Heller

    Abstract: Hierarchical structure is ubiquitous in data across many domains. There are many hierarchical clustering methods, frequently used by domain experts, which strive to discover this structure. However, most of these methods limit discoverable hierarchies to those with binary branching structure. This limitation, while computationally convenient, is often undesirable. In this paper we explore a Bayesi…

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-65-72

  24. arXiv:1106.1157 [pdf, other]

    cs.LG cs.AI stat.ML

    Bayesian and L1 Approaches to Sparse Unsupervised Learning

    Authors: Shakir Mohamed, Katherine Heller, Zoubin Ghahramani

    Abstract: The use of L1 regularisation for sparse learning has generated immense research interest, with successful application in such diverse areas as signal acquisition, image coding, genomics and collaborative filtering. While existing work highlights the many advantages of L1 methods, in this paper we find that L1 regularisation often dramatically underperforms in terms of predictive performance when c…

    Submitted 17 August, 2012; v1 submitted 6 June, 2011; originally announced June 2011.

    Comments: In Proceedings of the 29th International Conference on Machine Learning (ICML), Edinburgh, Scotland, 2012

  25. arXiv:0912.5193 [pdf, ps, other]

    stat.ME cs.LG physics.soc-ph q-bio.QM stat.AP

    Ranking relations using analogies in biological and information networks

    Authors: Ricardo Silva, Katherine Heller, Zoubin Ghahramani, Edoardo M. Airoldi

    Abstract: Analogical reasoning depends fundamentally on the ability to learn and generalize about relations between objects. We develop an approach to relational learning which, given a set of pairs of objects $\mathbf{S}=\{A^{(1)}:B^{(1)},A^{(2)}:B^{(2)},\ldots,A^{(N)}:B^{(N)}\}$, measures how well other pairs $A:B$ fit in with the set $\mathbf{S}$. Our work addresses the following question: is the relation…

    Submitted 29 August, 2013; v1 submitted 28 December, 2009; originally announced December 2009.

    Comments: Published at http://dx.doi.org/10.1214/09-AOAS321 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS321

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 2, 615-644

  26. arXiv:0801.0461 [pdf, other]

    stat.ME math.ST

    An Alternative Prior Process for Nonparametric Bayesian Clustering

    Authors: Hanna M. Wallach, Shane T. Jensen, Lee Dicker, Katherine A. Heller

    Abstract: Prior distributions play a crucial role in Bayesian approaches to clustering. Two commonly-used prior distributions are the Dirichlet and Pitman-Yor processes. In this paper, we investigate the predictive probabilities that underlie these processes, and the implicit "rich-get-richer" characteristic of the resulting partitions. We explore an alternative prior for nonparametric Bayesian clustering -…

    Submitted 15 October, 2010; v1 submitted 2 January, 2008; originally announced January 2008.

    Journal ref: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS) 2010, JMLR W&CP 9, pp. 892-899