Showing 1–15 of 15 results for author: Bahadori, M T

  1. arXiv:2412.02893

    cs.CL cs.AI cs.LG stat.AP stat.ME

    Removing Spurious Correlation from Neural Network Interpretations

    Authors: Milad Fotouhi, Mohammad Taha Bahadori, Oluwaseyi Feyisetan, Payman Arabshahi, David Heckerman

    Abstract: The existing algorithms for identification of neurons responsible for undesired and harmful behaviors do not consider the effects of confounders such as topic of the conversation. In this work, we show that confounders can create spurious correlations and propose a new causal mediation approach that controls the impact of the topic. In experiments with two large language models, we study the local…

    Submitted 3 December, 2024; originally announced December 2024.

  2. arXiv:2408.11852

    cs.CL cs.AI cs.LG

    Fast Training Dataset Attribution via In-Context Learning

    Authors: Milad Fotouhi, Mohammad Taha Bahadori, Oluwaseyi Feyisetan, Payman Arabshahi, David Heckerman

    Abstract: We investigate the use of in-context learning and prompt engineering to estimate the contributions of training data in the outputs of instruction-tuned large language models (LLMs). We propose two novel approaches: (1) a similarity-based approach that measures the difference between LLM outputs with and without provided context, and (2) a mixture distribution model approach that frames the problem…

    Submitted 18 March, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

  3. arXiv:2404.08839

    stat.ME cs.LG econ.EM stat.ML

    Multiply-Robust Causal Change Attribution

    Authors: Victor Quintas-Martinez, Mohammad Taha Bahadori, Eduardo Santiago, Jeff Mu, Dominik Janzing, David Heckerman

    Abstract: Comparing two samples of data, we observe a change in the distribution of an outcome variable. In the presence of multiple explanatory variables, how much of the change can be explained by each possible cause? We develop a new estimation strategy that, given a causal model, combines regression and re-weighting methods to quantify the contribution of each causal mechanism. Our proposed methodology…

    Submitted 5 September, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  4. arXiv:2107.13068

    cs.LG stat.ME stat.ML

    End-to-End Balancing for Causal Continuous Treatment-Effect Estimation

    Authors: Mohammad Taha Bahadori, Eric Tchetgen Tchetgen, David E. Heckerman

    Abstract: We study the problem of observational causal inference with continuous treatments in the framework of inverse propensity-score weighting. To obtain stable weights, we design a new algorithm based on entropy balancing that learns weights to directly maximize causal inference accuracy using end-to-end optimization. In the process of optimization, these weights are automatically tuned to the specific…

    Submitted 10 July, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: To be presented in ICML 2022

    MSC Class: 62D20; ACM Class: I.2.6

  5. arXiv:2007.11500

    cs.LG stat.ML

    Debiasing Concept-based Explanations with Causal Analysis

    Authors: Mohammad Taha Bahadori, David E. Heckerman

    Abstract: The concept-based explanation approach is a popular model interpretability tool because it expresses the reasons for a model's predictions in terms of concepts that are meaningful for the domain experts. In this work, we study the problem of the concepts being correlated with confounding information in the features. We propose a new causal prior graph for modeling the impacts of unobserved variables a…

    Submitted 22 May, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: Accepted in ICLR 2021

  6. arXiv:1911.03295

    cs.LG q-bio.QM stat.ML

    Discovering Invariances in Healthcare Neural Networks

    Authors: Mohammad Taha Bahadori, Layne C. Price

    Abstract: We study the invariance characteristics of pre-trained predictive models by empirically learning transformations on the input that leave the prediction function approximately unchanged. To learn invariant transformations, we minimize the Wasserstein distance between the predictive distribution conditioned on the data instances and the predictive distribution conditioned on the transformed data ins…

    Submitted 3 March, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: The extended version

  7. arXiv:1904.12206

    cs.LG q-bio.QM stat.ML

    Temporal-Clustering Invariance in Irregular Healthcare Time Series

    Authors: Mohammad Taha Bahadori, Zachary Chase Lipton

    Abstract: Electronic records contain sequences of events, some of which take place all at once in a single visit, and others that are dispersed over multiple visits, each with a different timestamp. We postulate that fine temporal detail, e.g., whether a series of blood tests is completed at once or in rapid succession, should not alter predictions based on this data. Motivated by this intuition, we propose…

    Submitted 27 April, 2019; originally announced April 2019.

  8. arXiv:1811.12276

    cs.CL cs.AI cs.LG

    Improving Hospital Mortality Prediction with Medical Named Entities and Multimodal Learning

    Authors: Mengqi Jin, Mohammad Taha Bahadori, Aaron Colak, Parminder Bhatia, Busra Celikkaya, Ram Bhakta, Selvan Senthivel, Mohammed Khalilia, Daniel Navarro, Borui Zhang, Tiberiu Doman, Arun Ravi, Matthieu Liger, Taha Kass-hout

    Abstract: Clinical text provides essential information to estimate the acuity of a patient during hospital stays in addition to structured clinical data. In this study, we explore how clinical text can complement a clinical predictive learning task. We leverage an internal medical natural language processing service to perform named entity extraction and negation detection on clinical notes and compose sele…

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  9. arXiv:1702.02604

    cs.LG cs.AI cs.NE stat.ML

    Causal Regularization

    Authors: Mohammad Taha Bahadori, Krzysztof Chalupka, Edward Choi, Robert Chen, Walter F. Stewart, Jimeng Sun

    Abstract: In application domains such as healthcare, we want accurate predictive models that are also causally interpretable. In pursuit of such models, we propose a causal regularizer to steer predictive models towards causally-interpretable solutions and theoretically study its properties. In a large-scale analysis of Electronic Health Records (EHR), our causally-regularized model outperforms its L1-regul…

    Submitted 23 February, 2017; v1 submitted 8 February, 2017; originally announced February 2017.

    Comments: Adding theoretical analysis, revising the text

  10. arXiv:1611.07012

    cs.LG stat.ML

    GRAM: Graph-based Attention Model for Healthcare Representation Learning

    Authors: Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F. Stewart, Jimeng Sun

    Abstract: Deep learning methods exhibit promising performance for predictive modeling in healthcare, but two important challenges remain: (1) Data insufficiency: Often in healthcare predictive modeling, the sample size is insufficient for deep learning methods to achieve satisfactory results. (2) Interpretation: The representations learned by deep learning methods should align with medical knowledge. To address the…

    Submitted 1 April, 2017; v1 submitted 21 November, 2016; originally announced November 2016.

  11. arXiv:1608.05745

    cs.LG cs.AI cs.NE

    RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism

    Authors: Edward Choi, Mohammad Taha Bahadori, Joshua A. Kulas, Andy Schuetz, Walter F. Stewart, Jimeng Sun

    Abstract: Accuracy and interpretability are two dominant features of successful predictive models. Typically, a choice must be made in favor of complex black box models such as recurrent neural networks (RNN) for accuracy versus less accurate but more interpretable traditional models such as logistic regression. This tradeoff poses challenges in medicine where both accuracy and interpretability are importan…

    Submitted 26 February, 2017; v1 submitted 19 August, 2016; originally announced August 2016.

    Comments: Accepted at Neural Information Processing Systems (NIPS) 2016

  12. arXiv:1608.03686

    stat.ME

    Scalable Interpretable Multi-Response Regression via SEED

    Authors: Mohammad Taha Bahadori, Zemin Zheng, Yan Liu, Jinchi Lv

    Abstract: Sparse reduced-rank regression is an important tool to uncover meaningful dependence structure between large numbers of predictors and responses in many big data applications such as genome-wide association studies and social media analysis. Despite the recent theoretical and algorithmic advances, scalable estimation of sparse reduced-rank regression remains largely unexplored. In this paper, we s…

    Submitted 12 August, 2016; originally announced August 2016.

    Comments: 31 pages, 7 figures

  13. arXiv:1602.06468

    cs.LG

    FLASH: Fast Bayesian Optimization for Data Analytic Pipelines

    Authors: Yuyu Zhang, Mohammad Taha Bahadori, Hang Su, Jimeng Sun

    Abstract: Modern data science relies on data analytic pipelines to organize interdependent computational steps. Such analytic pipelines often involve different algorithms across multiple steps, each with its own hyperparameters. To achieve the best performance, it is often critical to select optimal algorithms and to set appropriate hyperparameters, which requires large computational efforts. Bayesian optim…

    Submitted 23 June, 2016; v1 submitted 20 February, 2016; originally announced February 2016.

    Comments: 21 pages, KDD 2016

  14. arXiv:1602.05568

    cs.LG

    Multi-layer Representation Learning for Medical Concepts

    Authors: Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Jimeng Sun

    Abstract: Learning efficient representations for concepts has been proven to be an important basis for many applications such as machine translation or document classification. Proper representations of medical concepts such as diagnosis, medication, procedure codes and visits will have broad applications in healthcare analytics. However, in Electronic Health Records (EHR) the visit sequences of patients in…

    Submitted 17 February, 2016; originally announced February 2016.

  15. arXiv:1511.05942

    cs.LG

    Doctor AI: Predicting Clinical Events via Recurrent Neural Networks

    Authors: Edward Choi, Mohammad Taha Bahadori, Andy Schuetz, Walter F. Stewart, Jimeng Sun

    Abstract: Leveraging large historical data in electronic health records (EHR), we developed Doctor AI, a generic predictive model that covers observed medical conditions and medication uses. Doctor AI is a temporal model using recurrent neural networks (RNN) and was developed and applied to longitudinal time-stamped EHR data from 260K patients over 8 years. Encounter records (e.g. diagnosis codes, medication…

    Submitted 28 September, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA