Showing 1–24 of 24 results for author: Saria, S

  1. arXiv:2505.11785

    cs.LG cs.AI stat.ML

    Improving Coverage in Combined Prediction Sets with Weighted p-values

    Authors: Gina Wong, Drew Prinster, Suchi Saria, Rama Chellappa, Anqi Liu

    Abstract: Conformal prediction quantifies the uncertainty of machine learning models by augmenting point predictions with valid prediction sets, assuming exchangeability. For complex scenarios involving multiple trials, models, or data sources, conformal prediction sets can be aggregated to create a prediction set that captures the overall uncertainty, often improving precision. However, aggregating multipl…

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2505.04608

    cs.LG cs.AI stat.ML

    WATCH: Adaptive Monitoring for AI Deployments via Weighted-Conformal Martingales

    Authors: Drew Prinster, Xing Han, Anqi Liu, Suchi Saria

    Abstract: Responsibly deploying artificial intelligence (AI) / machine learning (ML) systems in high-stakes settings arguably requires not only proof of system reliability, but also continual, post-deployment monitoring to quickly detect and address any unsafe behavior. Methods for nonparametric sequential testing -- especially conformal test martingales (CTMs) and anytime-valid inference -- offer promising…

    Submitted 1 June, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: To be published in The International Conference on Machine Learning (ICML), 2025

  3. arXiv:2410.02935

    stat.ML cs.LG

    On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions

    Authors: Huy Nguyen, Xing Han, Carl Harris, Suchi Saria, Nhat Ho

    Abstract: With the growing prominence of the Mixture of Experts (MoE) architecture in developing large-scale foundation models, we investigate the Hierarchical Mixture of Experts (HMoE), a specialized variant of MoE that excels in handling complex inputs and improving performance on targeted tasks. Our analysis highlights the advantages of using the Laplace gating function over the traditional Softmax gatin…

    Submitted 6 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Huy Nguyen and Xing Han contributed equally to this work

  4. arXiv:2405.06627

    cs.LG cs.AI stat.ML

    Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

    Authors: Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria

    Abstract: As artificial intelligence (AI) / machine learning (ML) gain widespread adoption, practitioners are increasingly seeking means to quantify and control the risk these systems incur. This challenge is especially salient when such systems have autonomy to collect their own data, such as in black-box optimization and active learning, where their actions induce sequential feedback-loop shifts in the da…

    Submitted 5 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code available at https://github.com/drewprinster/conformal-mfcs

  5. arXiv:2207.10716

    cs.LG stat.ML

    JAWS: Auditing Predictive Uncertainty Under Covariate Shift

    Authors: Drew Prinster, Anqi Liu, Suchi Saria

    Abstract: We propose \textbf{JAWS}, a series of wrapper methods for distribution-free uncertainty quantification tasks under covariate shift, centered on the core method \textbf{JAW}, the \textbf{JA}ckknife+ \textbf{W}eighted with data-dependent likelihood-ratio weights. JAWS also includes computationally efficient \textbf{A}pproximations of JAW using higher-order influence functions: \textbf{JAWA}. Theoret…

    Submitted 23 November, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems

  6. arXiv:2012.12449

    stat.ML cs.LG stat.ME

    Partial Identifiability in Discrete Data With Measurement Error

    Authors: Noam Finkelstein, Roy Adams, Suchi Saria, Ilya Shpitser

    Abstract: When data contains measurement errors, it is necessary to make assumptions relating the observed, erroneous data to the unobserved true phenomena of interest. These assumptions should be justifiable on substantive grounds, but are often motivated by mathematical convenience, for the sake of exactly identifying the target of inference. We adopt the view that it is preferable to present bounds under…

    Submitted 22 December, 2020; originally announced December 2020.

  7. arXiv:2011.15099

    stat.ME stat.AP

    The Impact of Time Series Length and Discretization on Longitudinal Causal Estimation Methods

    Authors: Roy Adams, Suchi Saria, Michael Rosenblum

    Abstract: The use of observational time series data to assess the impact of multi-time point interventions is becoming increasingly common as more health and activity data are collected and digitized via wearables, social media, and electronic health records. Such time series may involve hundreds or thousands of irregularly sampled observations. One common analysis approach is to simplify such time series b…

    Submitted 30 November, 2020; originally announced November 2020.

  8. arXiv:2010.15100

    cs.LG stat.ML

    Evaluating Model Robustness and Stability to Dataset Shift

    Authors: Adarsh Subbaswamy, Roy Adams, Suchi Saria

    Abstract: As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which typically requires applying the model to multiple, independent datasets. Since the cost of collecting such datasets is often prohibitive, in this paper, we propose a fr…

    Submitted 15 March, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

  9. arXiv:2002.08948

    stat.ML cs.AI cs.LG

    I-SPEC: An End-to-End Framework for Learning Transportable, Shift-Stable Models

    Authors: Adarsh Subbaswamy, Suchi Saria

    Abstract: Shifts in environment between development and deployment cause classical supervised learning to produce models that fail to generalize well to new target distributions. Recently, many solutions which find invariant predictive distributions have been developed. Among these, graph-based approaches do not require data from the target environment and can capture more stable information than alternativ…

    Submitted 20 February, 2020; originally announced February 2020.

  10. arXiv:1905.11374

    stat.ML cs.AI cs.LG

    A Unifying Causal Framework for Analyzing Dataset Shift-stable Learning Algorithms

    Authors: Adarsh Subbaswamy, Bryant Chen, Suchi Saria

    Abstract: Recent interest in the external validity of prediction models (i.e., the problem of different train and test distributions, known as dataset shift) has produced many methods for finding predictive distributions that are invariant to dataset shifts and can be used for prediction in new, unseen environments. However, these methods consider different types of shifts and have been developed under disp…

    Submitted 18 July, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Published in the Journal of Causal Inference

    Journal ref: Journal of Causal Inference, 10(1), 64-89

  11. arXiv:1904.05268

    stat.ML cs.LG

    Active Learning for Decision-Making from Imbalanced Observational Data

    Authors: Iiris Sundin, Peter Schulam, Eero Siivola, Aki Vehtari, Suchi Saria, Samuel Kaski

    Abstract: Machine learning can help personalized decision support by learning models to predict individual treatment effects (ITE). This work studies the reliability of prediction-based decision-making in a task of deciding which action $a$ to take for a target unit after observing its covariates $\tilde{x}$ and predicted outcomes $\hat{p}(\tilde{y} \mid \tilde{x}, a)$. An example case is personalized medic…

    Submitted 6 June, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Published in Proceedings of the 36th International Conference on Machine Learning (ICML) 2019. 15 pages (10 paper + 5 supplementary), 7 figures

  12. arXiv:1901.09060

    stat.ML cs.LG

    Learning Models from Data with Measurement Error: Tackling Underreporting

    Authors: Roy Adams, Yuelong Ji, Xiaobin Wang, Suchi Saria

    Abstract: Measurement error in observational datasets can lead to systematic bias in inferences based on these datasets. As studies based on observational data are increasingly used to inform decisions with real-world impact, it is critical that we develop a robust set of techniques for analyzing and adjusting for these biases. In this paper we present a method for estimating the distribution of an outcome…

    Submitted 25 January, 2019; originally announced January 2019.

  13. arXiv:1901.00403

    stat.ML cs.LG stat.ME

    Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

    Authors: Peter Schulam, Suchi Saria

    Abstract: To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability often require new learning algorithms (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a model after it is trained. In this paper, we describe resampling uncert…

    Submitted 28 February, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of Artificial Intelligence and Statistics (AISTATS) 2019

  14. arXiv:1812.04597

    stat.ML cs.AI cs.LG

    Preventing Failures Due to Dataset Shift: Learning Predictive Models That Transport

    Authors: Adarsh Subbaswamy, Peter Schulam, Suchi Saria

    Abstract: Classical supervised learning produces unreliable models when training and target distributions differ, with most existing solutions requiring samples from the target domain. We propose a proactive approach which learns a relationship in the training domain that will generalize to the target domain by incorporating prior knowledge of aspects of the data generating process that are expected to diff…

    Submitted 28 February, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 2019. Previously presented at the NeurIPS 2018 Causal Learning Workshop

  15. arXiv:1810.03025

    stat.ML cs.AI cs.LG eess.SY

    Discretizing Logged Interaction Data Biases Learning for Decision-Making

    Authors: Peter Schulam, Suchi Saria

    Abstract: Time series data that are not measured at regular intervals are commonly discretized as a preprocessing step. For example, data about customer arrival times might be simplified by summing the number of arrivals within hourly intervals, which produces a discrete-time time series that is easier to model. In this abstract, we show that discretization introduces a bias that affects models trained for…

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: This is a standalone short paper describing a new type of bias that can arise when learning from time series data for sequential decision-making problems

  16. arXiv:1808.03253

    stat.ML cs.LG

    Counterfactual Normalization: Proactively Addressing Dataset Shift and Improving Reliability Using Causal Mechanisms

    Authors: Adarsh Subbaswamy, Suchi Saria

    Abstract: Predictive models can fail to generalize from training to deployment environments because of dataset shift, posing a threat to model reliability and the safety of downstream decisions made in practice. Instead of using samples from the target distribution to reactively correct dataset shift, we use graphical knowledge of the causal mechanisms relating variables in a prediction problem to proactive…

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence (UAI), 2018. Revised from print version

  17. arXiv:1708.04757

    stat.ML cs.AI cs.LG

    Scalable Joint Models for Reliable Uncertainty-Aware Event Prediction

    Authors: Hossein Soleimani, James Hensman, Suchi Saria

    Abstract: Missing data and noisy observations pose significant challenges for reliably predicting events from irregularly sampled multivariate time series (longitudinal) data. Imputation methods, which are typically used for completing the data prior to event prediction, lack a principled mechanism to account for the uncertainty due to missingness. Alternatively, state-of-the-art joint modeling techniques c…

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

  18. arXiv:1704.02038

    stat.ML cs.AI cs.LG

    Treatment-Response Models for Counterfactual Reasoning with Continuous-time, Continuous-valued Interventions

    Authors: Hossein Soleimani, Adarsh Subbaswamy, Suchi Saria

    Abstract: Treatment effects can be estimated from observational data as the difference in potential outcomes. In this paper, we address the challenge of estimating the potential outcome when treatment-dose levels can vary continuously over time. Further, the outcome variable may not be measured at a regular frequency. Our proposed solution represents the treatment response curves using linear time-invariant…

    Submitted 4 November, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: In Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence (UAI-2017), Sydney, Australia, August 2017. The first two authors contributed equally to this work

  19. arXiv:1703.10651

    stat.ML cs.AI cs.LG

    Reliable Decision Support using Counterfactual Models

    Authors: Peter Schulam, Suchi Saria

    Abstract: Decision-makers are faced with the challenge of estimating what is likely to happen when they take an action. For instance, if I choose not to treat this patient, are they likely to die? Practitioners commonly use supervised learning algorithms to fit predictive models that help decision-makers reason about likely future outcomes, but we show that this approach is unreliable, and sometimes even da…

    Submitted 1 February, 2018; v1 submitted 30 March, 2017; originally announced March 2017.

    Comments: Published in the proceedings of Neural Information Processing Systems (NIPS) 2017

  20. arXiv:1608.05182

    cs.LG stat.ML

    A Bayesian Nonparametric Approach for Estimating Individualized Treatment-Response Curves

    Authors: Yanbo Xu, Yanxun Xu, Suchi Saria

    Abstract: We study the problem of estimating the continuous response over time to interventions using observational time series---a retrospective dataset where the policy by which the data are generated is unknown to the learner. We are motivated by applications where response varies by individuals and therefore, estimating responses at the individual-level is valuable for personalizing decision-making. We…

    Submitted 10 December, 2016; v1 submitted 18 August, 2016; originally announced August 2016.

  21. arXiv:1604.05819

    stat.ML cs.LG

    Trading-Off Cost of Deployment Versus Accuracy in Learning Predictive Models

    Authors: Daniel P. Robinson, Suchi Saria

    Abstract: Predictive models are finding an increasing number of applications in many industries. As a result, a practical means for trading-off the cost of deploying a model versus its effectiveness is needed. Our work is motivated by risk prediction problems in healthcare. Cost-structures in domains such as healthcare are quite complex, posing a significant challenge to existing approaches. We propose a no…

    Submitted 20 April, 2016; originally announced April 2016.

    Comments: Authors contributed equally to this work. To appear in IJCAI 2016, Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

  22. arXiv:1601.04674

    stat.ML

    A Framework for Individualizing Predictions of Disease Trajectories by Exploiting Multi-Resolution Structure

    Authors: Peter Schulam, Suchi Saria

    Abstract: For many complex diseases, there is a wide variety of ways in which an individual can manifest the disease. The challenge of personalized medicine is to develop tools that can accurately predict the trajectory of an individual's disease, which can in turn enable clinicians to optimize treatments. We represent an individual's disease trajectory as a continuous-valued continuous-time function descri…

    Submitted 21 January, 2016; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: Appeared in Neural Information Processing Systems (NIPS) 2015

  23. arXiv:1507.07295

    cs.AI stat.AP

    Learning (Predictive) Risk Scores in the Presence of Censoring due to Interventions

    Authors: Kirill Dyagilev, Suchi Saria

    Abstract: A large and diverse set of measurements are regularly collected during a patient's hospital stay to monitor their health status. Tools for integrating these measurements into severity scores that accurately track changes in illness severity can improve clinicians' ability to provide timely interventions. Existing approaches for creating such scores either 1) rely on experts to fully specify the s…

    Submitted 26 July, 2015; originally announced July 2015.

    Journal ref: Machine Learning Journal, Special Issue on Machine Learning for Health and Medicine, pp. 1-26, 2015

  24. arXiv:1008.2028

    stat.ML cs.AI stat.ME

    Discovering shared and individual latent structure in multiple time series

    Authors: Suchi Saria, Daphne Koller, Anna Penn

    Abstract: This paper proposes a nonparametric Bayesian method for exploratory data analysis and feature construction in continuous time series. Our method focuses on understanding shared features in a set of time series that exhibit significant individual variability. Our method builds on the framework of latent Dirichlet allocation (LDA) and its extension to hierarchical Dirichlet processes, which allows…

    Submitted 11 August, 2010; originally announced August 2010.

    Comments: Additional supplementary section in tex file