Skip to main content

Showing 1–11 of 11 results for author: Desai, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.22943  [pdf

    cs.CL stat.ME

    A chart review process aided by natural language processing and multi-wave adaptive sampling to expedite validation of code-based algorithms for large database studies

    Authors: Shirley V Wang, Georg Hahn, Sushama Kattinakere Sreedhara, Mufaddal Mahesri, Haritha S. Pillai, Rajendra Aldis, Joyce Lii, Sarah K. Dutcher, Rhoda Eniafe, Jamal T. Jones, Keewan Kim, Jiwei He, Hana Lee, Sengwee Toh, Rishi J Desai, Jie Yang

    Abstract: Background: One of the ways to enhance analyses conducted with large claims databases is by validating the measurement characteristics of code-based algorithms used to identify health outcomes or other key study parameters of interest. These metrics can be used in quantitative bias analyses to assess the robustness of results for an inferential study given potential bias from outcome misclassifica… ▽ More

    Submitted 25 July, 2025; originally announced July 2025.

  2. arXiv:2504.11740  [pdf, other

    stat.ME

    A cautionary note for plasmode simulation studies in the setting of causal inference

    Authors: Pamela A Shaw, Susan Gruber, Brian D. Williamson, Rishi Desai, Susan M. Shortreed, Chloe Krakauer, Jennifer C. Nelson, Mark J. van der Laan

    Abstract: Plasmode simulation has become an important tool for evaluating the operating characteristics of different statistical methods in complex settings, such as pharmacoepidemiological studies of treatment effectiveness using electronic health records (EHR) data. These studies provide insight into how estimator performance is impacted by challenges including rare events, small sample size, etc., that c… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 55 pages, 6 tables, 2 figures, 8 supplemental tables, 4 supplemental figures

  3. arXiv:2412.15012  [pdf, other

    stat.ME

    Assessing treatment effects in observational data with missing confounders: A comparative study of practical doubly-robust and traditional missing data methods

    Authors: Brian D. Williamson, Chloe Krakauer, Eric Johnson, Susan Gruber, Bryan E. Shepherd, Mark J. van der Laan, Thomas Lumley, Hana Lee, Jose J. Hernandez Munoz, Fengyu Zhao, Sarah K. Dutcher, Rishi Desai, Gregory E. Simon, Susan M. Shortreed, Jennifer C. Nelson, Pamela A. Shaw

    Abstract: In pharmacoepidemiology, safety and effectiveness are frequently evaluated using readily available administrative and electronic health records data. In these settings, detailed confounder data are often not available in all data sources and therefore missing on a subset of individuals. Multiple imputation (MI) and inverse-probability weighting (IPW) are go-to analytical methods to handle missing… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 142 pages (27 main, 115 supplemental); 6 figures, 2 tables

  4. arXiv:2405.10925  [pdf

    stat.ME cs.AI cs.LG

    High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates

    Authors: Janick Weberpals, Pamela A. Shaw, Kueiyu Joshua Lin, Richard Wyss, Joseph M Plasek, Li Zhou, Kerry Ngan, Thomas DeRamus, Sudha R. Raman, Bradley G. Hammill, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Sebastian Schneeweiss, Rishi J. Desai

    Abstract: Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  5. arXiv:2402.09483  [pdf, ps, other

    stat.ML cs.CR cs.LG

    Oracle-Efficient Differentially Private Learning with Public Data

    Authors: Adam Block, Mark Bun, Rathin Desai, Abhishek Shetty, Steven Wu

    Abstract: Due to statistical lower bounds on the learnability of many function classes under privacy constraints, there has been recent interest in leveraging public data to improve the performance of private learning algorithms. In this model, algorithms must always guarantee differential privacy with respect to the private samples while also ensuring learning guarantees when the private data distribution… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2311.01625  [pdf, other

    stat.ME q-bio.QM

    Topological inference on brain networks across subtypes of post-stroke aphasia

    Authors: Yuan Wang, Jian Yin, Rutvik H. Desai

    Abstract: Persistent homology (PH) characterizes the shape of brain networks through the persistence features. Group comparison of persistence features from brain networks can be challenging as they are inherently heterogeneous. A recent scale-space representation of persistence diagram (PD) through heat diffusion reparameterizes using the finite number of Fourier coefficients with respect to the Laplace-Be… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  7. arXiv:2306.12115  [pdf

    stat.ML cs.HC stat.AP

    Explaining human body responses in random vibration: Effect of motion direction, sitting posture, and anthropometry

    Authors: M. M. Cvetković, R. Desai, K. N. de Winkel, G. Papaioannou, R. Happee

    Abstract: This study investigates the effects of anthropometric attributes, biological sex, and posture on translational body kinematic responses in translational vibrations. In total, 35 participants were recruited. Perturbations were applied on a standard car seat using a motion-based platform with 0.1 to 12.0 Hz random noise signals, with 0.3 m/s2 rms acceleration, for 60 seconds. Multiple linear regress… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures, conference named "26th IEEE International Conference on Intelligent Transportation Systems ITSC 2023"

    MSC Class: 82Dxx ACM Class: I.6.5

  8. arXiv:2302.03250  [pdf, other

    q-bio.NC stat.AP

    Network-based Statistics Distinguish Anomic and Broca Aphasia

    Authors: Xingpei Zhao, Nicholas Riccardi, Rutvik H. Desai, Dirk-Bart den Ouden, Julius Fridriksson, Yuan Wang

    Abstract: Aphasia is a speech-language impairment commonly caused by damage to the left hemisphere. Due to the complexity of speech-language processing, the neural mechanisms that underpin various symptoms between different types of aphasia are still not fully understood. We used the network-based statistic method to identify distinct subnetwork(s) of connections differentiating the resting-state functional… ▽ More

    Submitted 17 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  9. arXiv:2006.16241  [pdf, other

    cs.CV cs.LG stat.ML

    The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

    Authors: Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer

    Abstract: We introduce four new real-world distribution shift datasets consisting of changes in image style, image blurriness, geographic location, camera operation, and more. With our new datasets, we take stock of previously proposed methods for improving out-of-distribution robustness and put them to the test. We find that using larger models and artificial data augmentations can improve robustness on re… ▽ More

    Submitted 24 July, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: ICCV 2021; Datasets, code, and models available at https://github.com/hendrycks/imagenet-r

  10. arXiv:2006.09735  [pdf, other

    stat.ML cs.DS cs.LG math.ST stat.CO

    Efficient Statistics for Sparse Graphical Models from Truncated Samples

    Authors: Arnab Bhattacharyya, Rathin Desai, Sai Ganesh Nagarajan, Ioannis Panageas

    Abstract: In this paper, we study high-dimensional estimation from truncated samples. We focus on two fundamental and classical problems: (i) inference of sparse Gaussian graphical models and (ii) support recovery of sparse linear models. (i) For Gaussian graphical models, suppose $d$-dimensional samples ${\bf x}$ are generated from a Gaussian $N(μ,Σ)$ and observed only if they belong to a subset… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  11. arXiv:1907.04805  [pdf, other

    cs.LG stat.ME stat.ML

    Quantifying Error in the Presence of Confounders for Causal Inference

    Authors: Rathin Desai, Amit Sharma

    Abstract: Estimating average causal effect (ACE) is useful whenever we want to know the effect of an intervention on a given outcome. In the absence of a randomized experiment, many methods such as stratification and inverse propensity weighting have been proposed to estimate ACE. However, it is hard to know which method is optimal for a given dataset or which hyperparameters to use for a chosen method. To… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: 13 pages