Skip to main content

Showing 1–48 of 48 results for author: Kosorok, M R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.16780  [pdf, other

    math.ST stat.ME

    Linear Regression Using Hilbert-Space-Valued Covariates with Unknown Reproducing Kernel

    Authors: Xinyi Li, Margaret Hoch, Michael R. Kosorok

    Abstract: We present a new method of linear regression based on principal components using Hilbert-space-valued covariates with unknown reproducing kernels. We develop a computationally efficient approach to estimation and derive asymptotic theory for the regression parameter estimates under mild assumptions. We demonstrate the approach in simulation studies as well as in data analysis using two-dimensional… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2503.19127  [pdf, other

    stat.AP

    Statistical Design and Rationale of the Biomarkers for Evaluating Spine Treatments (BEST) Trial

    Authors: John Sperger, Kelley M. Kidwell, Matthew C. Mauck, Beibo Zhao, Kevin J. Anstrom, Anna Batorsky, Timothy S. Carey, Daniel J. Clauw, Nikki L. B. Freeman, Carol M. Greco, Anastasia Ivanova, Sara Jones Berkeley, Samuel A. McLean, Matthew A. Psioda, Bryce Rowland, Gwendolyn A. Sowa, Ajay D. Wasan, Joshua P Zitovsky, Michael R. Kosorok

    Abstract: Chronic low back pain (cLBP) is a prevalent condition with profound impacts on functioning and quality of life. While multiple evidence-based treatments exist, they all have modest average treatment effects$\unicode{x2013}$potentially due to individual variation in treatment response and the diverse etiologies of cLBP. This multi-site sequential, multiple-assignment randomized trial (SMART) invest… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 18 pages, 3 figures, 1 algorithm, 1 table

  3. arXiv:2501.18070  [pdf, other

    stat.ME

    An optimal dynamic treatment regime estimator for indefinite-horizon survival outcomes

    Authors: Jane She, Matthew Egberg, Michael R. Kosorok

    Abstract: We propose a new method in indefinite-horizon settings for estimating optimal dynamic treatment regimes for time-to-event outcomes. This method allows patients to have different numbers of treatment stages and is constructed using generalized survival random forests to maximize mean survival time. We use summarized history and data pooling, preventing data from growing in dimension as a patient's… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: 24 pages, 7 figures

  4. arXiv:2501.07789  [pdf

    stat.AP

    Using Statistical Precision Medicine to Identify Optimal Treatments in a Heart Failure Setting

    Authors: Arti Virkud, Jessie K. Edwards, Michele Jonsson Funk, Patricia Chang, Abhijit V. Kshirsagar, Emily W. Gower, Michael R. Kosorok

    Abstract: Identifying optimal medical treatments to improve survival has long been a critical goal of pharmacoepidemiology. Traditionally, we use an average treatment effect measure to compare outcomes between treatment plans. However, new methods leveraging advantages of machine learning combined with the foundational tenets of causal inference are offering an alternative to the average treatment effect. H… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

  5. arXiv:2411.08315  [pdf, other

    stat.ME

    Optimal individualized treatment regimes for survival data with competing risks

    Authors: Christina W. Zhou, Nikki L. B. Freeman, Katharine L. McGinigle, Michael R. Kosorok

    Abstract: Precision medicine leverages patient heterogeneity to estimate individualized treatment regimens, formalized, data-driven approaches designed to match patients with optimal treatments. In the presence of competing events, where multiple causes of failure can occur and one cause precludes others, it is crucial to assess the risk of the specific outcome of interest, such as one type of failure over… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: 22 pages, 4 figures

  6. arXiv:2408.16381  [pdf, other

    stat.ME math.ST

    Uncertainty quantification for intervals

    Authors: Carlos García Meixide, Michael R. Kosorok, Marcos Matabuena

    Abstract: Data following an interval structure are increasingly prevalent in many scientific applications. In medicine, clinical events are often monitored between two clinical visits, making the exact time of the event unknown and generating outcomes with a range format. As interest in automating healthcare decisions grows, uncertainty quantification via predictive regions becomes essential for developing… ▽ More

    Submitted 30 March, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

  7. arXiv:2408.14691  [pdf, other

    stat.ME

    Effects Among the Affected

    Authors: Lina M. Montoya, Elvin H. Geng, Michael Valancius, Michael R. Kosorok, Maya L. Petersen

    Abstract: Many interventions are both beneficial to initiate and harmful to stop. Traditionally, to determine whether to deploy that intervention in a time-limited way depends on if, on average, the increase in the benefits of starting it outweigh the increase in the harms of stopping it. We propose a novel causal estimand that provides a more nuanced understanding of the effects of such treatments, particu… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  8. arXiv:2408.07660  [pdf, other

    stat.ML cs.LG

    Off-Policy Reinforcement Learning with High Dimensional Reward

    Authors: Dong Neuck Lee, Michael R. Kosorok

    Abstract: Conventional off-policy reinforcement learning (RL) focuses on maximizing the expected return of scalar rewards. Distributional RL (DRL), in contrast, studies the distribution of returns with the distributional Bellman operator in a Euclidean space, leading to highly flexible choices for utility. This paper establishes robust theoretical foundations for DRL. We prove the contraction property of th… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 24 pages, 12 figures

    MSC Class: 68T05; 46B09 (Primary) 46B06 (Secondary)

  9. arXiv:2407.00364  [pdf, other

    stat.ME stat.ML

    Medical Knowledge Integration into Reinforcement Learning Algorithms for Dynamic Treatment Regimes

    Authors: Sophia Yazzourh, Nicolas Savy, Philippe Saint-Pierre, Michael R. Kosorok

    Abstract: The goal of precision medicine is to provide individualized treatment at each stage of chronic diseases, a concept formalized by Dynamic Treatment Regimes (DTR). These regimes adapt treatment strategies based on decision rules learned from clinical data to enhance therapeutic effectiveness. Reinforcement Learning (RL) algorithms allow to determine these decision rules conditioned by individual pat… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  10. arXiv:2312.15217  [pdf, other

    stat.ME stat.AP

    Constructing a T-test for Value Function Comparison of Individualized Treatment Regimes in the Presence of Multiple Imputation for Missing Data

    Authors: Minxin Lu, Annie Green Howard, Penny Gordon-Larsen, Katie A. Meyer, Hsiao-Chuan Tien, Shufa Du, Huijun Wang, Bing Zhang, Michael R. Kosorok

    Abstract: Optimal individualized treatment decision-making has improved health outcomes in recent years. The value function is commonly used to evaluate the goodness of an individualized treatment decision rule. Despite recent advances, comparing value functions between different treatment decision rules or constructing confidence intervals around value functions remains difficult. We propose a t-test based… ▽ More

    Submitted 19 April, 2025; v1 submitted 23 December, 2023; originally announced December 2023.

  11. arXiv:2307.12862  [pdf, other

    cs.SI cs.LG stat.CO stat.ML

    Stochastic Step-wise Feature Selection for Exponential Random Graph Models (ERGMs)

    Authors: Helal El-Zaatari, Fei Yu, Michael R Kosorok

    Abstract: Statistical analysis of social networks provides valuable insights into complex network interactions across various scientific disciplines. However, accurate modeling of networks remains challenging due to the heavy computational burden and the need to account for observed network dependencies. Exponential Random Graph Models (ERGMs) have emerged as a promising technique used in social network mod… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 23 pages, 6 tables and 18 figures

  12. arXiv:2307.12022  [pdf, other

    stat.ML cs.LG stat.ME

    A Flexible Framework for Incorporating Patient Preferences Into Q-Learning

    Authors: Joshua P. Zitovsky, Leslie Wilson, Michael R. Kosorok

    Abstract: In real-world healthcare problems, there are often multiple competing outcomes of interest, such as treatment efficacy and side effect severity. However, statistical methods for estimating dynamic treatment regimes (DTRs) usually assume a single outcome of interest, and the few methods that deal with composite outcomes suffer from important limitations. This includes restrictions to a single time… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Under Review

  13. arXiv:2305.08969  [pdf, other

    stat.ME stat.ML

    A Causal Inference Framework for Leveraging External Controls in Hybrid Trials

    Authors: Michael Valancius, Herb Pang, Jiawen Zhu, Stephen R Cole, Michele Jonsson Funk, Michael R Kosorok

    Abstract: We consider the challenges associated with causal inference in settings where data from a randomized trial is augmented with control data from an external source to improve efficiency in estimating the average treatment effect (ATE). Through the development of a formal causal inference framework, we outline sufficient causal assumptions about the exchangeability between the internal and external c… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  14. arXiv:2304.13003  [pdf, other

    stat.ME

    Functional Individualized Treatment Regimes with Imaging Features

    Authors: Xinyi Li, Michael R. Kosorok

    Abstract: Precision medicine seeks to discover an optimal personalized treatment plan and thereby provide informed and principled decision support, based on the characteristics of individual patients. With recent advancements in medical imaging, it is crucial to incorporate patient-specific imaging features in the study of individualized treatment regimes. We propose a novel, data-driven method to construct… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  15. arXiv:2302.00141  [pdf, other

    cs.LG cs.AI stat.ML

    Revisiting Bellman Errors for Offline Model Selection

    Authors: Joshua P. Zitovsky, Daniel de Marchi, Rishabh Agarwal, Michael R. Kosorok

    Abstract: Offline model selection (OMS), that is, choosing the best policy from a set of many policies given only logged data, is crucial for applying offline RL in real-world settings. One idea that has been extensively explored is to select policies based on the mean squared Bellman error (MSBE) of the associated Q-functions. However, previous work has struggled to obtain adequate OMS performance with Bel… ▽ More

    Submitted 6 June, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Published in ICML 2023

    ACM Class: I.2.8; I.6.4

    Journal ref: In ICML (pp. 43369-43406). PMLR (2023)

  16. arXiv:2212.00650  [pdf, other

    stat.ME

    Dynamic treatment regime characterization via value function surrogate with an application to partial compliance

    Authors: Nikki L. B. Freeman, Sydney E. Browder, Katharine L. McGinigle, Michael R. Kosorok

    Abstract: Precision medicine is a promising framework for generating evidence to improve health and health care. Yet, a gap persists between the ever-growing number of statistical precision medicine strategies for evidence generation and implementation in real world clinical settings, and the strategies for closing this gap will likely be context dependent. In this paper, we consider the specific context of… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 18 pages, 7 figures

  17. arXiv:2210.03316  [pdf, other

    stat.ME stat.AP

    Efficient and Robust Approaches for Analysis of SMARTs: Illustration using the ADAPT-R Trial

    Authors: Lina M. Montoya, Michael R. Kosorok, Elvin H. Geng, Joshua Schwab, Thomas A. Odeny, Maya L. Petersen

    Abstract: Personalized intervention strategies, in particular those that modify treatment based on a participant's own response, are a core component of precision medicine approaches. Sequential Multiple Assignment Randomized Trials (SMARTs) are growing in popularity and are specifically designed to facilitate the evaluation of sequential adaptive strategies, in particular those embedded within the SMART. A… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  18. arXiv:2209.08666  [pdf, other

    cs.LG stat.ME

    Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

    Authors: Zuyue Fu, Zhengling Qi, Zhaoran Wang, Zhuoran Yang, Yanxun Xu, Michael R. Kosorok

    Abstract: We study the offline reinforcement learning (RL) in the face of unmeasured confounders. Due to the lack of online interaction with the environment, offline RL is facing the following two significant challenges: (i) the agent may be confounded by the unobserved state variables; (ii) the offline data collected a prior does not provide sufficient coverage for the environment. To tackle the above chal… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

  19. arXiv:2207.12992  [pdf, other

    stat.ME stat.AP

    Risk-Adjusted Incidence Modeling on Hierarchical Survival Data with Recurrent Events

    Authors: Xiaotong Jiang, William Stoudemire, Marianne S. Muhlebach, Michael R. Kosorok

    Abstract: There is a constant need for many healthcare programs to timely address problems with infection prevention and control (IP&C). For example, pathogens can be transmitted among patients with cystic fibrosis (CF) in both the inpatient and outpatient settings within the healthcare system even with the existing recommended IP&C practices, and these pathogens are often associated with negative clinical… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  20. arXiv:2206.06885  [pdf, other

    stat.ML cs.LG stat.ME

    Neural interval-censored survival regression with feature selection

    Authors: Carlos García Meixide, Marcos Matabuena, Louis Abraham, Michael R. Kosorok

    Abstract: Survival analysis is a fundamental area of focus in biomedical research, particularly in the context of personalized medicine. This prominence is due to the increasing prevalence of large and high-dimensional datasets, such as omics and medical image data. However, the literature on non-linear regression algorithms and variable selection techniques for interval-censoring is either limited or non-e… ▽ More

    Submitted 22 August, 2024; v1 submitted 14 June, 2022; originally announced June 2022.

    Journal ref: Statistical Analysis and Data Mining: The ASA Data Science Journal 17.4 (2024):

  21. arXiv:2204.12319  [pdf, other

    math.ST stat.ME stat.ML

    Discussion of Multiscale Fisher's Independence Test for Multivariate Dependence

    Authors: Duyeol Lee, Helal El-Zaatari, Michael R. Kosorok, Xinyi Li, Kai Zhang

    Abstract: The multiscale Fisher's independence test (MULTIFIT hereafter) proposed by Gorsky & Ma (2022) is a novel method to test independence between two random vectors. By its design, this test is particularly useful in detecting local dependence. Moreover, by adopting a resampling-free approach, it can easily accommodate massive sample sizes. Another benefit of the proposed method is its ability to inter… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  22. arXiv:2112.03981  [pdf, other

    stat.ME

    Stabilized Direct Learning for Efficient Estimation of Individualized Treatment Rules

    Authors: Kushal S. Shah, Haoda Fu, Michael R. Kosorok

    Abstract: In recent years, the field of precision medicine has seen many advancements. Significant focus has been placed on creating algorithms to estimate individualized treatment rules (ITR), which map from patient covariates to the space of available treatments with the goal of maximizing patient outcome. Direct Learning (D-Learning) is a recent one-step method which estimates the ITR by directly modelin… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  23. arXiv:2012.03294  [pdf, ps, other

    stat.ME

    Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring

    Authors: Hunyong Cho, Shannon T. Holloway, David J. Couper, Michael R. Kosorok

    Abstract: We propose a reinforcement learning method for estimating an optimal dynamic treatment regime for survival outcomes with dependent censoring. The estimator allows the failure time to be conditionally independent of censoring and dependent on the treatment decision times, supports a flexible number of treatment arms and treatment stages, and can maximize either the mean survival time or the surviva… ▽ More

    Submitted 12 May, 2022; v1 submitted 6 December, 2020; originally announced December 2020.

  24. arXiv:2007.09811  [pdf, ps, other

    stat.ME math.ST stat.AP stat.ML

    Kernel Assisted Learning for Personalized Dose Finding

    Authors: Liangyu Zhu, Wenbin Lu, Michael R. Kosorok, Rui Song

    Abstract: An individualized dose rule recommends a dose level within a continuous safe dose range based on patient level information such as physical conditions, genetic factors and medication histories. Traditionally, personalized dose finding process requires repeating clinical visits of the patient and frequent adjustments of the dosage. Thus the patient is constantly exposed to the risk of underdosing a… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted for KDD 2020

  25. arXiv:2002.10709  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Missing Data Imputation for Classification Problems

    Authors: Arkopal Choudhury, Michael R. Kosorok

    Abstract: Imputation of missing data is a common application in various classification problems where the feature training matrix has missingness. A widely used solution to this imputation problem is based on the lazy learning technique, $k$-nearest neighbor (kNN) approach. However, most of the previous work on missing data does not take into account the presence of the class label in the classification pro… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: 27 pages, 5 figures

  26. arXiv:2001.09930  [pdf, other

    stat.ML cs.LG stat.AP

    Technical Background for "A Precision Medicine Approach to Develop and Internally Validate Optimal Exercise and Weight Loss Treatments for Overweight and Obese Adults with Knee Osteoarthritis"

    Authors: Xiaotong Jiang, Amanda E. Nelson, Rebecca J. Cleveland, Daniel P. Beavers, Todd A. Schwartz, Liubov Arbeeva, Carolina Alvarez, Leigh F. Callahan, Stephen Messier, Richard Loeser, Michael R. Kosorok

    Abstract: We provide additional statistical background for the methodology developed in the clinical analysis of knee osteoarthritis in "A Precision Medicine Approach to Develop and Internally Validate Optimal Exercise and Weight Loss Treatments for Overweight and Obese Adults with Knee Osteoarthritis" (Jiang et al. 2020). Jiang et al. 2020 proposed a pipeline to learn optimal treatment rules with precision… ▽ More

    Submitted 20 February, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

  27. arXiv:2001.09887  [pdf, other

    stat.ME cs.LG stat.ML

    Estimating heterogeneous treatment effects with right-censored data via causal survival forests

    Authors: Yifan Cui, Michael R. Kosorok, Erik Sverdrup, Stefan Wager, Ruoqing Zhu

    Abstract: Forest-based methods have recently gained in popularity for non-parametric treatment effect estimation. Building on this line of work, we introduce causal survival forests, which can be used to estimate heterogeneous treatment effects in a survival and observational setting where outcomes may be right-censored. Our approach relies on orthogonal estimating equations to robustly adjust for both cens… ▽ More

    Submitted 28 February, 2023; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: To appear in the Journal of the Royal Statistical Society, Series B

    MSC Class: 62N01

  28. arXiv:1912.09983  [pdf, other

    stat.ME

    Interval censored recursive forests

    Authors: Hunyong Cho, Nicholas P. Jewell, Michael R. Kosorok

    Abstract: We propose the interval censored recursive forests (ICRF) which is an iterative tree ensemble method for interval censored survival data. This nonparametric regression estimator makes the best use of censored information by iteratively updating the survival estimate, and can be viewed as a self-consistent estimator with convergence monitored using out-of-bag samples. Splitting rules optimized for… ▽ More

    Submitted 20 May, 2021; v1 submitted 20 December, 2019; originally announced December 2019.

  29. arXiv:1912.06667  [pdf, other

    stat.ML cs.LG q-bio.GN stat.AP stat.ME

    High dimensional precision medicine from patient-derived xenografts

    Authors: Naim U. Rashid, Daniel J. Luckett, Jingxiang Chen, Michael T. Lawson, Longshaokan Wang, Yunshu Zhang, Eric B. Laber, Yufeng Liu, Jen Jen Yeh, Donglin Zeng, Michael R. Kosorok

    Abstract: The complexity of human cancer often results in significant heterogeneity in response to treatment. Precision medicine offers potential to improve patient outcomes by leveraging this heterogeneity. Individualized treatment rules (ITRs) formalize precision medicine as maps from the patient covariate space into the space of allowable treatments. The optimal ITR is that which maximizes the mean of a… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  30. arXiv:1912.03662  [pdf, other

    math.ST stat.CO stat.ME stat.ML

    The Binary Expansion Randomized Ensemble Test (BERET)

    Authors: Duyeol Lee, Kai Zhang, Michael R. Kosorok

    Abstract: Recently, the binary expansion testing framework was introduced to test the independence of two continuous random variables by utilizing symmetry statistics that are complete sufficient statistics for dependence. We develop a new test based on an ensemble approach that uses the sum of squared symmetry statistics and distance correlation. Simulation studies suggest that this method improves the pow… ▽ More

    Submitted 7 January, 2021; v1 submitted 8 December, 2019; originally announced December 2019.

  31. arXiv:1911.05728  [pdf, other

    stat.ME

    Balanced Policy Evaluation and Learning for Right Censored Data

    Authors: Owen E. Leete, Nathan Kallus, Michael G. Hudgens, Sonia Napravnik, Michael R. Kosorok

    Abstract: Individualized treatment rules can lead to better health outcomes when patients have heterogeneous responses to treatment. Very few individualized treatment rule estimation methods are compatible with a multi-treatment observational study with right censored survival outcomes. In this paper we extend policy evaluation methods to the right censored data setting. Existing approaches either make rest… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  32. arXiv:1906.06646  [pdf, other

    stat.ME

    Sample Size Calculations for SMARTs

    Authors: Eric J. Rose, Eric B. Laber, Marie Davidian, Anastasios A. Tsiatis, Ying-Qi Zhao, Michael R. Kosorok

    Abstract: Sequential Multiple Assignment Randomized Trials (SMARTs) are considered the gold standard for estimation and evaluation of treatment regimes. SMARTs are typically sized to ensure sufficient power for a simple comparison, e.g., the comparison of two fixed treatment sequences. Estimation of an optimal treatment regime is conducted as part of a secondary and hypothesis-generating analysis with forma… ▽ More

    Submitted 16 June, 2019; originally announced June 2019.

  33. arXiv:1904.12652  [pdf

    q-bio.GN stat.AP stat.ME

    Genome analysis and pleiotropy assessment using causal networks with loss of function mutation and metabolomics

    Authors: Azam Yazdani, Akram Yazdani, Sarah H. Elsea, Daniel J. Schaid, Michael R. Kosorok, Gita Dangol, Ahmad Samiei

    Abstract: Background: Many genome-wide association studies have detected genomic regions associated with traits, yet understanding the functional causes of association often remains elusive. Utilizing systems approaches and focusing on intermediate molecular phenotypes might facilitate biologic understanding. Results: The availability of exome sequencing of two populations of African-Americans and European-… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

  34. arXiv:1902.05499  [pdf, other

    stat.AP cs.LG stat.ME stat.ML

    Estimating Individualized Treatment Regimes from Crossover Designs

    Authors: Crystal T. Nguyen, Daniel J. Luckett, Anna R. Kahkoska, Grace E. Shearrer, Donna Spruijt-Metz, Jaimie N. Davis, Michael R. Kosorok

    Abstract: The field of precision medicine aims to tailor treatment based on patient-specific factors in a reproducible way. To this end, estimating an optimal individualized treatment regime (ITR) that recommends treatment decisions based on patient characteristics to maximize the mean of a pre-specified outcome is of particular interest. Several methods have been proposed for estimating an optimal ITR from… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  35. arXiv:1807.06711  [pdf, ps, other

    stat.ML cs.LG

    Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines

    Authors: Daniel J. Luckett, Eric B. Laber, Samer S. El-Kamary, Cheng Fan, Ravi Jhaveri, Charles M. Perou, Fatma M. Shebl, Michael R. Kosorok

    Abstract: Many problems that appear in biomedical decision making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The costs of false positives and false negatives vary across application domains and receiver operating characteristic (ROC) curves provide a visual representation of this trade-off. Nonparametric estimators for the ROC curve,… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

  36. arXiv:1804.00096  [pdf, other

    stat.ME

    A proportional hazards model for interval-censored data subject to instantaneous failures

    Authors: Prabhashi W. Withana Gamage, Monica Chaudari, Christopher S. McMahan, Michael R. Kosorok

    Abstract: The proportional hazards (PH) model is arguably one of the most popular models used to analyze time to event data arising from clinical trials and longitudinal studies, among many others. In many such studies, the event time of interest is not directly observed but is known relative to periodic examination times; i.e., practitioners observe either current status or interval-censored data. The anal… ▽ More

    Submitted 3 April, 2018; v1 submitted 30 March, 2018; originally announced April 2018.

  37. arXiv:1711.10654  [pdf, ps, other

    stat.ME

    Augmented Outcome-weighted Learning for Optimal Treatment Regimes

    Authors: Xin Zhou, Michael R. Kosorok

    Abstract: Precision medicine is of considerable interest in clinical, academic and regulatory parties. The key to precision medicine is the optimal treatment regime. Recently, Zhou et al. (2017) developed residual weighted learning (RWL) to construct the optimal regime that directly optimize the clinical outcome. However, this method involves computationally intensive non-convex optimization, which cannot g… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

  38. arXiv:1711.10581  [pdf, ps, other

    stat.ML

    Estimation and Optimization of Composite Outcomes

    Authors: Daniel J. Luckett, Eric B. Laber, Michael R. Kosorok

    Abstract: There is tremendous interest in precision medicine as a means to improve patient outcomes by tailoring treatment to individual characteristics. An individualized treatment rule formalizes precision medicine as a map from patient information to a recommended treatment. A treatment rule is defined to be optimal if it maximizes the mean of a scalar outcome in a population of interest, e.g., symptom r… ▽ More

    Submitted 26 May, 2020; v1 submitted 28 November, 2017; originally announced November 2017.

  39. arXiv:1711.08451  [pdf, ps, other

    stat.ML

    Causal nearest neighbor rules for optimal treatment regimes

    Authors: Xin Zhou, Michael R. Kosorok

    Abstract: The estimation of optimal treatment regimes is of considerable interest to precision medicine. In this work, we propose a causal $k$-nearest neighbor method to estimate the optimal treatment regime. The method roots in the framework of causal inference, and estimates the causal treatment effects within the nearest neighborhood. Although the method is simple, it possesses nice theoretical propertie… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

  40. arXiv:1706.01426  [pdf, other

    stat.ME

    Double Sparsity Kernel Learning with Automatic Variable Selection and Data Extraction

    Authors: Jingxiang Chen, Chong Zhang, Michael R. Kosorok, Yufeng Liu

    Abstract: Learning with Reproducing Kernel Hilbert Spaces (RKHS) has been widely used in many scientific disciplines. Because a RKHS can be very flexible, it is common to impose a regularization term in the optimization to prevent overfitting. Standard RKHS learning employs the squared norm penalty of the learning function. Despite its success, many challenges remain. In particular, one cannot directly use… ▽ More

    Submitted 5 June, 2017; originally announced June 2017.

  41. arXiv:1702.04755  [pdf, other

    stat.ME

    Estimating Individualized Treatment Rules for Ordinal Treatments

    Authors: Jingxiang Chen, Haoda Fu, Xuanyao He, Michael R. Kosorok, Yufeng Liu

    Abstract: Precision medicine is an emerging scientific topic for disease treatment and prevention that takes into account individual patient characteristics. It is an important direction for clinical research, and many statistical methods have been recently proposed. One of the primary goals of precision medicine is to obtain an optimal individual treatment rule (ITR), which can help make decisions on treat… ▽ More

    Submitted 15 February, 2017; originally announced February 2017.

  42. arXiv:1611.03531  [pdf, ps, other

    stat.ML

    Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning

    Authors: Daniel J. Luckett, Eric B. Laber, Anna R. Kahkoska, David M. Maahs, Elizabeth Mayer-Davis, Michael R. Kosorok

    Abstract: The vision for precision medicine is to use individual patient characteristics to inform a personalized treatment plan that leads to the best healthcare possible for each patient. Mobile technologies have an important role to play in this vision as they offer a means to monitor a patient's health status in real-time and subsequently to deliver interventions if, when, and in the dose that they are… ▽ More

    Submitted 14 October, 2017; v1 submitted 10 November, 2016; originally announced November 2016.

  43. arXiv:1611.02314  [pdf, other

    stat.ME

    Robust Hybrid Learning for Estimating Personalized Dynamic Treatment Regimens

    Authors: Ying Liu, Yuanjia Wang, Michael R. Kosorok, Yingqi Zhao, Donglin Zeng

    Abstract: Dynamic treatment regimens (DTRs) are sequential decision rules tailored at each stage by potentially time-varying patient features and intermediate outcomes observed in previous stages. The complexity, patient heterogeneity and chronicity of many diseases and disorders call for learning optimal DTRs which best dynamically tailor treatment to each individual's response over time. Proliferation of… ▽ More

    Submitted 7 November, 2016; originally announced November 2016.

  44. arXiv:1508.03179  [pdf, ps, other

    stat.ME

    Residual Weighted Learning for Estimating Individualized Treatment Rules

    Authors: Xin Zhou, Nicole Mayer-Hamblett, Umer Khan, Michael R. Kosorok

    Abstract: Personalized medicine has received increasing attention among statisticians, computer scientists, and clinical practitioners. A major component of personalized medicine is the estimation of individualized treatment rules (ITRs). Recently, Zhao et al. (2012) proposed outcome weighted learning (OWL) to construct ITRs that directly optimize the clinical outcome. Although OWL opens the door to introdu… ▽ More

    Submitted 13 August, 2015; originally announced August 2015.

    Comments: 48 pages, 3 figures

  45. arXiv:1407.3010  [pdf, other

    stat.ME stat.ML

    Biclustering Via Sparse Clustering

    Authors: Qian Liu, Guanhua Chen, Michael R. Kosorok, Eric Bair

    Abstract: In many situations it is desirable to identify clusters that differ with respect to only a subset of features. Such clusters may represent homogeneous subgroups of patients with a disease, such as cancer or chronic pain. We define a bicluster to be a submatrix U of a larger data matrix X such that the features and observations in U differ from those not contained in U. For example, the observation… ▽ More

    Submitted 10 July, 2014; originally announced July 2014.

    Comments: 40 pages, 8 figures, 10 tables

  46. arXiv:1202.5130  [pdf, other

    stat.ML math.ST

    Support Vector Regression for Right Censored Data

    Authors: Yair Goldberg, Michael R. Kosorok

    Abstract: We develop a unified approach for classification and regression support vector machines for data subject to right censoring. We provide finite sample bounds on the generalization error of the algorithm, prove risk consistency for a wide class of probability measures, and study the associated learning rates. We apply the general methodology to estimation of the (truncated) mean, median, quantiles,… ▽ More

    Submitted 12 January, 2013; v1 submitted 23 February, 2012; originally announced February 2012.

    Comments: In this version, we strengthened the theoretical results and corrected a few mistakes

  47. arXiv:1108.5338  [pdf, ps, other

    stat.ME

    Penalized Q-Learning for Dynamic Treatment Regimes

    Authors: Rui Song, Weiwei Wang, Donglin Zeng, Michael R. Kosorok

    Abstract: A dynamic treatment regime effectively incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these become more and more popular in conjunction with longitudinal data from clinical studies, the development of statistical inference for optimal dynamic treatment regimes is a high priority. This is very challenging due to the difficulties… ▽ More

    Submitted 26 August, 2011; originally announced August 2011.

  48. Discussion of: Brownian distance covariance

    Authors: Michael R. Kosorok

    Abstract: We discuss briefly the very interesting concept of Brownian distance covariance developed by Székely and Rizzo [Ann. Appl. Statist. (2009), to appear] and describe two possible extensions. The first extension is for high dimensional data that can be coerced into a Hilbert space, including certain high throughput screening and functional data settings. The second extension involves very simple modi… ▽ More

    Submitted 9 December, 2013; v1 submitted 5 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOAS312B the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org). With Corrections

    Report number: IMS-AOAS-AOAS312B

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 4, 1270-1278