Skip to main content

Showing 1–28 of 28 results for author: Daniels, M J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.21719  [pdf, ps, other

    stat.ME

    A new algorithm for sampling parameters in a structured correlation matrix with application to estimating optimal combinations of muscles to quantify progression in Duchenne muscular dystrophy

    Authors: Michael K. Kim, Michael J. Daniels, William D. Rooney, Rebecca J. Willcocks, Glenn A. Walter, Krista H. Vandenborne

    Abstract: The goal of this paper is to estimate an optimal combination of biomarkers for individuals with Duchenne muscular dystrophy (DMD), which provides the most sensitive combinations of biomarkers to assess disease progression (in this case, optimal with respect to standardized response mean (SRM) for 4 muscle biomarkers). The biomarker data is an incomplete (missing and irregular) multivariate longitu… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  2. arXiv:2506.20058  [pdf, ps, other

    stat.ME stat.AP

    Causal mediation analysis for longitudinal and survival data in continuous time using Bayesian non-parametric joint models

    Authors: Saurabh Bhandari, Michael J. Daniels, Juned Siddique

    Abstract: Observational cohort data is an important source of information for understanding the causal effects of treatments on survival and the degree to which these effects are mediated through changes in disease-related risk factors. However, these analyses are often complicated by irregular data collection intervals and the presence of longitudinal confounders and mediators. We propose a causal mediatio… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  3. arXiv:2506.19066  [pdf, ps, other

    stat.ME

    A Bayesian approach for unadjudicated events in cardiovascular disease cohort studies

    Authors: Mirajul Islam, Michael J. Daniels, Donald Lloyd-Jones, Juned Siddique

    Abstract: An important issue in joint modelling for outcomes and longitudinal risk factors in cohort studies is to have an accurate assessment of events. Events determined based on ICD-9 codes can be very inaccurate, in particular for cardiovascular disease (CVD) where ICD-9 codes may overestimate the frequency of CVD. Motivated by the lack of adjudicated events in the Established Populations for Epidemiolo… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  4. arXiv:2506.17835  [pdf, ps, other

    stat.ME

    Personalized feature threshold estimation in joint modelling of longitudinal and time-to-event data

    Authors: Mirajul Islam, Michael J. Daniels, Juned Siddique

    Abstract: Cardiovascular disease (CVD) cohort studies collect longitudinal data on numerous CVD risk factors including body mass index (BMI), systolic blood pressure (SBP), diastolic blood pressure (DBP), glucose, and total cholesterol. The commonly used threshold values for identifying subjects at high risk are 30 kg/$m^2$ for BMI, 120 mmHg for SBP, 80 mmHg for DBP, 126 mg/dL for glucose, and 230 mg/dL for… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  5. arXiv:2506.07387  [pdf, other

    stat.ME

    Integrating tumor burden with survival outcome for treatment effect evaluation in oncology trials

    Authors: Saurabh Bhandari, Michael J. Daniels, Chenguang Wang

    Abstract: In early-phase cancer clinical trials, the limited availability of data presents significant challenges in developing a framework to efficiently quantify treatment effectiveness. To address this, we propose a novel utility-based Bayesian approach for assessing treatment effects in these trials, where data scarcity is a major concern. Our approach synthesizes tumor burden, a key biomarker for evalu… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

  6. arXiv:2503.17606  [pdf, other

    stat.ME stat.AP

    Combining longitudinal cohort studies to examine cardiovascular risk factor trajectories across the adult lifespan

    Authors: Zeynab Aghabazaz, Michael J Daniels, Hongyan Ning, Juned Siddique

    Abstract: We introduce a statistical framework for combining data from multiple large longitudinal cardiovascular cohorts to enable the study of long-term cardiovascular health starting in early adulthood. Using data from seven cohorts belonging to the Lifetime Risk Pooling Project (LRPP), we present a Bayesian hierarchical multivariate approach that jointly models multiple longitudinal risk factors over ti… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  7. arXiv:2503.17576  [pdf, other

    stat.ME stat.AP

    A Joint Model of Longitudinal CVD Risk Factors, Medication Use, and Time-to-Terminal Events

    Authors: Zeynab Aghabazaz, Michael J Daniels, Donald M Lloyd-Jones, Juned Siddique

    Abstract: We introduce a novel Bayesian approach for jointly modeling longitudinal cardiovascular disease (CVD) risk factor trajectories, medication use, and time-to-events. Our methodology incorporates longitudinal risk factor trajectories into the time-to-event model, considers the temporal aspect of medication use, incorporates uncertainty due to missing medication status and medication switching, and an… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  8. arXiv:2412.00926  [pdf, other

    stat.ME

    A sensitivity analysis approach to principal stratification with a continuous longitudinal intermediate outcome: Applications to a cohort stepped wedge trial

    Authors: Lei Yang, Michael J. Daniels, Fan Li

    Abstract: Causal inference in the presence of intermediate variables is a challenging problem in many applications. Principal stratification (PS) provides a framework to estimate principal causal effects (PCE) in such settings. However, existing PS methods primarily focus on settings with binary intermediate variables. We propose a novel approach to estimate PCE with continuous intermediate variables in the… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  9. arXiv:2412.00885  [pdf, ps, other

    stat.ME

    Bayesian feature selection in joint models with application to a cardiovascular disease cohort study

    Authors: Mirajul Islam, Michael J. Daniels, Zeynab Aghabazaz, Juned Siddique

    Abstract: Cardiovascular disease (CVD) cohorts collect data longitudinally to study the association between CVD risk factors and event times. An important area of scientific research is to better understand what features of CVD risk factor trajectories are associated with the disease. We develop methods for feature selection in joint models where feature selection is viewed as a bi-level variable selection… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  10. arXiv:2411.18739  [pdf, ps, other

    stat.ME stat.AP

    A Bayesian semi-parametric approach to causal mediation for longitudinal mediators and time-to-event outcomes with application to a cardiovascular disease cohort study

    Authors: Saurabh Bhandari, Michael J. Daniels, Maria Josefsson, Donald M. Lloyd-Jones, Juned Siddique

    Abstract: Causal mediation analysis of observational data is an important tool for investigating the potential causal effects of medications on disease-related risk factors, and on time-to-death (or disease progression) through these risk factors. However, when analyzing data from a cohort study, such analyses are complicated by the longitudinal structure of the risk factors and the presence of time-varying… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  11. arXiv:2305.05099  [pdf, ps, other

    stat.ME stat.AP

    Dirichlet process mixture models for the Analysis of Repeated Attempt Designs

    Authors: Michael J. Daniels, Minji Lee, Wei Feng

    Abstract: In longitudinal studies, it is not uncommon to make multiple attempts to collect a measurement after baseline. Recording whether these attempts are successful provides useful information for the purposes of assessing missing data assumptions. This is because measurements from subjects who provide the data after numerous failed attempts may differ from those who provide the measurement after fewer… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 24 pages, additional 16 pages of supplementary material

  12. arXiv:2305.05017  [pdf, ps, other

    stat.ME stat.AP

    A Bayesian Non-parametric Approach for Causal Mediation with a Post-treatment Confounder

    Authors: Woojung Bae, Michael J. Daniels, Michael G. Perri

    Abstract: We propose a new Bayesian non-parametric (BNP) method for estimating the causal effects of mediation in the presence of a post-treatment confounder. We specify an enriched Dirichlet process mixture (EDPM) to model the joint distribution of the observed data (outcome, mediator, post-treatment confounders, treatment, and baseline confounders). The proposed BNP model allows more confounder-based clus… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  13. arXiv:2305.01631  [pdf, other

    stat.CO

    Truncation Approximation for Enriched Dirichlet Process Mixture Models

    Authors: Natalie Burns, Michael J. Daniels

    Abstract: Enriched Dirichlet process mixture (EDPM) models are Bayesian nonparametric models which can be used for nonparametric regression and conditional density estimation and which overcome a key disadvantage of jointly modeling the response and predictors as a Dirichlet process mixture (DPM) model: when there is a large number of predictors, the clusters induced by the DPM will be overwhelmingly determ… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  14. arXiv:2208.13382  [pdf, other

    stat.ME stat.AP stat.OT

    A Bayesian nonparametric approach for causal inference with multiple mediators

    Authors: Samrat Roy, Michael J. Daniels, Brendan J. Kelly, Jason Roy

    Abstract: Mediation analysis with contemporaneously observed multiple mediators is an important area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allow estimation of the joint mediation effect, or, estimate the joint mediation effect as the sum of individua… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    ACM Class: G.3

  15. arXiv:2208.09869  [pdf, other

    stat.ME

    Flexible evaluation of surrogacy in Bayesian adaptive platform studies

    Authors: Michael C Sachs, Erin E Gabriel, Alessio Crippa, Michael J Daniels

    Abstract: Trial level surrogates are useful tools for improving the speed and cost effectiveness of trials, but surrogates that have not been properly evaluated can cause misleading results. The evaluation procedure is often contextual and depends on the type of trial setting. There have been many proposed methods for trial level surrogate evaluation, but none, to our knowledge, for the specific setting of… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: 21 pages, 4 figures

  16. arXiv:2201.03077  [pdf, other

    stat.ME

    Information Borrowing in Regression Models

    Authors: Amy Zhang, Le Bao, Michael J. Daniels

    Abstract: Model development often takes data structure, subject matter considerations, model assumptions, and goodness of fit into consideration. To diagnose issues with any of these factors, it can be helpful to understand regression model estimates at a more granular level. We propose a new method for decomposing point estimates from a regression model via weights placed on data clusters. The weights are… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

  17. arXiv:2112.13998  [pdf, other

    stat.ME stat.AP

    Variable Selection Using Bayesian Additive Regression Trees

    Authors: Chuji Luo, Michael J. Daniels

    Abstract: Variable selection is an important statistical problem. This problem becomes more challenging when the candidate predictors are of mixed type (e.g. continuous and binary) and impact the response variable in nonlinear and/or non-additive ways. In this paper, we review existing variable selection approaches for the Bayesian additive regression trees (BART) model, a nonparametric regression model, wh… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Comments: 40 pages, 13 figures

  18. arXiv:2106.14599  [pdf, other

    stat.CO stat.ME

    BNPqte: A Bayesian Nonparametric Approach to Causal Inference on Quantiles in R

    Authors: Chuji Luo, Michael J. Daniels

    Abstract: In this article, we introduce the BNPqte R package which implements the Bayesian nonparametric approach of Xu, Daniels and Winterstein (2018) for estimating quantile treatment effects in observational studies. This approach provides flexible modeling of the distributions of potential outcomes, so it is capable of capturing a variety of underlying relationships among the outcomes, treatments and co… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 44 pages, 13 figures

  19. arXiv:2101.06823  [pdf, other

    stat.ME cs.LG stat.ML

    Inference for BART with Multinomial Outcomes

    Authors: Yizhen Xu, Joseph W. Hogan, Michael J. Daniels, Rami Kantor, Ann Mwangi

    Abstract: The multinomial probit Bayesian additive regression trees (MPBART) framework was proposed by Kindo et al. (KD), approximating the latent utilities in the multinomial probit (MNP) model with BART (Chipman et al. 2010). Compared to multinomial logistic models, MNP does not assume independent alternatives and the correlation structure among alternatives can be specified through multivariate Gaussian… ▽ More

    Submitted 12 August, 2022; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: 23 pages, 12 tables, 6 figures, with appendix, 49 pages total

  20. arXiv:2011.14238  [pdf, other

    stat.ML cs.LG stat.CO

    Approximate Cross-validated Mean Estimates for Bayesian Hierarchical Regression Models

    Authors: Amy X. Zhang, Le Bao, Changcheng Li, Michael J. Daniels

    Abstract: We introduce a novel procedure for obtaining cross-validated predictive estimates for Bayesian hierarchical regression models (BHRMs). Bayesian hierarchical models are popular for their ability to model complex dependence structures and provide probabilistic uncertainty estimates, but can be computationally expensive to run. Cross-validation (CV) is therefore not a common practice to evaluate the… ▽ More

    Submitted 27 September, 2024; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: 25 pages, 2 figures

    Journal ref: Journal of Computational and Graphical Statistics (2024) 1-17

  21. arXiv:2011.12345  [pdf, ps, other

    stat.ME stat.AP

    A Bayesian semi-parametric approach for inference on the population partly conditional mean from longitudinal data with dropout

    Authors: Maria Josefsson, Michael J. Daniels, Sara Pudas

    Abstract: Studies of memory trajectories using longitudinal data often result in highly non-representative samples due to selective study enrollment and attrition. An additional bias comes from practice effects that result in improved or maintained performance due to familiarity with test content or context. These challenges may bias study findings and severely distort the ability to generalize to the targe… ▽ More

    Submitted 22 March, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

  22. arXiv:2011.00404  [pdf, other

    stat.ME

    Informed Pooled Testing with Quantitative Assays

    Authors: Tao Liu, Joseph W Hogan, Wanning Su, Yizhen Xu, Michael J Daniels, Kantor Rami

    Abstract: Pooled testing is widely used for screening for viral or bacterial infections with low prevalence when individual testing is not cost-efficient. Pooled testing with qualitative assays that give binary results has been well-studied. However, characteristics of pooling with quantitative assays were mostly demonstrated using simulations or empirical studies. We investigate properties of three pooling… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

  23. arXiv:1902.10787  [pdf, ps, other

    stat.AP

    Bayesian semi-parametric G-computation for causal inference in a cohort study with MNAR dropout and death

    Authors: Maria Josefsson, Michael J. Daniels

    Abstract: Causal inference with observational longitudinal data and time-varying exposures is often complicated by time-dependent confounding and attrition. The G-computation formula is one approach for estimating a causal effect in this setting. The parametric modeling approach typically used in practice relies on strong modeling assumptions for valid inference, and moreover depends on an assumption of mis… ▽ More

    Submitted 12 October, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

  24. arXiv:1901.00908  [pdf, other

    stat.ME

    Bayesian Longitudinal Causal Inference in the Analysis of the Public Health Impact of Pollutant Emissions

    Authors: Chanmin Kim, Corwin M Zigler, Michael J Daniels, Christine Choirat, Jason A Roy

    Abstract: Pollutant emissions from coal-burning power plants have been deemed to adversely impact ambient air quality and public health conditions. Despite the noticeable reduction in emissions and the improvement of air quality since the Clean Air Act (CAA) became the law, the public-health benefits from changes in emissions have not been widely evaluated yet. In terms of the chain of accountability (HEI A… ▽ More

    Submitted 3 January, 2019; originally announced January 2019.

  25. arXiv:1812.06507  [pdf, ps, other

    stat.ML cs.LG

    Classification using Ensemble Learning under Weighted Misclassification Loss

    Authors: Yizhen Xu, Tao Liu, Michael J. Daniels, Rami Kantor, Ann Mwangi, Joseph W. Hogan

    Abstract: Binary classification rules based on covariates typically depend on simple loss functions such as zero-one misclassification. Some cases may require more complex loss functions. For example, individual-level monitoring of HIV-infected individuals on antiretroviral therapy (ART) requires periodic assessment of treatment failure, defined as having a viral load (VL) value above a certain threshold. I… ▽ More

    Submitted 10 May, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

    Comments: 23 pages, 4 tables, 4 figures

    Journal ref: Statistics in Medicine 2019, Vol. 38, Issue 11, Pg. 2002-2012

  26. arXiv:1805.07147  [pdf, other

    stat.ME

    A Bayesian Parametric Approach to Handle Missing Longitudinal Outcome Data in Trial-Based Health Economic Evaluations

    Authors: Andrea Gabrio, Michael J. Daniels, Gianluca Baio

    Abstract: Trial-based economic evaluations are typically performed on cross-sectional variables, derived from the responses for only the completers in the study, using methods that ignore the complexities of utility and cost data (e.g. skewness and spikes). We present an alternative and more efficient Bayesian parametric approach to handle missing longitudinal outcomes in economic evaluations, while account… ▽ More

    Submitted 18 May, 2018; originally announced May 2018.

  27. arXiv:1702.08496  [pdf, other

    stat.ME

    Bayesian nonparametric generative models for causal inference with missing at random covariates

    Authors: Jason Roy, Kirsten J Lum, Michael J. Daniels, Bret Zeldow, Jordan Dworkin, Vincent Lo Re III

    Abstract: We propose a general Bayesian nonparametric (BNP) approach to causal inference in the point treatment setting. The joint distribution of the observed data (outcome, treatment, and confounders) is modeled using an enriched Dirichlet process. The combination of the observed data model and causal assumptions allows us to identify any type of causal effect - differences, ratios, or quantile effects, e… ▽ More

    Submitted 27 February, 2017; originally announced February 2017.

  28. arXiv:1507.01825  [pdf, other

    stat.ME

    Comparing Biomarkers as Trial Level General Surrogates

    Authors: Erin E. Gabriel, Michael J. Daniels, M. Elizabeth Halloran

    Abstract: An intermediate response measure that accurately predicts efficacy in a new setting can reduce trial cost and time to product licensure. In this paper, we define a trial level general surrogate as a trial level intermediate response that accurately predicts trial level clinical responses. Methods for evaluating trial level general surrogates have been developed previously. Many methods in the lite… ▽ More

    Submitted 7 July, 2015; originally announced July 2015.