-
A Bayesian approach to the survivor average causal effect in cluster-randomized crossover trials
Authors:
Dane Isenberg,
Michael O. Harhay,
Andrew B. Forbes,
Paul J. Young,
Fan Li,
Nandita Mitra
Abstract:
In cluster-randomized crossover (CRXO) trials, groups of individuals are randomly assigned to two or more sequences of alternating treatments. Since clusters act as their own control, the CRXO design is typically more statistically efficient than the usual parallel-arm trial. CRXO trials are increasingly popular in many areas of health research where the number of available clusters is limited. Fu…
▽ More
In cluster-randomized crossover (CRXO) trials, groups of individuals are randomly assigned to two or more sequences of alternating treatments. Since clusters act as their own control, the CRXO design is typically more statistically efficient than the usual parallel-arm trial. CRXO trials are increasingly popular in many areas of health research where the number of available clusters is limited. Further, in trials among severely ill patients, researchers often want to assess the effect of treatments on secondary non-terminal outcomes, but frequently in these studies, there are patients who do not survive to have these measurements fully recorded. In this paper, we provide a causal inference framework and treatment effect estimation methods for addressing truncation by death in the setting of CRXO trials. We target the survivor average causal effect (SACE) estimand, a well-defined subgroup treatment effect obtained via principal stratification. We propose novel structural and standard modeling assumptions to enable SACE identification followed by estimation within a Bayesian paradigm. We evaluate the small-sample performance of our proposed Bayesian approach for the estimation of the SACE in CRXO trial settings via simulation studies. We apply our methods to a previously conducted two-period cross-sectional CRXO study examining the impact of proton pump inhibitors compared to histamine-2 receptor blockers on length of hospitalization among adults requiring invasive mechanical ventilation.
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
Bayesian inference for cluster-randomized trials with multivariate outcomes subject to both truncation by death and missingness
Authors:
Guangyu Tong,
Chenxi Li,
Eric Velazquez,
Michael O. Harhay,
Fan Li
Abstract:
Cluster-randomized trials (CRTs) on fragile populations frequently encounter complex attrition problems where the reasons for missing outcomes can be heterogeneous, with participants who are known alive, known to have died, or with unknown survival status, and with complex and distinct missing data mechanisms for each group. Although existing methods have been developed to address death truncation…
▽ More
Cluster-randomized trials (CRTs) on fragile populations frequently encounter complex attrition problems where the reasons for missing outcomes can be heterogeneous, with participants who are known alive, known to have died, or with unknown survival status, and with complex and distinct missing data mechanisms for each group. Although existing methods have been developed to address death truncation in CRTs, no existing methods can jointly accommodate participants who drop out for reasons unrelated to mortality or serious illnesses, or those with an unknown survival status. This paper proposes a Bayesian framework for estimating survivor average causal effects in CRTs while accounting for different types of missingness. Our approach uses a multivariate outcome that jointly estimates the causal effects, and in the posterior estimates, we distinguish the individual-level and the cluster-level survivor average causal effect. We perform simulation studies to evaluate the performance of our model and found low bias and high coverage on key parameters across several different scenarios. We use data from a geriatric CRT to illustrate the use of our model. Although our illustration focuses on the case of a bivariate continuous outcome, our model is straightforwardly extended to accommodate more than two endpoints as well as other types of endpoints (e.g., binary). Thus, this work provides a general modeling framework for handling complex missingness in CRTs and can be applied to a wide range of settings with aging and palliative care populations.
△ Less
Submitted 4 May, 2025;
originally announced May 2025.
-
What is estimated in cluster randomized crossover trials with informative sizes? -- A survey of estimands and common estimators
Authors:
Kenneth M. Lee,
Andrew B. Forbes,
Jessica Kasza,
Andrew Copas,
Brennan C. Kahan,
Paul J. Young,
Michael O. Harhay,
Fan Li
Abstract:
In the analysis of cluster randomized trials (CRTs), previous work has defined two meaningful estimands: the individual-average treatment effect (iATE) and cluster-average treatment effect (cATE) estimand, to address individual and cluster-level hypotheses. In multi-period CRT designs, such as the cluster randomized crossover (CRXO) trial, additional weighted average treatment effect estimands hel…
▽ More
In the analysis of cluster randomized trials (CRTs), previous work has defined two meaningful estimands: the individual-average treatment effect (iATE) and cluster-average treatment effect (cATE) estimand, to address individual and cluster-level hypotheses. In multi-period CRT designs, such as the cluster randomized crossover (CRXO) trial, additional weighted average treatment effect estimands help fully reflect the longitudinal nature of these trial designs, namely the cluster-period-average treatment effect (cpATE) and period-average treatment effect (pATE). We define different forms of informative sizes, where the treatment effects vary according to cluster, period, and/or cluster-period sizes, which subsequently cause these estimands to differ in magnitude. Under such conditions, we demonstrate which of the unweighted, inverse cluster-period size weighted, inverse cluster size weighted, and inverse period size weighted: (i.) independence estimating equation, (ii.) fixed effects model, (iii.) exchangeable mixed effects model, and (iv.) nested exchangeable mixed effects model treatment effect estimators are consistent for the aforementioned estimands in 2-period cross-sectional CRXO designs with continuous outcomes. We report a simulation study and conclude with a reanalysis of a CRXO trial testing different treatments on hospital length of stay among patients receiving invasive mechanical ventilation. Notably, with informative sizes, the unweighted and weighted nested exchangeable mixed effects model estimators are not consistent for any meaningful estimand and can yield biased results. In contrast, the unweighted and weighted independence estimating equation, and under specific scenarios, the fixed effects model and exchangeable mixed effects model, can yield consistent and empirically unbiased estimators for meaningful estimands in 2-period CRXO trials.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
Time-varying treatment effect models in stepped-wedge cluster-randomized trials with multiple interventions
Authors:
Zhe Chen,
Wei Wang,
Yingying Lu,
Scott D. Halpern,
Katherine R. Courtright,
Fan Li,
Michael O. Harhay
Abstract:
The traditional model specification of stepped-wedge cluster-randomized trials assumes a homogeneous treatment effect across time while adjusting for fixed-time effects. However, when treatment effects vary over time, the constant effect estimator may be biased. In the general setting of stepped-wedge cluster-randomized trials with multiple interventions, we derive the expected value of the consta…
▽ More
The traditional model specification of stepped-wedge cluster-randomized trials assumes a homogeneous treatment effect across time while adjusting for fixed-time effects. However, when treatment effects vary over time, the constant effect estimator may be biased. In the general setting of stepped-wedge cluster-randomized trials with multiple interventions, we derive the expected value of the constant effect estimator when the true treatment effects depend on exposure time periods. Applying this result to concurrent and factorial stepped wedge designs, we show that the estimator represents a weighted average of exposure-time-specific treatment effects, with weights that are not necessarily uniform across exposure periods. Extensive simulation studies reveal that ignoring time heterogeneity can result in biased estimates and poor coverage of the average treatment effect. In this study, we examine two models designed to accommodate multiple interventions with time-varying treatment effects: (1) a time-varying fixed treatment effect model, which allows treatment effects to vary by exposure time but remain fixed for each time point, and (2) a random treatment effect model, where the time-varying treatment effects are modeled as random deviations from an overall mean. In the simulations considered in this study, concurrent designs generally achieve higher power than factorial designs under a time-varying fixed treatment effect model, though the differences are modest. Finally, we apply the constant effect model and both time-varying treatment effect models to data from the Prognosticating Outcomes and Nudging Decisions in the Electronic Health Record (PONDER) trial. All three models indicate a lack of treatment effect for either intervention, though they differ in the precision of their estimates, likely due to variations in modeling assumptions.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
On Anticipation Effect in Stepped Wedge Cluster Randomized Trials
Authors:
Hao Wang,
Xinyuan Chen,
Katherine R. Courtright,
Scott D. Halpern,
Michael O. Harhay,
Monica Taljaard,
Fan Li
Abstract:
In stepped wedge cluster randomized trials (SW-CRTs), the intervention is rolled out to clusters over multiple periods. A standard approach for analyzing SW-CRTs utilizes the linear mixed model where the treatment effect is only present after the treatment adoption, under the assumption of no anticipation. This assumption, however, may not always hold in practice because stakeholders, providers, o…
▽ More
In stepped wedge cluster randomized trials (SW-CRTs), the intervention is rolled out to clusters over multiple periods. A standard approach for analyzing SW-CRTs utilizes the linear mixed model where the treatment effect is only present after the treatment adoption, under the assumption of no anticipation. This assumption, however, may not always hold in practice because stakeholders, providers, or individuals who are aware of the treatment adoption timing (especially when blinding is challenging or infeasible) can inadvertently change their behaviors in anticipation of the intervention for maximizing potential benefits. We provide an analytical framework to address the anticipation effect in SW-CRTs and study its impact when the treatment effect may or may not depend on the exposure time. We derive expectations of the estimators based on a collection of linear mixed models and demonstrate that when the anticipation effect is ignored, these estimators give biased estimates of the treatment effect. We also provide updated sample size formulas that explicitly account for anticipation effects, exposure-time heterogeneity, or both in SW-CRTs and illustrate how failing to account for these effects when they exist may lead to an underpowered study. Through simulation studies and empirical analyses, we compare the treatment effect estimators under considerations and discuss practical considerations for addressing anticipation in SW-CRTs.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
A tutorial on conducting sample size and power calculations for detecting treatment effect heterogeneity in cluster randomized trials
Authors:
Mary Ryan Baumann,
Monica Taljaard,
Patrick J. Heagerty,
Michael O. Harhay,
Guangyu Tong,
Rui Wang,
Fan Li
Abstract:
Cluster-randomized trials (CRTs) are a well-established class of designs for evaluating large-scale, community-based research questions. An essential task in planning these trials is determining the required number of clusters and cluster sizes to achieve sufficient statistical power for detecting a clinically relevant effect size. Compared to methods for evaluating the average treatment effect (A…
▽ More
Cluster-randomized trials (CRTs) are a well-established class of designs for evaluating large-scale, community-based research questions. An essential task in planning these trials is determining the required number of clusters and cluster sizes to achieve sufficient statistical power for detecting a clinically relevant effect size. Compared to methods for evaluating the average treatment effect (ATE) for the entire study population, there is more recent development of sample size methods for testing the heterogeneity of treatment effects (HTEs), i.e., modification of treatment effects by subpopulation characteristics, in CRTs. For confirmatory analyses of HTEs in CRTs, effect modifiers must be pre-specified, and ideally, accompanied by sample size or power calculations to ensure the trial has adequate power for the planned analyses. Power analysis for HTE analyses is more complex than for ATEs due to the additional design parameters that must be specified. Power and sample size formulas for HTE analyses have been separately derived under several cluster-randomized designs, including single and multi-period parallel designs, crossover designs, and stepped-wedge designs, as well as under continuous and binary outcomes. This tutorial provides a consolidated reference guide for these methods and enhances their accessibility through the development of an online R Shiny calculator. We further discuss key considerations for researchers conducting sample size and power calculations for testing pre-specified HTE hypotheses in CRTs, including the essential role of advance estimates of intracluster correlation coefficients for both outcomes and covariates on power. The sample size methodology and calculator functionality are demonstrated through real CRT examples.
△ Less
Submitted 30 January, 2025;
originally announced January 2025.
-
Semiparametric principal stratification analysis beyond monotonicity
Authors:
Jiaqi Tong,
Brennan Kahan,
Michael O. Harhay,
Fan Li
Abstract:
Intercurrent events, common in clinical trials and observational studies, affect the existence or interpretation of final outcomes. Principal stratification addresses these challenges by defining local average treatment effects within latent subpopulations, but often relies on restrictive assumptions such as monotonicity and counterfactual intermediate independence. To address these limitations, w…
▽ More
Intercurrent events, common in clinical trials and observational studies, affect the existence or interpretation of final outcomes. Principal stratification addresses these challenges by defining local average treatment effects within latent subpopulations, but often relies on restrictive assumptions such as monotonicity and counterfactual intermediate independence. To address these limitations, we propose a unified semiparametric framework for principal stratification analysis leveraging a margin-free, conditional odds ratio sensitivity parameter. Under principal ignorability, we derive nonparametric identification formulas and develop efficient estimation methods, including a conditionally doubly robust parametric estimator and a de-biased machine learning estimator with data-adaptive nuisance estimators. Simulations show that incorrectly assuming monotonicity can often lead to suboptimal inference, while specifying non-trivial odds ratio sensitivity parameter can enable approximately valid inference under monotonicity. We apply our methods to a critical care trial and further suggest a semiparametric sensitivity analysis approach under violation of principal ignorability.
△ Less
Submitted 29 January, 2025;
originally announced January 2025.
-
Doubly robust estimation and sensitivity analysis with outcomes truncated by death in multi-arm clinical trials
Authors:
Jiaqi Tong,
Chao Cheng,
Guangyu Tong,
Michael O. Harhay,
Fan Li
Abstract:
In clinical trials, the observation of participant outcomes may frequently be hindered by death, leading to ambiguity in defining a scientifically meaningful final outcome for those who die. Principal stratification methods are valuable tools for addressing the average causal effect among always-survivors, i.e., the average treatment effect among a subpopulation in the principal strata of those wh…
▽ More
In clinical trials, the observation of participant outcomes may frequently be hindered by death, leading to ambiguity in defining a scientifically meaningful final outcome for those who die. Principal stratification methods are valuable tools for addressing the average causal effect among always-survivors, i.e., the average treatment effect among a subpopulation in the principal strata of those who would survive regardless of treatment assignment. Although robust methods for the truncation-by-death problem in two-arm clinical trials have been previously studied, its expansion to multi-arm clinical trials remains unknown. In this article, we study the identification of a class of survivor average causal effect estimands with multiple treatments under monotonicity and principal ignorability, and first propose simple weighting and regression approaches. As a further improvement, we then derive the efficient influence function to motivate doubly robust estimators for the survivor average causal effects in multi-arm clinical trials. We also articulate sensitivity methods under violations of key causal assumptions. Extensive simulations are conducted to investigate the finite-sample performance of the proposed methods, and a real data example is used to illustrate how to operationalize the proposed estimators and the sensitivity methods in practice.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Analysis of cohort stepped wedge cluster-randomized trials with non-ignorable dropout via joint modeling
Authors:
Alessandro Gasparini,
Michael J. Crowther,
Emiel O. Hoogendijk,
Fan Li,
Michael O. Harhay
Abstract:
Stepped wedge cluster-randomized trial (CRTs) designs randomize clusters of individuals to intervention sequences, ensuring that every cluster eventually transitions from a control period to receive the intervention under study by the end of the study period. The analysis of stepped wedge CRTs is usually more complex than parallel-arm CRTs due to more complex intra-cluster correlation structures.…
▽ More
Stepped wedge cluster-randomized trial (CRTs) designs randomize clusters of individuals to intervention sequences, ensuring that every cluster eventually transitions from a control period to receive the intervention under study by the end of the study period. The analysis of stepped wedge CRTs is usually more complex than parallel-arm CRTs due to more complex intra-cluster correlation structures. A further challenge in the analysis of closed-cohort stepped wedge CRTs, which follow groups of individuals enrolled in each period longitudinally, is the occurrence of dropout. This is particularly problematic in studies of individuals at high risk for mortality, which causes non-ignorable missing outcomes. If not appropriately addressed, missing outcomes from death will erode statistical power, at best, and bias treatment effect estimates, at worst. Joint longitudinal-survival models can accommodate informative dropout and missingness patterns in longitudinal studies. Specifically, within the joint longitudinal-survival modeling framework, one directly models the dropout process via a time-to-event submodel together with the longitudinal outcome of interest. The two submodels are then linked using a variety of possible association structures. This work extends linear mixed-effects models by jointly modeling the dropout process to accommodate informative missing outcome data in closed-cohort stepped wedge CRTs. We focus on constant intervention and general time-on-treatment effect parametrizations for the longitudinal submodel and study the performance of the proposed methodology using Monte Carlo simulation under several data-generating scenarios. We illustrate the methodology in practice by reanalyzing data from the 'Frail Older Adults: Care in Transition' (ACT) trial, a stepped wedge CRT of a multifaceted geriatric care model versus usual care in 35 primary care practices in the Netherlands.
△ Less
Submitted 18 February, 2025; v1 submitted 23 April, 2024;
originally announced April 2024.
-
Designing a Bayesian adaptive clinical trial to evaluate novel mechanical ventilation strategies in acute respiratory failure using Integrated Nested Laplace Approximations
Authors:
Reyhaneh Hosseini,
Ziming Chen,
Ewan Goligher,
Eddy Fan,
Niall D. Ferguson,
Michael O. Harhay,
Sarina Sahetya,
Martin Urner,
Christopher J. Yarnell,
Anna Heath
Abstract:
Background: We aimed to design a Bayesian adaption trial through extensive simulations to determine values for key design parameters, demonstrate error rates, and establish the expected sample size. The complexity of the proposed outcome and analysis meant that Markov Chain Monte Carlo methods were required, resulting in an infeasible computational burden. Thus, we leveraged the Integrated Nested…
▽ More
Background: We aimed to design a Bayesian adaption trial through extensive simulations to determine values for key design parameters, demonstrate error rates, and establish the expected sample size. The complexity of the proposed outcome and analysis meant that Markov Chain Monte Carlo methods were required, resulting in an infeasible computational burden. Thus, we leveraged the Integrated Nested Laplace Approximations (INLA) algorithm, a fast approximation method, to ensure the feasibility of these simulations. Methods: We simulated Bayesian adaptive two-arm superiority trials that stratified participants into two disease severity states. The outcome was analyzed with proportional odds logistic regression. Trials were stopped for superiority or futility, separately for each state. We calculated the type I error and power across 64 scenarios that varied the stopping thresholds and the minimum sample size before commencing adaptive analyses. We incorporated dynamic borrowing and used INLA to compute the posterior distributions at each adaptive analysis. Designs that maintained a type I error below 5%, a power above 80%, and a feasible mean sample size were then evaluated across 22 scenarios that varied the odds ratios for the two severity states. Results: Power generally increased as the initial sample size and the threshold for declaring futility increased. Two designs were selected for further analysis. In the comprehensive simulations, the one design had a higher chance of reaching a trial conclusion before the maximum sample size and higher probability of declaring superiority when appropriate without a substantial increase in sample size for the more realistic scenarios and was selected as the trial design. Conclusions: We designed a Bayesian adaptive trial to evaluate novel strategies for ventilation using the INLA algorithm to and optimize the trial design through simulation.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
Assessing treatment effect heterogeneity in the presence of missing effect modifier data in cluster-randomized trials
Authors:
Bryan S. Blette,
Scott D. Halpern,
Fan Li,
Michael O. Harhay
Abstract:
Understanding whether and how treatment effects vary across subgroups is crucial to inform clinical practice and recommendations. Accordingly, the assessment of heterogeneous treatment effects (HTE) based on pre-specified potential effect modifiers has become a common goal in modern randomized trials. However, when one or more potential effect modifiers are missing, complete-case analysis may lead…
▽ More
Understanding whether and how treatment effects vary across subgroups is crucial to inform clinical practice and recommendations. Accordingly, the assessment of heterogeneous treatment effects (HTE) based on pre-specified potential effect modifiers has become a common goal in modern randomized trials. However, when one or more potential effect modifiers are missing, complete-case analysis may lead to bias and under-coverage. While statistical methods for handling missing data have been proposed and compared for individually randomized trials with missing effect modifier data, few guidelines exist for the cluster-randomized setting, where intracluster correlations in the effect modifiers, outcomes, or even missingness mechanisms may introduce further threats to accurate assessment of HTE. In this article, the performance of several missing data methods are compared through a simulation study of cluster-randomized trials with continuous outcome and missing binary effect modifier data, and further illustrated using real data from the Work, Family, and Health Study. Our results suggest that multilevel multiple imputation (MMI) and Bayesian MMI have better performance than other available methods, and that Bayesian MMI has lower bias and closer to nominal coverage than standard MMI when there are model specification or compatibility issues.
△ Less
Submitted 1 December, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Using modified intention-to-treat as a principal stratum estimator for failure to initiate treatment
Authors:
Brennan C Kahan,
Ian R White,
Mark Edwards,
Michael O Harhay
Abstract:
Background: A common intercurrent event affecting many trials is when some participants do not begin their assigned treatment. Many trials use a modified intention-to-treat (mITT) approach, whereby participants who do not initiate treatment are excluded from the analysis. However, it is not clear the estimand being targeted by such an approach or the assumptions necessary for it to be unbiased.…
▽ More
Background: A common intercurrent event affecting many trials is when some participants do not begin their assigned treatment. Many trials use a modified intention-to-treat (mITT) approach, whereby participants who do not initiate treatment are excluded from the analysis. However, it is not clear the estimand being targeted by such an approach or the assumptions necessary for it to be unbiased.
Methods: We demonstrate that a mITT analysis which excludes participants who do not begin treatment is estimating a principal stratum estimand (i.e. the treatment effect in the subpopulation of participants who would begin treatment, regardless of which arm they were assigned to). The mITT estimator is unbiased for the principal stratum estimand under the assumption that the intercurrent event is not affected by the assigned treatment arm, that is, participants who initiate treatment in one arm would also do so in the other arm.
Results: We identify two key criteria in determining whether the mITT estimator is likely to be unbiased: first, we must be able to measure the participants in each treatment arm who experience the intercurrent event, and second, the assumption that treatment allocation will not affect whether the participant begins treatment must be reasonable. Most double-blind trials will satisfy these criteria, and we provide an example of an open-label trial where these criteria are likely to be satisfied as well.
Conclusions: A modified intention-to-treat analysis which excludes participants who do not begin treatment can be an unbiased estimator for the principal stratum estimand. Our framework can help identify when the assumptions for unbiasedness are likely to hold, and thus whether modified intention-to-treat is appropriate or not.
△ Less
Submitted 30 January, 2023; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Leveraging baseline covariates to analyze small cluster-randomized trials with a rare binary outcome
Authors:
Angela Y. Zhu,
Nandita Mitra,
Karla Hemming,
Michael O. Harhay,
Fan Li
Abstract:
Cluster-randomized trials (CRTs) involve randomizing entire groups of participants -- called clusters -- to treatment arms but are often comprised of a limited or fixed number of available clusters. While covariate adjustment can account for chance imbalances between treatment arms and increase statistical efficiency in individually-randomized trials, analytical methods for individual-level covari…
▽ More
Cluster-randomized trials (CRTs) involve randomizing entire groups of participants -- called clusters -- to treatment arms but are often comprised of a limited or fixed number of available clusters. While covariate adjustment can account for chance imbalances between treatment arms and increase statistical efficiency in individually-randomized trials, analytical methods for individual-level covariate adjustment in small CRTs have received little attention to date. In this paper, we systematically investigate, through extensive simulations, the operating characteristics of propensity score weighting and multivariable regression as two individual-level covariate adjustment strategies for estimating the participant-average causal effect in small CRTs with a rare binary outcome and identify scenarios where each adjustment strategy has a relative efficiency advantage over the other to make practical recommendations. We also examine the finite-sample performance of the bias-corrected sandwich variance estimators associated with propensity score weighting and multivariable regression for quantifying the uncertainty in estimating the participant-average treatment effect. To illustrate the methods for individual-level covariate adjustment, we reanalyze a recent CRT testing a sedation protocol in 31 pediatric intensive care units.
△ Less
Submitted 28 November, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
A Bayesian Machine Learning Approach for Estimating Heterogeneous Survivor Causal Effects: Applications to a Critical Care Trial
Authors:
Xinyuan Chen,
Michael O. Harhay,
Guangyu Tong,
Fan Li
Abstract:
Motivated by the Acute Respiratory Distress Syndrome Network (ARDSNetwork) ARDS respiratory management (ARMA) trial, we developed a flexible Bayesian machine learning approach to estimate the average causal effect and heterogeneous causal effects among the always-survivors stratum when clinical outcomes are subject to truncation. We adopted Bayesian additive regression trees (BART) to flexibly spe…
▽ More
Motivated by the Acute Respiratory Distress Syndrome Network (ARDSNetwork) ARDS respiratory management (ARMA) trial, we developed a flexible Bayesian machine learning approach to estimate the average causal effect and heterogeneous causal effects among the always-survivors stratum when clinical outcomes are subject to truncation. We adopted Bayesian additive regression trees (BART) to flexibly specify separate models for the potential outcomes and latent strata membership. In the analysis of the ARMA trial, we found that the low tidal volume treatment had an overall benefit for participants sustaining acute lung injuries on the outcome of time to returning home, but substantial heterogeneity in treatment effects among the always-survivors, driven most strongly by sex and the alveolar-arterial oxygen gradient at baseline (a physiologic measure of lung function and source of hypoxemia). These findings illustrate how the proposed methodology could guide the prognostic enrichment of future trials in the field. We also demonstrated through a simulation study that our proposed Bayesian machine learning approach outperforms other parametric methods in reducing the estimation bias in both the average causal effect and heterogeneous causal effects for always-survivors.
△ Less
Submitted 19 June, 2023; v1 submitted 13 April, 2022;
originally announced April 2022.
-
On the mixed-model analysis of covariance in cluster-randomized trials
Authors:
Bingkai Wang,
Michael O. Harhay,
Jiaqi Tong,
Dylan S. Small,
Tim P. Morris,
Fan Li
Abstract:
In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under t…
▽ More
In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under the potential outcomes framework, we prove that the mixed-model ANCOVA estimators for the average treatment effect are consistent and asymptotically normal under arbitrary misspecification of its working model. If the probability of receiving treatment is 0.5 for each cluster, we further show that the model-based variance estimator under mixed-model ANCOVA1 (ANCOVA without treatment-covariate interactions) remains consistent, clarifying that the confidence interval given by standard software is asymptotically valid even under model misspecification. Beyond robustness, we discuss several insights on precision among classical methods for analyzing cluster-randomized trials, including the mixed-model ANCOVA, individual-level ANCOVA, and cluster-level ANCOVA estimators. These insights may inform the choice of methods in practice. Our analytical results and insights are illustrated via simulation studies and analyses of three cluster-randomized trials.
△ Less
Submitted 8 October, 2023; v1 submitted 1 December, 2021;
originally announced December 2021.