-
Causal Inference in Randomized Trials with Partial Clustering
Authors:
Joshua Nugent,
Elijah Kakande,
Gabriel Chamie,
Jane Kabami,
Asiphas Owaraganise,
Diane V. Havlir,
Moses Kamya,
Laura Balzer
Abstract:
Clustering and dependence are common in trials. For example, in some cluster randomized trials (CRTs), pre-existing clusters are enrolled, randomized, and serve as the basis of intervention delivery. Such CRTs are "fully clustered": participants are dependent within clusters. In contrast, "partially clustered" trials contain a mix of participants that are dependent within clusters and participants…
▽ More
Clustering and dependence are common in trials. For example, in some cluster randomized trials (CRTs), pre-existing clusters are enrolled, randomized, and serve as the basis of intervention delivery. Such CRTs are "fully clustered": participants are dependent within clusters. In contrast, "partially clustered" trials contain a mix of participants that are dependent within clusters and participants that are completely independent. One example of this design is a trial where participants are artificially grouped together for the purposes of randomization only; then, for intervention participants, the groups are the basis for intervention delivery, while control participants are un-grouped. Another example is an individually randomized group treatment trial (IRGTT) where participants are individually randomized and, post-randomization, intervention participants are grouped for intervention delivery, while the control participants remain un-grouped. For the three trial designs, we use causal models to non-parametrically describe the data generating process and formalize the observed data dependence structure. We show that despite the different randomization approach, both designs can be represented with the same dependence structure, enabling the use of the same statistical methods for estimation and inference of causal effects. We propose a novel implementation of targeted minimum loss-based estimation (TMLE) for these trials. TMLE is model-robust, leverages covariate adjustment and machine learning, and estimates many causal effects. In simulations, TMLE achieved comparable higher statistical power than alternatives for partially clustered designs. Finally, application to real data from the SEARCH-IPT trial resulted in 20-57% efficiency gains, demonstrating the consequences of our proposed approach.
△ Less
Submitted 8 November, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
When exposure affects subgroup membership: Framing relevant causal questions in perinatal epidemiology and beyond
Authors:
Shalika Gupta,
Laura B. Balzer,
Moses R. Kamya,
Diane V. Havlir,
Maya L. Petersen
Abstract:
Perinatal epidemiology often aims to evaluate exposures on infant outcomes. When the exposure affects the composition of people who give birth to live infants (e.g., by affecting fertility, behavior, or birth outcomes), this "live birth process" mediates the exposure effect on infant outcomes. Causal estimands previously proposed for this setting include the total exposure effect on composite birt…
▽ More
Perinatal epidemiology often aims to evaluate exposures on infant outcomes. When the exposure affects the composition of people who give birth to live infants (e.g., by affecting fertility, behavior, or birth outcomes), this "live birth process" mediates the exposure effect on infant outcomes. Causal estimands previously proposed for this setting include the total exposure effect on composite birth and infant outcomes, controlled direct effects (e.g., enforcing birth), and principal stratum direct effects. Using perinatal HIV transmission in the SEARCH Study as a motivating example, we present two alternative causal estimands: 1) conditional total effects; and 2) conditional stochastic direct effects, formulated under a hypothetical intervention to draw mediator values from some distribution (possibly conditional on covariates). The proposed conditional total effect includes impacts of an intervention that operate by changing the types of people who have a live birth and the timing of births. The proposed conditional stochastic direct effects isolate the effect of an exposure on infant outcomes excluding any impacts through this live birth process. In SEARCH, this approach quantifies the impact of a universal testing and treatment intervention on infant HIV-free survival absent any effect of the intervention on the live birth process, within a clearly defined target population of women of reproductive age with HIV at study baseline. Our approach has implications for the evaluation of intervention effects in perinatal epidemiology broadly, and whenever causal effects within a subgroup are of interest and exposure affects membership in the subgroup.
△ Less
Submitted 20 January, 2024;
originally announced January 2024.
-
Statistical Analysis Plan for Primary and Selected Secondary Health Endpoints of the SEARCH-Youth Study
Authors:
Laura B. Balzer,
Theodore Ruel,
Diane V. Havlir,
the SEARCH-Youth Study Team
Abstract:
This document provides the statistical analytic plan (SAP) for evaluating health outcomes in the SEARCH-Youth study, a cluster randomized trial designed to evaluate the effect of a combination intervention on HIV viral suppression among adolescents and young adults with HIV in rural Uganda and Kenya (Clinicaltrials.gov: NCT03848728). The SAP was locked prior to unblinding and effect estimation. Th…
▽ More
This document provides the statistical analytic plan (SAP) for evaluating health outcomes in the SEARCH-Youth study, a cluster randomized trial designed to evaluate the effect of a combination intervention on HIV viral suppression among adolescents and young adults with HIV in rural Uganda and Kenya (Clinicaltrials.gov: NCT03848728). The SAP was locked prior to unblinding and effect estimation. This SAP was embargoed until November 04, 2022 when it was submitted to arXiv.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
Statistical Analysis Plan for Health Outcomes in Phase 1 of the SEARCH-IPT Study
Authors:
Laura B. Balzer,
Joshua Nugent,
Diane V. Havlir,
Gabriel Chamie
Abstract:
This document provides the statistical analytic plan (SAP) for evaluating health outcomes in Phase 1 of the SEARCH-IPT Study, a cluster randomized trial to evaluate whether a multicomponent intervention increases uptake of isoniazid (INH) preventive therapy (IPT) and reduces the incidence of tuberculosis (TB) in Uganda (Clinicaltrials.gov: NCT03315962). The SAP was locked prior to unblinding and e…
▽ More
This document provides the statistical analytic plan (SAP) for evaluating health outcomes in Phase 1 of the SEARCH-IPT Study, a cluster randomized trial to evaluate whether a multicomponent intervention increases uptake of isoniazid (INH) preventive therapy (IPT) and reduces the incidence of tuberculosis (TB) in Uganda (Clinicaltrials.gov: NCT03315962). The SAP was locked prior to unblinding and effect estimation. This SAP was embargoed until November 19, 2021 when it was submitted to arXiv.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Two-Stage TMLE to Reduce Bias and Improve Efficiency in Cluster Randomized Trials
Authors:
Laura B. Balzer,
Mark van der Laan,
James Ayieko,
Moses Kamya,
Gabriel Chamie,
Joshua Schwab,
Diane V. Havlir,
Maya L. Petersen
Abstract:
Cluster randomized trials (CRTs) randomly assign an intervention to groups of individuals (e.g., clinics or communities) and measure outcomes on individuals in those groups. While offering many advantages, this experimental design introduces challenges that are only partially addressed by existing analytic approaches. First, outcomes are often missing for some individuals within clusters. Failing…
▽ More
Cluster randomized trials (CRTs) randomly assign an intervention to groups of individuals (e.g., clinics or communities) and measure outcomes on individuals in those groups. While offering many advantages, this experimental design introduces challenges that are only partially addressed by existing analytic approaches. First, outcomes are often missing for some individuals within clusters. Failing to appropriately adjust for differential outcome measurement can result in biased estimates and inference. Second, CRTs often randomize limited numbers of clusters, resulting in chance imbalances on baseline outcome predictors between arms. Failing to adaptively adjust for these imbalances and other predictive covariates can result in efficiency losses. To address these methodological gaps, we propose and evaluate a novel two-stage targeted minimum loss-based estimator (TMLE) to adjust for baseline covariates in a manner that optimizes precision, after controlling for baseline and post-baseline causes of missing outcomes. Finite sample simulations illustrate that our approach can nearly eliminate bias due to differential outcome measurement, while existing CRT estimators yield misleading results and inferences. Application to real data from the SEARCH community randomized trial demonstrates the gains in efficiency afforded through adaptive adjustment for baseline covariates, after controlling for missingness on individual-level outcomes.
△ Less
Submitted 20 October, 2021; v1 submitted 29 June, 2021;
originally announced June 2021.
-
Statistical Analysis Plan for SEARCH Phase I: Health Outcomes among Adults
Authors:
Laura B. Balzer,
Diane V. Havlir,
Joshua Schwab,
Mark J. Van Der Laan,
Maya L. Petersen
Abstract:
This document provides the analytic plan for evaluating adult HIV incidence, health, and implementation outcomes for the first phase of the SEARCH Study. Locked: November 27, 2017. Embargoed until July 25, 2018.
This document provides the analytic plan for evaluating adult HIV incidence, health, and implementation outcomes for the first phase of the SEARCH Study. Locked: November 27, 2017. Embargoed until July 25, 2018.
△ Less
Submitted 25 July, 2018;
originally announced August 2018.