-
Nonparametric estimation of an optimal treatment rule with fused randomized trials and missing effect modifiers
Authors:
Nicholas Williams,
Kara Rudolph,
Iván Díaz
Abstract:
A fundamental principle of clinical medicine is that a treatment should only be administered to those patients who would benefit from it. Treatment strategies that assign treatment to patients as a function of their individual characteristics are known as dynamic treatment rules. The dynamic treatment rule that optimizes the outcome in the population is called the optimal dynamic treatment rule. Randomized clinical trials are considered the gold standard for estimating the marginal causal effect of a treatment on an outcome; however, they are often not powered to detect heterogeneous treatment effects and thus may rarely inform more personalized treatment decisions. The availability of multiple trials studying a common set of treatments presents an opportunity for combining data, often called data fusion, to better estimate dynamic treatment rules. However, there may be a mismatch in the set of patient covariates measured across trials. We address this problem here; we propose a nonparametric estimator for the optimal dynamic treatment rule that leverages information across the set of randomized trials. We apply the estimator to fused randomized trials of medications for the treatment of opioid use disorder to estimate a treatment rule that would match patient subgroups with the medication that would minimize risk of return to regular opioid use.
Submitted 12 June, 2025;
originally announced June 2025.
-
Transporting results from a trial to an external target population when trial participation impacts adherence
Authors:
Rachael K. Ross,
Ivan Diaz,
Amy J. Pitts,
Elizabeth A. Stuart,
Kara E. Rudolph
Abstract:
Randomized clinical trials are considered the gold standard for informing treatment guidelines, but results may not generalize to real-world populations. Generalizability is hindered by distributional differences in baseline covariates and treatment-outcome mediators. Approaches to address differences in covariates are well established, but approaches to address differences in mediators are more limited. Here we consider the setting where trial activities that differ from usual care settings (e.g., monetary compensation, follow-up visit frequency) affect treatment adherence. When treatment and adherence data are unavailable for the real-world target population, we cannot identify the mean outcome under a specific treatment assignment (i.e., mean potential outcome) in the target. Therefore, we propose a sensitivity analysis in which a parameter for the relative difference in adherence to a specific treatment between the trial and the target, possibly conditional on covariates, must be specified. We discuss options for specification of the sensitivity analysis parameter based on external knowledge, including setting a range to estimate bounds or specifying a probability distribution from which to repeatedly draw parameter values (i.e., use Monte Carlo sampling). We introduce two estimators for the mean counterfactual outcome in the target that incorporate this sensitivity parameter: a plug-in estimator and a one-step estimator that is doubly robust and supports the use of machine learning for estimating nuisance models. Finally, we apply the proposed approach to the motivating application where we transport the risk of relapse under two different medications for the treatment of opioid use disorder from a trial to a real-world population.
Submitted 30 May, 2025;
originally announced June 2025.
-
A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference
Authors:
Harsh Parikh,
Trang Quynh Nguyen,
Elizabeth A. Stuart,
Kara E. Rudolph,
Caleb H. Miles
Abstract:
Data integration approaches are increasingly used to enhance the efficiency and generalizability of studies. However, a key limitation of these methods is the assumption that outcome measures are identical across datasets -- an assumption that often does not hold in practice. Consider the following opioid use disorder (OUD) studies: the XBOT trial and the POAT study, both evaluating the effect of medications for OUD on withdrawal symptom severity (not the primary outcome of either trial). While XBOT measures withdrawal severity using the subjective opiate withdrawal (SOW) scale, POAT uses the clinical opiate withdrawal (COW) scale. We analyze this realistic yet challenging setting where outcome measures differ across studies and where neither study records both types of outcomes. Our paper studies whether and when integrating studies with disparate outcome measures leads to efficiency gains. We introduce three sets of assumptions -- with varying degrees of strength -- linking both outcome measures. Our theoretical and empirical results highlight a cautionary tale: integration can improve asymptotic efficiency only under the strongest assumption linking the outcomes. However, misspecification of this assumption leads to bias. In contrast, a milder assumption may yield finite-sample efficiency gains, yet these benefits diminish as sample size increases. We illustrate these trade-offs via a case study integrating the XBOT and POAT datasets to estimate the comparative effect of two medications for opioid use disorder on withdrawal symptoms. By systematically varying the assumptions linking the SOW and COW scales, we show potential efficiency gains and the risks of bias. Our findings emphasize the need for careful assumption selection when fusing datasets with differing outcome measures, offering guidance for researchers navigating this common challenge in modern data integration.
Submitted 16 May, 2025;
originally announced May 2025.
-
General targeted machine learning for modern causal mediation analysis
Authors:
Richard Liu,
Nicholas T. Williams,
Kara E. Rudolph,
Iván Díaz
Abstract:
Causal mediation analyses investigate the mechanisms through which causes exert their effects, and are therefore central to scientific progress. The literature on the non-parametric definition and identification of mediational effects in rigorous causal models has grown significantly in recent years, and there has been important progress to address challenges in the interpretation and identification of such effects. Despite great progress on the causal inference front, statistical methodology for non-parametric estimation has lagged behind, with few or no methods available for tackling non-parametric estimation in the presence of multiple, continuous, or high-dimensional mediators. In this paper we show that the identification formulas for six popular non-parametric approaches to mediation analysis proposed in recent years can be recovered from just two statistical estimands. We leverage this finding to propose an all-purpose one-step estimation algorithm that can be coupled with machine learning in any mediation study that uses any of these six definitions of mediation. The estimators have desirable properties, such as $\sqrt{n}$-convergence and asymptotic normality. Estimating the first-order correction for the one-step estimator requires estimation of complex density ratios on the potentially high-dimensional mediators, a challenge that is solved using recent advancements in so-called Riesz learning. We illustrate the properties of our methods in a simulation study and illustrate their use on real data to estimate the extent to which pain management practices mediate the total effect of having a chronic pain disorder on opioid use disorder.
Submitted 12 June, 2025; v1 submitted 26 August, 2024;
originally announced August 2024.
-
Longitudinal Generalizations of the Average Treatment Effect on the Treated for Multi-valued and Continuous Treatments
Authors:
Herbert Susmann,
Nicholas T. Williams,
Kara E. Rudolph,
Iván Díaz
Abstract:
The Average Treatment Effect on the Treated (ATT) is a common causal parameter defined as the average effect of a binary treatment among the subset of the population receiving treatment. We propose a novel family of parameters, Generalized ATTs (GATTs), that generalize the concept of the ATT to longitudinal data structures, multi-valued or continuous treatments, and conditioning on arbitrary treatment subsets. We provide a formal causal identification result that expresses the GATT in terms of sequential regressions, and derive the efficient influence function of the parameter, which defines its semi-parametric efficiency bound. Efficient semi-parametric inference of the GATT requires estimating the ratios of functions of conditional probabilities (or densities); we propose directly estimating these ratios via empirical loss minimization, drawing on the theory of Riesz representers. Simulations suggest that estimation of the density ratios using Riesz representation has better stability in finite samples. Lastly, we illustrate the use of our methods to evaluate the effect of chronic pain management strategies on the development of opioid use disorder among Medicare patients with chronic pain.
Submitted 24 October, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Associations between pain-management treatments and opioid use disorder risk among Medicaid patients
Authors:
Kara E. Rudolph,
Nicholas T. Williams,
Ivan Diaz,
Sarah Forrest,
Katherine L. Hoffman,
Hillary Samples,
Mark Olfson,
Lisa Doan,
Magdalena Cerda,
Rachael Ross
Abstract:
Introduction: Chronic pain patients are at increased risk of opioid misuse. Less is known about the unique risk conferred by each pain-management treatment, as treatments are typically implemented together, confounding their independent effects. We estimated the extent to which pain-management strategies were associated with risk of incident opioid use disorder (OUD) for those with chronic pain, controlling for baseline demographic and clinical confounding variables and holding other pain-management treatments at their observed levels.
Methods: We used data from two chronic pain subgroups within a cohort of non-pregnant Medicaid patients aged 35-64 years, 2016-2019, from 25 states: 1) those with a chronic pain condition co-morbid with physical disability (N=6,133) or 2) those with chronic pain without disability (N=67,438). We considered 9 pain-management treatments: prescription opioid i) dose and ii) duration; iii) number of opioid prescribers; opioid co-prescription with iv) benzodiazepines, v) muscle relaxants, and vi) gabapentinoids; vii) non-opioid pain prescription, viii) physical therapy, and ix) other pain treatment modality. Our outcome was incident OUD.
Results: Having an opioid and gabapentin co-prescription or an opioid and benzodiazepine co-prescription was statistically significantly associated with a 16-46% increased risk of OUD. Opioid dose and duration also were significantly associated with increased risk of OUD. Physical therapy was significantly associated with an 11% decreased risk of OUD in the subgroup with chronic pain but no disability.
Conclusions: Co-prescription of opioids with either gabapentin or benzodiazepines may substantially increase risk of OUD. More positively, physical therapy may be a relatively accessible and safe pain-management strategy.
Submitted 17 April, 2024;
originally announced April 2024.
-
Identification and estimation of mediational effects of longitudinal modified treatment policies
Authors:
Brian Gilbert,
Katherine L. Hoffman,
Nicholas Williams,
Kara E. Rudolph,
Edward J. Schenck,
Iván Díaz
Abstract:
We demonstrate a comprehensive semiparametric approach to causal mediation analysis, addressing the complexities inherent in settings with longitudinal and continuous treatments, confounders, and mediators. Our methodology utilizes a nonparametric structural equation model and a cross-fitted sequential regression technique based on doubly robust pseudo-outcomes, yielding an efficient, asymptotically normal estimator without relying on restrictive parametric modeling assumptions. We are motivated by a recent scientific controversy regarding the effects of invasive mechanical ventilation (IMV) on the survival of COVID-19 patients, considering acute kidney injury (AKI) as a mediating factor. We highlight the possibility of "inconsistent mediation," in which the direct and indirect effects of the exposure operate in opposite directions. We discuss the significance of mediation analysis for scientific understanding and its potential utility in treatment decisions.
Submitted 10 December, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Who Are We Missing? A Principled Approach to Characterizing the Underrepresented Population
Authors:
Harsh Parikh,
Rachael Ross,
Elizabeth Stuart,
Kara Rudolph
Abstract:
Randomized controlled trials (RCTs) serve as the cornerstone for understanding causal effects, yet extending inferences to target populations presents challenges due to effect heterogeneity and underrepresentation. Our paper addresses the critical issue of identifying and characterizing underrepresented subgroups in RCTs, proposing a novel framework for refining target populations to improve generalizability. We introduce an optimization-based approach, Rashomon Set of Optimal Trees (ROOT), to characterize underrepresented groups. ROOT optimizes the target subpopulation distribution by minimizing the variance of the target average treatment effect estimate, ensuring more precise treatment effect estimations. Notably, ROOT generates interpretable characteristics of the underrepresented population, aiding researchers in effective communication. Our approach demonstrates improved precision and interpretability compared to alternatives, as illustrated with synthetic data experiments. We apply our methodology to extend inferences from the Starting Treatment with Agonist Replacement Therapies (START) trial -- investigating the effectiveness of medication for opioid use disorder -- to the real-world population represented by the Treatment Episode Dataset: Admissions (TEDS-A). By refining target populations using ROOT, our framework offers a systematic approach to enhance decision-making accuracy and inform future trials in diverse populations.
Submitted 25 August, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Learning Optimal Dynamic Treatment Regimes from Longitudinal Data
Authors:
Nicholas T. Williams,
Katherine L. Hoffman,
Iván Díaz,
Kara E. Rudolph
Abstract:
Studies often report estimates of the average treatment effect (ATE). While the ATE summarizes the effect of a treatment on average, it does not provide any information about the effect of treatment within any individual. A treatment strategy that uses an individual's information to tailor treatment to maximize benefit is known as an optimal dynamic treatment rule (ODTR). Treatment, however, is typically not limited to a single point in time; consequently, learning an optimal rule for a time-varying treatment may involve not just learning the extent to which the comparative treatments' benefits vary across the characteristics of individuals, but also learning the extent to which the comparative treatments' benefits vary as relevant circumstances evolve within an individual. The goal of this paper is to provide a tutorial for applied researchers on estimating ODTRs from longitudinal observational and clinical trial data. We describe an approach that uses a doubly-robust unbiased transformation of the conditional average treatment effect. We then learn a time-varying ODTR for when to increase buprenorphine-naloxone dose to minimize return-to-regular-opioid-use among patients with opioid use disorder. Our analysis highlights the utility of ODTRs in the context of sequential decision making: the learned ODTR outperforms a clinically defined strategy.
Submitted 19 January, 2024;
originally announced January 2024.
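As a rough illustration of the doubly robust transformation of the conditional average treatment effect (CATE) mentioned in the abstract above, the sketch below computes the standard augmented inverse-probability-weighted (AIPW) pseudo-outcome at a single time point. It is not the authors' code: the function names, the toy records, the fixed nuisance values, and the assumption of a known randomization probability are all hypothetical, and the longitudinal, cross-fitted procedure the tutorial describes is not reproduced here.

```python
from statistics import mean


def dr_pseudo_outcome(y, a, pi, mu1, mu0):
    """AIPW pseudo-outcome for one observation.

    y   : observed outcome
    a   : observed binary treatment (0 or 1)
    pi  : estimated P(A = 1 | X) (hypothetical nuisance estimate)
    mu1 : estimated E[Y | A = 1, X] (hypothetical nuisance estimate)
    mu0 : estimated E[Y | A = 0, X] (hypothetical nuisance estimate)

    Its conditional mean given X equals the CATE when either pi or
    (mu1, mu0) is correctly specified.
    """
    mu_a = mu1 if a == 1 else mu0
    return (a - pi) / (pi * (1 - pi)) * (y - mu_a) + (mu1 - mu0)


def subgroup_rule(records, lower_is_better=True):
    """Average pseudo-outcomes within a subgroup and pick a treatment.

    Returns (estimated subgroup CATE, recommended treatment 0/1).
    With an adverse outcome such as relapse, treat when the CATE is negative.
    """
    cate = mean(dr_pseudo_outcome(**r) for r in records)
    treat = cate < 0 if lower_is_better else cate > 0
    return cate, int(treat)


# Toy subgroup (made-up numbers): outcome 1 = return to regular opioid use.
toy_records = [
    {"y": 0, "a": 1, "pi": 0.5, "mu1": 0.2, "mu0": 0.6},
    {"y": 1, "a": 0, "pi": 0.5, "mu1": 0.2, "mu0": 0.6},
    {"y": 0, "a": 1, "pi": 0.5, "mu1": 0.2, "mu0": 0.6},
    {"y": 1, "a": 0, "pi": 0.5, "mu1": 0.2, "mu0": 0.6},
]

if __name__ == "__main__":
    print(subgroup_rule(toy_records))
```

In practice the pseudo-outcomes would be regressed on covariates with a flexible learner rather than averaged within a hand-picked subgroup; the subgroup mean is used here only to keep the sketch dependency-free.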
-
Recanting twins: addressing intermediate confounding in mediation analysis
Authors:
Tat-Thang Vo,
Nicholas Williams,
Richard Liu,
Kara E. Rudolph,
Ivan Díaz
Abstract:
The presence of intermediate confounders, also called recanting witnesses, is a fundamental challenge to the investigation of causal mechanisms in mediation analysis, preventing the identification of natural path-specific effects. Proposed alternative parameters (such as randomizational interventional effects) are problematic because they can be non-null even when there is no mediation for any individual in the population; i.e., they are not an average of underlying individual-level mechanisms. In this paper we develop a novel method for mediation analysis in settings with intermediate confounding, with guarantees that the causal parameters are summaries of the individual-level mechanisms of interest. The method is based on recently proposed ideas that view causality as the transfer of information, and thus replace recanting witnesses by draws from their conditional distribution, what we call "recanting twins". We show that, in the absence of intermediate confounding, recanting twin effects recover natural path-specific effects. We present the assumptions required for identification of recanting twins effects under a standard structural causal model, as well as the assumptions under which the recanting twin identification formulas can be interpreted in the context of the recently proposed separable effects models. To estimate recanting-twin effects, we develop efficient semi-parametric estimators that allow the use of data driven methods in the estimation of the nuisance parameters. We present numerical studies of the methods using synthetic data, as well as an application to evaluate the role of new-onset anxiety and depressive disorder in explaining the relationship between gabapentin/pregabalin prescription and incident opioid use disorder among Medicaid beneficiaries with chronic pain.
Submitted 9 January, 2024;
originally announced January 2024.
-
Two-Step Targeted Minimum-Loss Based Estimation for Non-Negative Two-Part Outcomes
Authors:
Nicholas T. Williams,
Richard Liu,
Katherine L. Hoffman,
Sarah Forrest,
Kara E. Rudolph,
Iván Díaz
Abstract:
Non-negative two-part outcomes are defined as outcomes whose distribution has a point mass at zero but is otherwise positive. Examples, such as healthcare expenditure and hospital length of stay, are common in healthcare utilization research. Despite the practical relevance of non-negative two-part outcomes, very few methods exist to leverage knowledge of their semicontinuity to achieve improved performance in estimating causal effects. In this paper, we develop a nonparametric two-step targeted minimum-loss based estimator (denoted as hTMLE) for non-negative two-part outcomes. We present methods for a general class of interventions referred to as modified treatment policies, which can accommodate continuous, categorical, and binary exposures. The two-step TMLE uses a targeted estimate of the intensity component of the outcome to produce a targeted estimate of the binary component of the outcome that may improve finite sample efficiency. We demonstrate the efficiency gains achieved by the two-step TMLE with simulated examples and then apply it to a cohort of Medicaid beneficiaries to estimate the effect of chronic pain and physical disability on days' supply of opioids.
Submitted 22 April, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Transporting treatment effects from difference-in-differences studies
Authors:
Audrey Renson,
Ellicott C. Matthay,
Kara E. Rudolph
Abstract:
Difference-in-differences (DID) is a popular approach to identify the causal effects of treatments and policies in the presence of unmeasured confounding. DID identifies the sample average treatment effect in the treated (SATT). However, a goal of such research is often to inform decision-making in target populations outside the treated sample. Transportability methods have been developed to extend inferences from study samples to external target populations; these methods have primarily been developed and applied in settings where identification is based on conditional independence between the treatment and potential outcomes, such as in a randomized trial. We present a novel approach to identifying and estimating effects in a target population, based on DID conducted in a study sample that differs from the target population. We present a range of assumptions under which one may identify causal effects in the target population and employ causal diagrams to illustrate these assumptions. In most realistic settings, results depend critically on the assumption that any unmeasured confounders are not effect measure modifiers on the scale of the effect of interest (e.g., risk difference, odds ratio). We develop several estimators of transported effects, including g-computation, inverse odds weighting, and a doubly robust estimator based on the efficient influence function. Simulation results support theoretical properties of the proposed estimators. As an example, we apply our approach to study the effects of a 2018 US federal smoke-free public housing law on air quality in public housing across the US, using data from a DID study conducted in New York City alone.
Submitted 18 June, 2024; v1 submitted 26 October, 2023;
originally announced October 2023.
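For readers unfamiliar with the difference-in-differences contrast that the abstract above builds on, the following minimal sketch computes the canonical two-group, two-period estimate. It is only an illustration under the usual parallel-trends assumption: the function name and the toy numbers are hypothetical, and none of the transported estimators proposed in the paper (g-computation, inverse odds weighting, doubly robust) are implemented here.

```python
from statistics import mean


def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Two-group, two-period difference-in-differences.

    Change over time in the treated group minus change over time in the
    control group. Under parallel trends this estimates the average
    treatment effect in the treated sample.
    """
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


if __name__ == "__main__":
    # Made-up outcome measurements before and after a policy change.
    estimate = did_estimate(
        treated_pre=[10, 12],
        treated_post=[8, 9],
        control_pre=[9, 11],
        control_post=[9, 10],
    )
    print(estimate)
```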
-
Studying continuous, time-varying, and/or complex exposures using longitudinal modified treatment policies
Authors:
Katherine L. Hoffman,
Diego Salazar-Barreto,
Nicholas Williams,
Kara E. Rudolph,
Ivan Diaz
Abstract:
This tutorial discusses methodology for causal inference using longitudinal modified treatment policies. This method facilitates the mathematical formalization, identification, and estimation of many novel parameters, and mathematically generalizes many commonly used parameters, such as the average treatment effect. Longitudinal modified treatment policies apply to a wide variety of exposures, including binary, multivariate, and continuous, and can accommodate time-varying treatments and confounders, competing risks, loss-to-follow-up, as well as survival, binary, or continuous outcomes. Longitudinal modified treatment policies can be seen as an extension of static and dynamic interventions to involve the natural value of treatment, and, like dynamic interventions, can be used to define alternative estimands with a positivity assumption that is more likely to be satisfied than estimands corresponding to static interventions. This tutorial aims to illustrate several practical uses of the longitudinal modified treatment policy methodology, including describing different estimation strategies and their corresponding advantages and disadvantages. We provide numerous examples of types of research questions which can be answered using longitudinal modified treatment policies. We go into more depth with one of these examples--specifically, estimating the effect of delaying intubation on critically ill COVID-19 patients' mortality. We demonstrate the use of the open-source R package lmtp to estimate the effects, and we provide code on https://github.com/kathoffman/lmtp-tutorial.
Submitted 14 May, 2024; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Improving efficiency in transporting average treatment effects
Authors:
Kara E. Rudolph,
Nicholas T. Williams,
Elizabeth A. Stuart,
Ivan Diaz
Abstract:
We develop flexible, semiparametric estimators of the average treatment effect (ATE) transported to a new population ("target population") that offer potential efficiency gains. Transport may be of value when the ATE may differ across populations. We consider the setting where differences in the ATE are due to differences in the distribution of baseline covariates that modify the treatment effect ("effect modifiers"). First, we propose a collaborative one-step semiparametric estimator that can improve efficiency. This approach does not require researchers to have knowledge about which covariates are effect modifiers and which differ in distribution between the populations, but does require all covariates to be measured in the target population. Second, we propose two one-step semiparametric estimators that assume knowledge of which covariates are effect modifiers and which are both effect modifiers and differentially distributed between the populations. These estimators can be used even when not all covariates are observed in the target population; one requires that only effect modifiers are observed, and the other requires that only those modifiers that are also differentially distributed are observed. We use simulation to compare finite sample performance across our proposed estimators and an existing semiparametric estimator of the transported ATE, including in the presence of practical violations of the positivity assumption. Lastly, we apply our proposed estimators to a large-scale housing trial.
Submitted 6 June, 2024; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Nonparametric estimators of interventional (transported) direct and indirect effects that accommodate multiple mediators and multiple intermediate confounders
Authors:
Kara E Rudolph,
Nicholas Williams,
Ivan Diaz
Abstract:
Mediation analysis is appealing for its ability to improve understanding of the mechanistic drivers of causal effects, but real-world data complexities challenge its successful implementation, including: 1) the existence of post-exposure variables that also affect mediators and outcomes (thus, confounding the mediator-outcome relationship), that may also be 2) multivariate, and 3) the existence of multivariate mediators. Interventional direct and indirect effects (IDE/IIE) accommodate post-exposure variables that confound the mediator-outcome relationship, but currently, no estimator for IDE/IIE exists that allows for both multivariate mediators and multivariate post-exposure intermediate confounders. This, again, represents a significant limitation for real-world analyses. We address this gap by extending two recently developed nonparametric estimators -- one that estimates the IDE/IIE and another that estimates the IDE/IIE transported to a new, target population -- to allow for multivariate mediators and multivariate intermediate confounders simultaneously. We use simulation to examine finite sample performance, and apply these estimators to longitudinal data from the Moving to Opportunity trial. In the application, we walk through a strategy for separating indirect effects into mediator- or mediator-group-specific indirect effects, while appropriately accounting for other, possibly co-occurring intermediate variables.
Submitted 15 December, 2022;
originally announced December 2022.
-
All models are wrong, but which are useful? Comparing parametric and nonparametric estimation of causal effects in finite samples
Authors:
Kara E. Rudolph,
Nicholas Williams,
Caleb H. Miles,
Joseph Antonelli,
Ivan Diaz
Abstract:
There is a long-standing debate in the statistical, epidemiological and econometric fields as to whether nonparametric estimation that uses data-adaptive methods, like machine learning algorithms in model fitting, confers any meaningful advantage over simpler, parametric approaches in real-world, finite sample estimation of causal effects. We address the question: when trying to estimate the effect of a treatment on an outcome, across a universe of reasonable data distributions, how much does the choice of nonparametric vs. parametric estimation matter? Instead of answering this question with simulations that reflect a few chosen data scenarios, we propose a novel approach evaluating performance across thousands of data-generating mechanisms drawn from non-parametric models with semi-informative priors. We call this approach a Universal Monte-Carlo Simulation. We compare performance of estimating the average treatment effect across two parametric estimators (a g-computation estimator that uses a parametric outcome model and an inverse probability of treatment weighted estimator) and two nonparametric estimators (Bayesian additive regression trees and a targeted minimum loss-based estimator that uses an ensemble of machine learning algorithms in model fitting). We summarize estimator performance in terms of bias, confidence interval coverage, and mean squared error. We find that the nonparametric estimators nearly always outperform the parametric estimators with the exception of having similar performance in terms of bias and similar-to-slightly-worse performance in terms of coverage under the smallest sample size of N=100.
Submitted 19 December, 2022; v1 submitted 18 November, 2022;
originally announced November 2022.
-
A novel decomposition to explain heterogeneity in observational and randomized studies of causality
Authors:
Brian Gilbert,
Ivan Díaz,
Kara E. Rudolph,
Tat-Thang Vo
Abstract:
This paper introduces a novel decomposition framework to explain heterogeneity in causal effects observed across different studies, considering both observational and randomized settings. We present a formal decomposition of between-study heterogeneity, identifying sources of variability in treatment effects across studies. The proposed methodology allows for robust estimation of causal parameters under various assumptions, addressing differences in pre-treatment covariate distributions, mediating variables, and the outcome mechanism. Our approach is validated through a simulation study and applied to data from the Moving to Opportunity (MTO) study, demonstrating its practical relevance. This work contributes to the broader understanding of causal inference in multi-study environments, with potential applications in evidence synthesis and policy-making.
Submitted 9 December, 2024; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Efficient and flexible estimation of natural mediation effects under intermediate confounding and monotonicity constraints
Authors:
Kara E. Rudolph,
Ivan Diaz
Abstract:
Natural direct and indirect effects are mediational estimands that decompose the average treatment effect and describe how outcomes would be affected by contrasting levels of a treatment through changes induced in mediator values (in the case of the indirect effect) or not through induced changes in the mediator values (in the case of the direct effect). Natural direct and indirect effects are not generally point-identifiable in the presence of a treatment-induced confounder; however, they may still be identified if one is willing to assume monotonicity between a treatment and the treatment-induced confounder. We argue that this assumption may be reasonable in the relatively common encouragement-design trial setting where the intervention is randomized treatment assignment and the treatment-induced confounder is whether or not treatment was actually taken/adhered to. We develop efficiency theory for the natural direct and indirect effects under this monotonicity assumption, and use it to propose a nonparametric, multiply robust estimator. We demonstrate the finite sample properties of this estimator using a simulation study, and apply it to data from the Moving to Opportunity Study to estimate the natural direct and indirect effects of being randomly assigned to receive a Section 8 housing voucher -- the most common form of federal housing assistance -- on the risk of developing any mood or externalizing disorder among adolescent boys, possibly operating through various school and community characteristics.
Submitted 9 May, 2022;
originally announced May 2022.
-
Efficient and flexible causal mediation with time-varying mediators, treatments, and confounders
Authors:
Iván Díaz,
Nicholas Williams,
Kara E. Rudolph
Abstract:
Interventional effects have been proposed as a solution to the unidentifiability of natural (in)direct effects under mediator-outcome confounders affected by the exposure. Such confounders are an intrinsic characteristic of studies with time-varying exposures and mediators, yet the generalization of the interventional effect framework to the time-varying case has received little attention in the literature. We present an identification result for interventional effects in a general longitudinal data structure that allows flexibility in the specification of treatment-outcome, treatment-mediator, and mediator-outcome relationships. Identification is achieved under the standard no-unmeasured-confounders and positivity assumptions. We also present a theoretical and computational study of the properties of the identifying functional based on the efficient influence function (EIF). We use the EIF to propose a sequential regression estimation algorithm that yields doubly robust, $\sqrt{n}$-consistent, asymptotically Gaussian, and efficient estimators under slow convergence rates for the regression algorithms used. This allows the use of flexible machine learning for regression while permitting uncertainty quantification through confidence intervals and p-values. A free and open source R package implementing our proposed estimators is made available on GitHub. We apply the proposed estimator to an application from a comparative effectiveness trial of two medications for opioid use disorder. In the application, we estimate the extent to which the difference between the two treatments' effects on subsequent risk of opioid use is mediated by craving symptoms.
Submitted 28 March, 2022;
originally announced March 2022.
-
Causal mediation with instrumental variables
Authors:
Kara E. Rudolph,
Nicholas Williams,
Ivan Diaz
Abstract:
Mediation analysis is a strategy for understanding the mechanisms by which treatments or interventions affect later outcomes. Mediation analysis is frequently applied in randomized trial settings, but typically assumes: a) that randomized assignment is the exposure of interest as opposed to actual take-up of the intervention, and b) no unobserved confounding of the mediator-outcome relationship. In contrast to the rich literature on instrumental variable (IV) methods to estimate a total effect of a non-randomized exposure, there has been almost no research into using IV as an identification strategy in the presence of both exposure-outcome and mediator-outcome unobserved confounding. In response, we define and identify novel estimands -- complier interventional direct and indirect effects (i.e., IV mediational effects) in two scenarios: 1) with a single IV for the exposure, and 2) with two IVs, one for the exposure and another for the mediator, that may be related. We propose nonparametric, robust, efficient estimators, and apply them to a housing voucher experiment.
Submitted 27 December, 2021;
originally announced December 2021.
-
When effects cannot be estimated: redefining estimands to understand the effects of naloxone access laws
Authors:
Kara E. Rudolph,
Catherine Gimbrone,
Ellicott C. Matthay,
Ivan Diaz,
Corey S. Davis,
Katherine Keyes,
Magdalena Cerda
Abstract:
Violations of the positivity assumption (also called the common support condition) challenge health policy research, and can result in significant bias, large variance, and invalid inference. We define positivity in the single- and multiple-timepoint (i.e., longitudinal) health policy evaluation setting, and discuss real-world threats to positivity. We show empirical evidence of the practical positivity violations that can result when attempting to estimate effects of health policies (in this case, Naloxone Access Laws). In such scenarios, an alternative is to estimate the effect of a shift in law enactment (e.g., the effect if enactment had been delayed by some number of years). Such an effect corresponds to what is called a modified treatment policy, and dramatically weakens the required positivity assumption, thereby offering a means to estimate policy effects even in scenarios with serious positivity problems. We apply the approach to define and estimate longitudinal effects of Naloxone Access Laws on opioid overdose rates.
Submitted 13 June, 2022; v1 submitted 6 May, 2021;
originally announced May 2021.
-
When the ends don't justify the means: Learning a treatment strategy to prevent harmful indirect effects
Authors:
Kara E. Rudolph,
Ivan Diaz
Abstract:
There is a growing literature on finding so-called optimal treatment rules, which are rules by which to assign treatment to individuals based on an individual's characteristics, such that a desired outcome is maximized. A related goal entails identifying individuals who are predicted to have a harmful indirect effect (the effect of treatment on an outcome through mediators) even in the presence of an overall beneficial effect of the treatment on the outcome. In some cases, the likelihood of a harmful indirect effect may outweigh a likely beneficial overall effect, and would be reason to caution against treatment for indicated individuals. We build on both the current mediation and optimal treatment rule literature to propose a method of identifying a subgroup for which the treatment effect through the mediator is harmful. Our approach is nonparametric, incorporates post-treatment variables that may confound the mediator-outcome relationship, and does not make restrictions on the distribution of baseline covariates, mediating variables (considered jointly), or outcomes. We apply the proposed approach to identify a subgroup of boys in the Moving to Opportunity housing voucher experiment who are predicted to have harmful indirect effects, though the average total effect is beneficial.
Submitted 21 January, 2021;
originally announced January 2021.
-
Nonparametric causal mediation analysis for stochastic interventional (in)direct effects
Authors:
Nima S. Hejazi,
Kara E. Rudolph,
Mark J. van der Laan,
Iván Díaz
Abstract:
Causal mediation analysis has historically been limited in two important ways: (i) a focus has traditionally been placed on binary treatments and static interventions, and (ii) direct and indirect effect decompositions have been pursued that are only identifiable in the absence of intermediate confounders affected by treatment. We present a theoretical study of an (in)direct effect decomposition of the population intervention effect, defined by stochastic interventions jointly applied to the treatment and mediators. In contrast to existing proposals, our causal effects can be evaluated regardless of whether a treatment is categorical or continuous and remain well-defined even in the presence of intermediate confounders affected by treatment. Our (in)direct effects are identifiable without a restrictive assumption on cross-world counterfactual independencies, allowing for substantive conclusions drawn from them to be validated in randomized controlled trials. Beyond the novel effects introduced, we provide a careful study of nonparametric efficiency theory relevant for the construction of flexible, multiply robust estimators of our (in)direct effects, while avoiding undue restrictions induced by assuming parametric models of nuisance parameter functionals. To complement our nonparametric estimation strategy, we introduce inferential techniques for constructing confidence intervals and hypothesis tests, and discuss open source software implementing the proposed methodology.
Submitted 11 January, 2022; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Efficiently transporting causal (in)direct effects to new populations under intermediate confounding and with multiple mediators
Authors:
Kara E. Rudolph,
Ivan Diaz
Abstract:
The same intervention can produce different effects in different sites. Transport mediation estimators can estimate the extent to which such differences can be explained by differences in compositional factors and the mechanisms by which mediating or intermediate variables are produced; however, they are limited to considering a single, binary mediator. We propose novel nonparametric estimators of transported stochastic (in)direct effects that consider multiple, high-dimensional mediators and intermediate variables. They are multiply robust, efficient, asymptotically normal, and can incorporate data-adaptive estimation of nuisance parameters. They can be applied to understand differences in treatment effects across sites and/or to predict treatment effects in a target site based on outcome data in source sites.
Submitted 13 June, 2020;
originally announced June 2020.
-
Non-parametric efficient causal mediation with intermediate confounders
Authors:
Iván Díaz,
Nima S. Hejazi,
Kara E. Rudolph,
Mark J. van der Laan
Abstract:
Interventional effects for mediation analysis were proposed as a solution to the lack of identifiability of natural (in)direct effects in the presence of a mediator-outcome confounder affected by exposure. We present a theoretical and computational study of the properties of the interventional (in)direct effect estimands based on the efficient influence function (EIF) in the non-parametric statistical model. We use the EIF to develop two asymptotically optimal, non-parametric estimators that leverage data-adaptive regression for estimation of the nuisance parameters: a one-step estimator and a targeted minimum loss estimator. A free and open source R package implementing our proposed estimators is made available on GitHub. We further present results establishing the conditions under which these estimators are consistent, multiply robust, $n^{1/2}$-consistent and efficient. We illustrate the finite-sample performance of the estimators and corroborate our theoretical results in a simulation study. We also demonstrate the use of the estimators in our motivating application to elucidate the mechanisms behind the unintended harmful effects that a housing intervention had on adolescent girls' risk behavior.
Submitted 29 May, 2020; v1 submitted 20 December, 2019;
originally announced December 2019.
-
Transporting stochastic direct and indirect effects to new populations
Authors:
Kara E Rudolph,
Jonathan Levy,
Mark J van der Laan
Abstract:
Transported mediation effects may contribute to understanding how and why interventions may work differently when applied to new populations. However, we are not aware of any estimators for such effects. Thus, we propose several different estimators of transported stochastic direct and indirect effects: an inverse-probability of treatment stabilized weighted estimator, a doubly robust estimator that solves the estimating equation, and a doubly robust substitution estimator in the targeted minimum loss-based framework. We demonstrate their finite sample properties in a simulation study.
Submitted 8 March, 2019;
originally announced March 2019.
-
Complier stochastic direct effects: identification and robust estimation
Authors:
Kara E Rudolph,
Oleg Sofrygin,
Mark J van der Laan
Abstract:
Mediation analysis is critical to understanding the mechanisms underlying exposure-outcome relationships. In this paper, we identify the instrumental variable (IV)-direct effect of the exposure on the outcome not through the mediator, using randomization of the instrument. To our knowledge, such an estimand has not previously been considered or estimated. We propose and evaluate several estimators for this estimand: a ratio of inverse-probability of treatment-weighted estimators (IPTW), a ratio of estimating equation estimators (EE), a ratio of targeted minimum loss-based estimators (TMLE), and a TMLE that targets the CSDE directly. These estimators are applicable for a variety of study designs, including randomized encouragement trials, like the MTO housing voucher experiment we consider as an illustrative example, treatment discontinuities, and Mendelian randomization. We found the IPTW estimator to be the most sensitive to finite sample bias, resulting in bias of over 40% even when all models were correctly specified in a sample size of N=100. In contrast, the EE estimator and compatible TMLE estimator were far less sensitive to finite samples. The EE and TMLE estimators also have advantages over the IPTW estimator in terms of efficiency and reduced reliance on correct parametric model specification.
Submitted 29 October, 2018;
originally announced October 2018.
-
Robust and Flexible Estimation of Stochastic Mediation Effects: A Proposed Method and Example in a Randomized Trial Setting
Authors:
Kara E. Rudolph,
Oleg Sofrygin,
Wenjing Zheng,
Mark J. van der Laan
Abstract:
Causal mediation analysis can improve understanding of the mechanisms underlying epidemiologic associations. However, the utility of natural direct and indirect effect estimation has been limited by the assumption of no confounder of the mediator-outcome relationship that is affected by prior exposure, an assumption frequently violated in practice. We build on recent work that identified alternative estimands that do not require this assumption and propose a flexible and doubly robust semiparametric targeted minimum loss-based estimator for data-dependent stochastic direct and indirect effects. The proposed method treats the intermediate confounder affected by prior exposure as a time-varying confounder and intervenes stochastically on the mediator using a distribution which conditions on baseline covariates and marginalizes over the intermediate confounder. In addition, we assume the stochastic intervention is given, conditional on observed data, which results in a simpler estimator and weaker identification assumptions. We demonstrate the estimator's finite sample and robustness properties in a simple simulation study. We apply the method to an example from the Moving to Opportunity experiment. In this application, randomization to receive a housing voucher is the treatment/instrument that influenced moving to a low-poverty neighborhood, which is the intermediate confounder. We estimate the data-dependent stochastic direct effect of randomization to the voucher group on adolescent marijuana use not mediated by change in school district and the stochastic indirect effect mediated by change in school district. We find no evidence of mediation. Our estimator is easy to implement in standard statistical software, and we provide annotated R code to further lower implementation barriers.
Submitted 29 October, 2018; v1 submitted 27 July, 2017;
originally announced July 2017.