-
Formalizing the causal interpretation in accelerated failure time models with unmeasured heterogeneity
Authors:
Mari Brathovde,
Hein Putter,
Morten Valberg,
Richard A. J. Post
Abstract:
In the presence of unmeasured heterogeneity, the hazard ratio for exposure has a complex causal interpretation. To address this, accelerated failure time (AFT) models, which assess the effect on the survival time ratio scale, are often suggested as a better alternative. AFT models also allow for straightforward confounder adjustment. In this work, we formalize the causal interpretation of the acceleration factor in AFT models using structural causal models and data under independent censoring. We prove that the acceleration factor is a valid causal effect measure, even in the presence of frailty and treatment effect heterogeneity. Through simulations, we show that the acceleration factor better captures the causal effect than the hazard ratio when both AFT and proportional hazards models apply. Additionally, we extend the interpretation to systems with time-dependent acceleration factors, revealing the challenge of distinguishing between a time-varying homogeneous effect and unmeasured heterogeneity. While the causal interpretation of acceleration factors is promising, we caution practitioners about potential challenges in estimating these factors in the presence of effect heterogeneity.
Submitted 3 September, 2024;
originally announced September 2024.
-
Beyond Conditional Averages: Estimating The Individual Causal Effect Distribution
Authors:
Richard Post,
Edwin van den Heuvel
Abstract:
In recent years, the field of causal inference from observational data has emerged rapidly. The literature has focused on (conditional) average causal effect estimation. When (remaining) variability of individual causal effects (ICEs) is considerable, average effects may be uninformative for an individual. The fundamental problem of causal inference precludes estimating the joint distribution of potential outcomes without making assumptions. In this work, we show that the ICE distribution is identifiable under (conditional) independence of the individual effect and the potential outcome under no exposure, in addition to the common assumptions of consistency, positivity, and conditional exchangeability. Moreover, we present a family of flexible latent variable models that can be used to study individual effect modification and estimate the ICE distribution from cross-sectional data. How such latent variable models can be applied and validated in practice is illustrated in a case study on the effect of Hepatic Steatosis on a clinical precursor to heart failure. Under the assumptions presented, we estimate that 20.6% (95% Bayesian credible interval: 8.9%, 33.6%) of the population has a harmful effect greater than twice the average causal effect.
Submitted 8 April, 2025; v1 submitted 29 October, 2022;
originally announced October 2022.
-
Bias of the additive hazard model in the presence of causal effect heterogeneity
Authors:
Richard Post,
Edwin van den Heuvel,
Hein Putter
Abstract:
Hazard ratios are prone to selection bias, compromising their use as causal estimands. On the other hand, the hazard difference has been shown to remain unaffected by the selection of frailty factors over time. Therefore, observed hazard differences can be used as an unbiased estimator for the causal hazard differences in the absence of confounding. However, in the presence of effect (on the hazard) heterogeneity, the hazard difference is also affected by selection. In this work, we formalize how the observed hazard difference (from a randomized controlled trial) evolves by selecting favourable levels of effect modifiers in the exposed group and thus deviates from the causal hazard difference of interest. Such selection may result in a non-linear integrated hazard difference curve even when the individual causal effects are time-invariant. Therefore, a homogeneous time-varying causal additive effect on the hazard cannot be distinguished from a constant but heterogeneous causal effect. We illustrate this causal issue by studying the effect of chemotherapy on the survival time of patients suffering from carcinoma of the oropharynx using data from a clinical trial. The hazard difference thus cannot be used as an appropriate measure of the causal effect without making untestable assumptions.
Submitted 29 October, 2022;
originally announced October 2022.
-
The built-in selection bias of hazard ratios formalized
Authors:
Richard Post,
Edwin van den Heuvel,
Hein Putter
Abstract:
It is known that the hazard ratio lacks a useful causal interpretation. Even for data from a randomized controlled trial, the hazard ratio suffers from built-in selection bias as, over time, the individuals at risk in the exposed and unexposed groups are no longer exchangeable. In this work, we formalize how the observed hazard ratio evolves and deviates from the causal hazard ratio of interest in the presence of heterogeneity of the hazard of unexposed individuals (frailty) and heterogeneity in effect (individual modification). For the case of effect heterogeneity, we define the causal hazard ratio. We show that the observed hazard ratio equals the ratio of expectations of the latent variables (frailty and modifier) conditionally on survival in the world with and without exposure, respectively. Examples with gamma, inverse Gaussian and compound Poisson distributed frailty, and categorical (harming, beneficial or neutral) effect modifiers are presented for illustration. This set of examples shows that an observed hazard ratio with a particular value can arise for all values of the causal hazard ratio. Therefore, the hazard ratio cannot be used as a measure of the causal effect without making untestable assumptions, stressing the importance of using more appropriate estimands such as contrasts of the survival probabilities.
Submitted 29 October, 2022;
originally announced October 2022.
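The drift of the observed hazard ratio described in the abstract above can be illustrated with the textbook gamma-frailty case. The following is a minimal sketch, not the paper's own code: it assumes a constant baseline hazard, a gamma frailty with mean 1 and variance `theta`, and a homogeneous multiplicative effect `causal_hr`. The function names `marginal_survival` and `observed_hazard_ratio` are hypothetical.

```python
def marginal_survival(t, multiplier, theta, baseline_rate=1.0):
    """Population survival when each individual has hazard Z * multiplier * baseline_rate.

    Z is a gamma frailty with mean 1 and variance theta (an assumption of this
    sketch), so survival is the gamma Laplace transform evaluated at the
    cumulative hazard multiplier * baseline_rate * t.
    """
    cumulative = multiplier * baseline_rate * t
    return (1.0 + theta * cumulative) ** (-1.0 / theta)


def observed_hazard_ratio(t, causal_hr, theta, baseline_rate=1.0):
    """Ratio of the population hazards (exposed over unexposed) at time t.

    The population hazard in each arm is c * baseline_rate / (1 + theta * c * baseline_rate * t),
    with c = causal_hr for the exposed and c = 1 for the unexposed.
    """
    cumulative_baseline = baseline_rate * t
    return causal_hr * (1.0 + theta * cumulative_baseline) / (
        1.0 + theta * causal_hr * cumulative_baseline
    )


if __name__ == "__main__":
    # Individual-level hazard ratio is 2 throughout; the observed ratio shrinks toward 1.
    for t in (0.0, 0.5, 1.0, 5.0, 50.0):
        print(t, round(observed_hazard_ratio(t, causal_hr=2.0, theta=1.0), 4))
```

Under these assumptions the observed ratio equals the individual-level ratio only at time zero and moves toward 1 afterwards, because high-frailty individuals leave the exposed risk set faster than the unexposed one.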
-
Flexible machine learning estimation of conditional average treatment effects: a blessing and a curse
Authors:
Richard Post,
Isabel van den Heuvel,
Marko Petkovic,
Edwin van den Heuvel
Abstract:
Causal inference from observational data requires untestable identification assumptions. If these assumptions apply, machine learning (ML) methods can be used to study complex forms of causal effect heterogeneity. Recently, several ML methods were developed to estimate the conditional average treatment effect (CATE). If the features at hand cannot explain all heterogeneity, the individual treatment effects (ITEs) can seriously deviate from the CATE. In this work, we demonstrate how the distributions of the ITE and the CATE can differ when a causal random forest (CRF) is applied. We extend the CRF to estimate the difference in conditional variance between treated and controls. If the ITE distribution equals the CATE distribution, this estimated difference in variance should be small. If they differ, an additional causal assumption is necessary to quantify the heterogeneity not captured by the CATE distribution. The conditional variance of the ITE can be identified when the individual effect is independent of the outcome under no treatment given the measured features. Then, in the cases where the ITE and CATE distributions differ, the extended CRF can appropriately estimate the variance of the ITE distribution while the CRF fails to do so.
Submitted 20 July, 2023; v1 submitted 29 October, 2022;
originally announced October 2022.
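The variance argument in the abstract above can be sketched as a short derivation. The notation is an assumption introduced here for illustration and is not taken from the paper: $Y^{0}$ and $Y^{1}$ are the potential outcomes, $A$ is the treatment indicator, $X$ denotes the measured features, and the ITE is $Y^{1} - Y^{0}$.

```latex
\begin{align*}
Y^{1} &= Y^{0} + \mathrm{ITE}, \qquad \mathrm{ITE} \perp\!\!\!\perp Y^{0} \mid X \\
\operatorname{Var}(Y^{1} \mid X = x) &= \operatorname{Var}(Y^{0} \mid X = x) + \operatorname{Var}(\mathrm{ITE} \mid X = x) \\
\operatorname{Var}(\mathrm{ITE} \mid X = x) &= \operatorname{Var}(Y \mid A = 1, X = x) - \operatorname{Var}(Y \mid A = 0, X = x)
\end{align*}
```

The last line additionally uses consistency and conditional exchangeability to replace the potential-outcome variances with observable ones. Without the independence assumption, a term $2\operatorname{Cov}(Y^{0}, \mathrm{ITE} \mid X = x)$ would appear in the second line, so the difference in conditional variances alone would not identify the ITE variance.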
-
Individual causal effects from observational longitudinal studies with time-varying exposures
Authors:
Richard Post,
Zhuozhao Zhan,
Edwin van den Heuvel
Abstract:
Causal effects may vary among individuals and can even be of opposite signs. When significant effect heterogeneity exists, the population average causal effect might be uninformative for an individual. Due to the fundamental problem of causality, individual causal effects (ICEs) cannot be retrieved from cross-sectional data. However, in crossover studies, it is accepted that ICEs can be estimated under the assumptions of no carryover effects and time invariance of potential outcomes. A generic potential-outcome formulation with appropriate statistical assumptions to identify ICEs is lacking for other longitudinal data with time-varying exposures. We present a general framework for causal effect heterogeneity in which individual-specific effect modification is parameterized with a latent variable, the receptiveness factor. If the exposure varies over time, then the repeated measurements contain information on an individual's level of this receptiveness factor. Therefore, we study the conditional distribution of the ICE given all of an individual's factual information. This novel conditional random variable is called the cross-world causal effect (CWCE). For known causal structures and time-varying exposures, the variability of the CWCE reduces with an increasing number of repeated measurements. The CWCE becomes identifiable from observational data under the causal assumption of cross-world similarity of individual-effect modification (i.e., there exists an exposure strategy whose effect is affected by all latent causes). We illustrate the theory with examples in which the cause-effect relations can be parameterized as generalized linear mixed assignments.
Submitted 9 December, 2022; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Alexander- and Markov-type theorems for virtual trivalent braids
Authors:
Carmen Caprau,
Abigayle Dirdak,
Rita Post,
Erica Sawyer
Abstract:
We prove Alexander- and Markov-type theorems for virtual spatial trivalent graphs and virtual trivalent braids. We provide two versions for the Markov-type theorem: one uses an algebraic approach similar to the case of classical braids and the other one is based on L-moves.
Submitted 17 December, 2018; v1 submitted 26 April, 2018;
originally announced April 2018.