-
Causal inference in survival analysis using longitudinal observational data: Sequential trials and marginal structural models
Authors:
Ruth H. Keogh,
Jon Michael Gran,
Shaun R. Seaman,
Gwyneth Davies,
Stijn Vansteelandt
Abstract:
Longitudinal observational patient data can be used to investigate the causal effects of time-varying treatments on time-to-event outcomes. Several methods have been developed for controlling for the time-dependent confounding that typically occurs. The most commonly used is inverse probability weighted estimation of marginal structural models (MSM-IPTW). An alternative, the sequential trials appr…
▽ More
Longitudinal observational patient data can be used to investigate the causal effects of time-varying treatments on time-to-event outcomes. Several methods have been developed for controlling for the time-dependent confounding that typically occurs. The most commonly used is inverse probability weighted estimation of marginal structural models (MSM-IPTW). An alternative, the sequential trials approach, is increasingly popular, in particular in combination with the target trial emulation framework. This approach involves creating a sequence of `trials' from new time origins, restricting to individuals as yet untreated and meeting other eligibility criteria, and comparing treatment initiators and non-initiators. Individuals are censored when they deviate from their treatment status at the start of each `trial' (initiator/non-initiator) and this is addressed using inverse probability of censoring weights. The analysis is based on data combined across trials. We show that the sequential trials approach can estimate the parameter of a particular MSM, and compare it to a MSM-IPTW with respect to the estimands being identified, the assumptions needed and how data are used differently. We show how both approaches can estimate the same marginal risk differences. The two approaches are compared using a simulation study. The sequential trials approach, which tends to involve less extreme weights than MSM-IPTW, results in greater efficiency for estimating the marginal risk difference at most follow-up times, but this can, in certain scenarios, be reversed at late time points. We apply the methods to longitudinal observational data from the UK Cystic Fibrosis Registry to estimate the effect of dornase alfa on survival.
△ Less
Submitted 6 October, 2021;
originally announced October 2021.
-
A hybrid landmark Aalen-Johansen estimator for transition probabilities in partially non-Markov multi-state models
Authors:
N. Maltzahn,
R. Hoff,
O. O. Aalen,
I. S. Mehlum,
H. Putter,
J. M. Gran
Abstract:
Multi-state models are increasingly being used to model complex epidemiological and clinical outcomes over time. It is common to assume that the models are Markov, but the assumption can often be unrealistic. The Markov assumption is seldomly checked and violations can lead to biased estimation for many parameters of interest. As argued by Datta and Satten (2001), the Aalen-Johansen estimator of o…
▽ More
Multi-state models are increasingly being used to model complex epidemiological and clinical outcomes over time. It is common to assume that the models are Markov, but the assumption can often be unrealistic. The Markov assumption is seldomly checked and violations can lead to biased estimation for many parameters of interest. As argued by Datta and Satten (2001), the Aalen-Johansen estimator of occupation probabilities is consistent also in the non-Markov case. Putter and Spitoni (2018) exploit this fact to construct a consistent estimator of state transition probabilities, the landmark Aalen-Johansen estimator, which does not rely on the Markov assumption. A disadvantage of landmarking is data reduction, leading to a loss of power. This is problematic for less traveled transitions, and undesirable when such transitions indeed exhibit Markov behaviour. Using a framework of partially non-Markov multi-state models we suggest a hybrid landmark Aalen-Johansen estimator for transition probabilities. The proposed estimator is a compromise between regular Aalen-Johansen and landmark estimation, using transition specific landmarking, and can drastically improve statistical power. The methods are compared in a simulation study and in a real data application modelling individual transitions between states of sick leave, disability, education, work and unemployment. In the application, a birth cohort of 184951 Norwegian men are followed for 14 years from the year they turn 21, using data from national registries.
△ Less
Submitted 25 August, 2020; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Simulating longitudinal data from marginal structural models using the additive hazard model
Authors:
Ruth H. Keogh,
Shaun R. Seaman,
Jon Michael Gran,
Stijn Vansteelandt
Abstract:
Observational longitudinal data on treatments and covariates are increasingly used to investigate treatment effects, but are often subject to time-dependent confounding. Marginal structural models (MSMs), estimated using inverse probability of treatment weighting or the g-formula, are popular for handling this problem. With increasing development of advanced causal inference methods, it is importa…
▽ More
Observational longitudinal data on treatments and covariates are increasingly used to investigate treatment effects, but are often subject to time-dependent confounding. Marginal structural models (MSMs), estimated using inverse probability of treatment weighting or the g-formula, are popular for handling this problem. With increasing development of advanced causal inference methods, it is important to be able to assess their performance in different scenarios to guide their application. Simulation studies are a key tool for this, but their use to evaluate causal inference methods has been limited. This paper focuses on the use of simulations for evaluations involving MSMs in studies with a time-to-event outcome. In a simulation, it is important to be able to generate the data in such a way that the correct form of any models to be fitted to those data is known. However, this is not straightforward in the longitudinal setting because it is natural for data to be generated in a sequential conditional manner, whereas MSMs involve fitting marginal rather than conditional hazard models. We provide general results that enable the form of the correctly-specified MSM to be derived based on a conditional data generating procedure, and show how the results can be applied when the conditional hazard model is an Aalen additive hazard or Cox model. Using conditional additive hazard models is advantageous because they imply additive MSMs that can be fitted using standard software. We describe and illustrate a simulation algorithm. Our results will help researchers to effectively evaluate causal inference methods via simulation.
△ Less
Submitted 10 February, 2020;
originally announced February 2020.
-
Estimating the treatment effect on the treated under time-dependent confounding in an application to the Swiss HIV Cohort Study
Authors:
J. M. Gran,
R. Hoff,
K. Røysland,
B. Ledergerber,
J. Young,
O. O. Aalen
Abstract:
When comparing time-varying treatments in a non-randomised setting, one must often correct for time-dependent confounders that influence treatment choice over time and that are themselves influenced by treatment. We present a new two step procedure, based on additive hazard regression and linear increments models, for handling such confounding when estimating average treatment effects on the treat…
▽ More
When comparing time-varying treatments in a non-randomised setting, one must often correct for time-dependent confounders that influence treatment choice over time and that are themselves influenced by treatment. We present a new two step procedure, based on additive hazard regression and linear increments models, for handling such confounding when estimating average treatment effects on the treated (ATT). The approach can also be used for mediation analysis. The method is applied to data from the Swiss HIV Cohort Study, estimating the effect of antiretroviral treatment on time to AIDS or death. Compared to other methods for estimating the ATT, the proposed method is easy to implement using available software packages in R.
△ Less
Submitted 5 October, 2016; v1 submitted 6 April, 2016;
originally announced April 2016.
-
Dynamic models for estimating the effect of HAART on CD4 in observational studies: application to the Aquitaine Cohort study and the Swiss HIV Cohort Study
Authors:
M. Prague,
D. Commenges,
J. M. Gran,
B. Ledergerber,
J. young,
H. Furrer,
R. Thiébaut
Abstract:
Highly active antiretroviral therapy (HAART) has proved efficient in increasing CD4 counts in many randomized clinical trials. Because randomized trials have some limitations (e.g., short duration, highly selected subjects), it is interesting to assess it using observational studies. This is challenging because treatment is started preferentially in subjects with severe conditions, in particular i…
▽ More
Highly active antiretroviral therapy (HAART) has proved efficient in increasing CD4 counts in many randomized clinical trials. Because randomized trials have some limitations (e.g., short duration, highly selected subjects), it is interesting to assess it using observational studies. This is challenging because treatment is started preferentially in subjects with severe conditions, in particular in subjects with low CD4 counts. This general problem had been treated using Marginal Structural Models (MSM) relying on the counterfactual formulation. Another approach to causality is based on dynamical models. First, we present three discrete-time dynamic models based on linear increments (LIM): the simplest model is described by one difference equation for CD4 counts; the second has an equilibrium point; the third model is based on a system of two difference equations which allows jointly modeling CD4 counts and viral load. Then we consider continuous time models based on ordinary differential equations with random effects (ODE-NLME). These mechanistic models allow incorporating biological knowledge when available, which leads to increased power for detecting treatment effect. Inference in ODE-NLME models, however, is challenging from a numerical point of view, and requires specific methods and softwares. LIMs are a valuable intermediary option in terms of consistency, precision and complexity. The different approaches are compared in simulation and applied to HIV cohorts (the ANRS CO3 Aquitaine Cohort and the Swiss HIV Cohort Study).
△ Less
Submitted 20 November, 2015; v1 submitted 30 March, 2015;
originally announced March 2015.