Search | arXiv e-print repository

On the limitations for causal inference in Cox models with time-varying treatment

Authors: Mark B. Knudsen, Erin E. Gabriel, Torben Martinussen, Helene C. W. Rytgaard, Arvid Sjölander

Abstract: When using the Cox model to analyze the effect of a time-varying treatment on a survival outcome, treatment is commonly included, using only the current level as a time-dependent covariate. Such a model does not necessarily assume that past treatment is not associated with the outcome (the Markov property), since it is possible to model the hazard conditional on only the current treatment value. However, modeling the hazard conditional on the full treatment history is required in order to interpret the results causally, and such a full model assumes the Markov property when only including current treatment. This is, for example, common in marginal structural Cox models. We demonstrate that relying on the Markov property is problematic, since it only holds in unrealistic settings or if the treatment has no causal effect. This is the case even if there are no confounders and the true causal effect of treatment really only depends on its current level. Further, we provide an example of a scenario where the Markov property is not fulfilled, but the Cox model that includes only current treatment as a covariate is correctly specified. Transforming the result to the survival scale does not give the true intervention-specific survival probabilities, showcasing that it is unclear how to make causal statements from such models.

Submitted 2 April, 2025; originally announced April 2025.

arXiv:2405.10773 [pdf, ps, other]

Proximal indirect comparison

Authors: Zehao Su, Helene C. W. Rytgaard, Henrik Ravn, Frank Eriksson

Abstract: We consider the problem of indirect comparison, where a treatment arm of interest is absent by design in one randomized controlled trial but available in the other. The former is the target trial, and the latter is the source trial. The identifiability of the target population average treatment effect often relies on conditional transportability assumptions. However, it is a common concern whether all relevant effect modifiers are measured and controlled for. We give a new proximal identification result in the presence of shifted, unobserved effect modifiers based on proxies: an adjustment proxy in both trials and an additional reweighting proxy in the source trial. We propose an estimator which is doubly-robust against misspecifications of the so-called bridge functions and asymptotically normal under mild consistency of estimators for the bridge functions. We use two weight management trials as a context to illustrate selection of proxies and apply our method to compare the weight loss effect of active treatments from these trials.

Submitted 4 June, 2025; v1 submitted 17 May, 2024; originally announced May 2024.

arXiv:2404.11083 [pdf, other]

Estimating conditional hazard functions and densities with the highly-adaptive lasso

Authors: Anders Munch, Thomas A. Gerds, Mark J. van der Laan, Helene C. W. Rytgaard

Abstract: We consider estimation of conditional hazard functions and densities over the class of multivariate càdlàg functions with uniformly bounded sectional variation norm when data are either fully observed or subject to right-censoring. We demonstrate that the empirical risk minimizer is either not well-defined or not consistent for estimation of conditional hazard functions and densities. Under a smoothness assumption about the data-generating distribution, a highly-adaptive lasso estimator based on a particular data-adaptive sieve achieves the same convergence rate as has been shown to hold for the empirical risk minimizer in settings where the latter is well-defined. We use this result to study a highly-adaptive lasso estimator of a conditional hazard function based on right-censored data. We also propose a new conditional density estimator and derive its convergence rate. Finally, we show that the result is of interest also for settings where the empirical risk minimizer is well-defined, because the highly-adaptive lasso depends on a much smaller number of basis functions than the empirical risk minimizer.

Submitted 17 April, 2024; originally announced April 2024.

Comments: 36 pages, 14 figures

MSC Class: 62G05 (primary) 62N02 (secondary)

arXiv:2404.01736 [pdf, other]

Nonparametric efficient causal estimation of the intervention-specific expected number of recurrent events with continuous-time targeted maximum likelihood and highly adaptive lasso estimation

Authors: Helene C. W. Rytgaard, Mark J. van der Laan

Abstract: Longitudinal settings involving outcome, competing risks and censoring events occurring and recurring in continuous time are common in medical research, but are often analyzed with methods that do not allow for taking post-baseline information into account. In this work, we define statistical and causal target parameters via the g-computation formula by carrying out interventions directly on the product integral representing the observed data distribution in a continuous-time counting process model framework. In recurrent events settings our target parameter identifies the expected number of recurrent events also in settings where the censoring mechanism or post-baseline treatment decisions depend on past information of post-baseline covariates such as the recurrent event process. We propose a flexible estimation procedure based on targeted maximum likelihood estimation coupled with highly adaptive lasso estimation to provide a novel approach for double robust and nonparametric inference for the considered target parameter. We illustrate the methods in a simulation study.

Submitted 11 April, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

arXiv:2310.19197 [pdf, other]

concrete: Targeted Estimation of Survival and Competing Risks in Continuous Time

Authors: David Chen, Helene C. W. Rytgaard, Edwin C. H. Fong, Jens M. Tarp, Maya L. Petersen, Mark J. van der Laan, Thomas A. Gerds

Abstract: This article introduces the R package concrete, which implements a recently developed targeted maximum likelihood estimator (TMLE) for the cause-specific absolute risks of time-to-event outcomes measured in continuous time. Cross-validated Super Learner machine learning ensembles are used to estimate propensity scores and conditional cause-specific hazards, which are then targeted to produce robust and efficient plug-in estimates of the effects of static or dynamic interventions on a binary treatment given at baseline quantified as risk differences or risk ratios. Influence curve-based asymptotic inference is provided for TMLE estimates and simultaneous confidence bands can be computed for target estimands spanning multiple times or events. In this paper we review the one-step continuous-time TMLE methodology as it is situated in an overarching causal inference workflow, describe its implementation, and demonstrate the use of the package on the PBC dataset.

Submitted 20 March, 2025; v1 submitted 29 October, 2023; originally announced October 2023.

Comments: 18 pages, 4 figures, submitted to the R Journal

arXiv:2305.10095 [pdf, other]

Nonparametric estimation of the interventional disparity indirect effect among the exposed

Authors: Helene C. W. Rytgaard, Amalie Lykkemark Møller, Thomas A. Gerds

Abstract: In situations with non-manipulable exposures, interventions can be targeted to shift the distribution of intermediate variables between exposure groups to define interventional disparity indirect effects. In this work, we present a theoretical study of identification and nonparametric estimation of the interventional disparity indirect effect among the exposed. The targeted estimand is intended for applications examining the outcome risk among an exposed population for which the risk is expected to be reduced if the distribution of a mediating variable was changed by a (hypothetical) policy or health intervention that targets the exposed population specifically. We derive the nonparametric efficient influence function, study its double robustness properties and present a targeted minimum loss-based estimation (TMLE) procedure. All theoretical results and algorithms are provided for both uncensored and right-censored survival outcomes. Taking the ongoing discussion of the interpretation of non-manipulable exposures as a starting point, we discuss relevant interpretations of the estimand under different sets of assumptions of no unmeasured confounding and provide a comparison of our estimand to other related estimands within the framework of interventional (disparity) effects. Small-sample performance and double robustness properties of our estimation procedure are investigated and illustrated in a simulation study.

Submitted 17 May, 2023; originally announced May 2023.

Comments: 35 pages, 1 figure

arXiv:2107.01537 [pdf, other]

One-step TMLE for targeting cause-specific absolute risks and survival curves

Authors: Helene C. W. Rytgaard, Mark J. van der Laan

Abstract: This paper considers a one-step targeted maximum likelihood estimation method for general competing risks and survival analysis settings where event times take place on the positive real line R+ and are subject to right-censoring. Our interest is overall in the effects of baseline treatment decisions, static, dynamic or stochastic, possibly confounded by pre-treatment covariates. We point out two overall contributions of our work. First, our method can be used to obtain simultaneous inference across all absolute risks in competing risks settings. Second, we present a practical result for achieving inference for the full survival curve, or a full absolute risk curve, across time by targeting over a fine enough grid of points. The one-step procedure is based on a one-dimensional universal least favorable submodel for each cause-specific hazard that can be implemented in recursive steps along a corresponding universal least favorable submodel. We present a theorem for conditions to achieve weak convergence of the estimator for an infinite-dimensional target parameter. Our empirical study demonstrates the use of the methods.

Submitted 1 September, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

Comments: 21 pages (including appendix), 1 figure, 5 tables

arXiv:2106.11009 [pdf, ps, other]

Estimation of time-specific intervention effects on continuously distributed time-to-event outcomes by targeted maximum likelihood estimation

Authors: Helene Charlotte Wiese Rytgaard, Frank Eriksson, Mark van der Laan

Abstract: Targeted maximum likelihood estimation is a general methodology combining flexible ensemble learning and semiparametric efficiency theory in a two-step procedure for estimation of causal parameters. Proposed targeted maximum likelihood procedures for survival and competing risks analysis have so far focused on events taking values in discrete time. We here present a targeted maximum likelihood estimation procedure for event times that take values in R+. We focus on the estimation of intervention-specific mean outcomes with stochastic interventions on a time-fixed treatment. For data-adaptive estimation of nuisance parameters, we propose a new flexible highly adaptive lasso estimation method for continuous-time intensities that can be implemented with L1-penalized Poisson regression. In a simulation study the targeted maximum likelihood estimator based on the highly adaptive lasso estimator proves to be unbiased and achieve proper coverage in agreement with the asymptotic theory and further displays efficiency improvements relative to a Kaplan-Meier approach.

Submitted 21 June, 2021; originally announced June 2021.

arXiv:2104.13028 [pdf, other]

Ranking of average treatment effects with generalized random forests for time-to-event outcomes

Authors: Helene C. W. Rytgaard, Claus T. Ekstrøm, Lars V. Kessing, Thomas A. Gerds

Abstract: In this paper we present a data-adaptive estimation procedure for estimation of average treatment effects in a time-to-event setting based on generalized random forests. In these kinds of settings, the definition of causal effect parameters is complicated by competing risks; here we distinguish between treatment effects on the crude and the net probabilities, respectively. To handle right-censoring, and to switch between crude and net probabilities, we propose a two-step procedure for estimation, applying inverse probability weighting to construct time-point specific weighted outcomes as input for the forest. The forest adaptively handles confounding of the treatment assigned by applying a splitting rule that targets a causal parameter. We demonstrate that our method is effective for a causal search through a list of treatments to be ranked according to the magnitude of their effect. We further apply our method to a dataset from the Danish health registries where it is of interest to discover drugs with an unexpected protective effect against relapse of severe depression.

Submitted 27 April, 2021; originally announced April 2021.

Showing 1–9 of 9 results for author: Rytgaard, H C W