-
Robust and Efficient Optimization Using a Marquardt-Levenberg Algorithm with R Package marqLevAlg
Authors:
Viviane Philipps,
Boris P Hejblum,
Mélanie Prague,
Daniel Commenges,
Cécile Proust-Lima
Abstract:
Implementations in R of classical general-purpose algorithms for local optimization generally have two major limitations which cause difficulties in applications to complex problems: too loose convergence criteria and too long calculation time. By relying on a Marquardt-Levenberg algorithm (MLA), a Newton-like method particularly robust for solving local optimization problems, we provide with marq…
▽ More
Implementations in R of classical general-purpose algorithms for local optimization generally have two major limitations which cause difficulties in applications to complex problems: too loose convergence criteria and too long calculation time. By relying on a Marquardt-Levenberg algorithm (MLA), a Newton-like method particularly robust for solving local optimization problems, we provide with marqLevAlg package an efficient and general-purpose local optimizer which (i) prevents convergence to saddle points by using a stringent convergence criterion based on the relative distance to minimum/maximum in addition to the stability of the parameters and of the objective function; and (ii) reduces the computation time in complex settings by allowing parallel calculations at each iteration. We demonstrate through a variety of cases from the literature that our implementation reliably and consistently reaches the optimum (even when other optimizers fail), and also largely reduces computational time in complex settings through the example of maximum likelihood estimation of different sophisticated statistical models.
△ Less
Submitted 26 November, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Effects of interventions and optimal strategies in the stochastic system approach to causality
Authors:
Daniel Commenges,
Mélanie Prague
Abstract:
We consider the problem of defining the effect of an intervention on a time-varying risk factor or treatment for a disease or a physiological marker; we develop here the latter case. So, the system considered is $(Y,A,C)$, where $Y=(Y_t)$, is the marker process of interest, $A=A_t$ the treatment. A realistic case is that the treatment can be changed only at discrete times. In an observational stud…
▽ More
We consider the problem of defining the effect of an intervention on a time-varying risk factor or treatment for a disease or a physiological marker; we develop here the latter case. So, the system considered is $(Y,A,C)$, where $Y=(Y_t)$, is the marker process of interest, $A=A_t$ the treatment. A realistic case is that the treatment can be changed only at discrete times. In an observational study the treatment attribution law is unknown; however, the physical law can be estimated without knowing the treatment attribution law, provided a well-specified model is available. An intervention is specified by the treatment attribution law, which is thus known. Simple interventions will simply randomize the attribution of the treatment; interventions that take into account the past history will be called "strategies". The effect of interventions can be defined by a risk function $R^{\intr}=\Ee_{\intr}[L(\bar Y_{t_J}, \bar A_{t_{J}},C)]$, where $L(\bar Y_{t_J}, \bar A_{t_{J}},C)$ is a loss function, and contrasts between risk functions for different strategies can be formed. Once we can compute effects for any strategy, we can search for optimal or sub-optimal strategies; in particular we can find optimal parametric strategies. We present several ways for designing strategies. As an illustration, we consider the choice of a strategy for containing the HIV load below a certain level while limiting the treatment burden. A simulation study demonstrates the possibility of finding optimal parametric strategies.
△ Less
Submitted 30 July, 2019;
originally announced July 2019.
-
Causality without potential outcomes and the dynamic approach
Authors:
Daniel Commenges
Abstract:
Several approaches to causal inference from observational studies have been proposed. Since the proposal of Rubin (1974) many works have developed a counterfactual approach to causality, statistically formalized by potential outcomes. Pearl (2000) has put forward a theory of structural causal models which gives an important role to graphical models and do not necessarily use potential outcomes. On…
▽ More
Several approaches to causal inference from observational studies have been proposed. Since the proposal of Rubin (1974) many works have developed a counterfactual approach to causality, statistically formalized by potential outcomes. Pearl (2000) has put forward a theory of structural causal models which gives an important role to graphical models and do not necessarily use potential outcomes. On the other hand, several authors have developed a dynamical approach in line with Granger (1969). We analyze prospective and retrospective causal questions and their different modalities. Following Dawid (2000) we develop criticisms about the potential outcome approach and we show that causal effects can be estimated without potential outcomes: in particular direct computation of the marginal effect can be done by a change of probability measure. Finally, we highlight the need to adopt a dynamic approach to causality through two examples, "truncation by death" and the "obesity paradox".
△ Less
Submitted 3 May, 2019;
originally announced May 2019.
-
Dynamic Modelling of Multivariate Dimensions and Their Temporal Relationships using Latent Processes: Application to Alzheimer's Disease
Authors:
Bachirou O. Taddé,
Hélène Jacqmin-Gadda,
Jean-François Dartigues,
Daniel Commenges,
Cécile Proust-Lima
Abstract:
Alzheimer's disease gradually affects several components including the cerebral dimension with brain atrophies, the cognitive dimension with a decline in various functions and the functional dimension with impairment in the daily living activities. Understanding how such dimensions interconnect is crucial for AD research. However it requires to simultaneously capture the dynamic and multidimension…
▽ More
Alzheimer's disease gradually affects several components including the cerebral dimension with brain atrophies, the cognitive dimension with a decline in various functions and the functional dimension with impairment in the daily living activities. Understanding how such dimensions interconnect is crucial for AD research. However it requires to simultaneously capture the dynamic and multidimensional aspects, and to explore temporal relationships between dimensions. We propose an original dynamic structural model that accounts for all these features. The model defines dimensions as latent processes and combines a multivariate linear mixed model and a system of difference equations to model trajectories and temporal relationships between latent processes in finely discrete time. Dimensions are simultaneously related to their observed (possibly multivariate) markers through nonlinear equations of observation. Parameters are estimated in the maximum likelihood framework enjoying a closed form for the likelihood. We demonstrate in a simulation study that this dynamic model in discrete time benefits the same causal interpretation of temporal relationships as models defined in continuous time as long as the discretization step remains small. The model is then applied to the data of the Alzheimer's Disease Neuroimaging Initiative. Three longitudinal dimensions (cerebral anatomy, cognitive ability and functional autonomy) measured by 6 markers are analyzed and their temporal structure is contrasted between different clinical stages of Alzheimer's disease. Keywords: causality, difference equations, latent process, longitudinal data, mixed models, multivariate data.
△ Less
Submitted 14 October, 2019; v1 submitted 10 June, 2018;
originally announced June 2018.
-
Dealing with death when studying disease or physiological marker: the stochastic system approach to causality
Authors:
Daniel Commenges
Abstract:
The stochastic system approach to causality is applied to situations where the risk of death is not negligible. This approach grounds causality on physical laws, distinguishes system and observation and represents the system by multivariate stochastic processes. The particular role of death is highlighted, and it is shown that local influences must be defined on the random horizon of time of death…
▽ More
The stochastic system approach to causality is applied to situations where the risk of death is not negligible. This approach grounds causality on physical laws, distinguishes system and observation and represents the system by multivariate stochastic processes. The particular role of death is highlighted, and it is shown that local influences must be defined on the random horizon of time of death. We particularly study the problem of estimating the effect of a factor $V$ on a process of interest $Y$, taking death into account. We unify the cases where $Y$ is a counting process (describing an event) and the case where $Y$ is quantitative; we examine the case of observations in continuous and discrete time and we give a typology of cases where the mechanism leading to incomplete data can be ignored. Finally, we give an example of a situation where we are interested in estimating the effect of a factor (blood pressure) on cognitive ability in elderly.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
Modeling CD4+ T cells dynamics in HIV-infected patients receiving repeated cycles of exogenous Interleukin 7
Authors:
Ana Jarne,
Daniel Commenges,
Mélanie Prague,
Yves Levy,
Rodolphe Thiébaut
Abstract:
Combination Antiretroviral Therapy (cART) succeeds to control viral replication in most HIV infected patients. This is normally followed by a reconstitution of the CD4$^+$ T cells pool; however, this does not happen for a substantial proportion of patients. For these patients, an immunotherapy based on injections of Interleukin 7 (IL-7) has been recently proposed as a co-adjutant treatment in the…
▽ More
Combination Antiretroviral Therapy (cART) succeeds to control viral replication in most HIV infected patients. This is normally followed by a reconstitution of the CD4$^+$ T cells pool; however, this does not happen for a substantial proportion of patients. For these patients, an immunotherapy based on injections of Interleukin 7 (IL-7) has been recently proposed as a co-adjutant treatment in the hope of obtaining long-term reconstitution of the T cells pool. Several questions arise as to the long-term efficiency of this treatment and the best protocol to apply.
We develop a model based on a system of ordinary differential equations and a statistical model of variability and measurement. We can estimate key parameters of this model using the data from INSPIRE, INSPIRE 2 $\&$ INSPIRE 3 trials. In all three studies, cycles of three injections have been administered; in the last two studies, for the first time, repeated cycles of exogenous IL-7 have been administered. Our aim was to estimate the possible different effects of successive injections in a cycle, to estimate the effect of repeated cycles and to assess different protocols.
The use of dynamical models together with our complex statistical approach allow us to analyze major biological questions. We found a strong effect of IL-7 injections on the proliferation rate; however, the effect of the third injection of the cycle appears to be much weaker than the first ones. Also, despite a slightly weaker effect of repeated cycles with respect to the initial one, our simulations show the ability of this treatment of maintaining adequate CD4$^+$ T cells count for years. We were also able to compare different protocols, showing that cycles of two injections should be sufficient in most cases. %Finally, we also explore the possibility of adaptive protocols.
△ Less
Submitted 17 February, 2016;
originally announced February 2016.
-
Dynamic models for estimating the effect of HAART on CD4 in observational studies: application to the Aquitaine Cohort study and the Swiss HIV Cohort Study
Authors:
M. Prague,
D. Commenges,
J. M. Gran,
B. Ledergerber,
J. young,
H. Furrer,
R. Thiébaut
Abstract:
Highly active antiretroviral therapy (HAART) has proved efficient in increasing CD4 counts in many randomized clinical trials. Because randomized trials have some limitations (e.g., short duration, highly selected subjects), it is interesting to assess it using observational studies. This is challenging because treatment is started preferentially in subjects with severe conditions, in particular i…
▽ More
Highly active antiretroviral therapy (HAART) has proved efficient in increasing CD4 counts in many randomized clinical trials. Because randomized trials have some limitations (e.g., short duration, highly selected subjects), it is interesting to assess it using observational studies. This is challenging because treatment is started preferentially in subjects with severe conditions, in particular in subjects with low CD4 counts. This general problem had been treated using Marginal Structural Models (MSM) relying on the counterfactual formulation. Another approach to causality is based on dynamical models. First, we present three discrete-time dynamic models based on linear increments (LIM): the simplest model is described by one difference equation for CD4 counts; the second has an equilibrium point; the third model is based on a system of two difference equations which allows jointly modeling CD4 counts and viral load. Then we consider continuous time models based on ordinary differential equations with random effects (ODE-NLME). These mechanistic models allow incorporating biological knowledge when available, which leads to increased power for detecting treatment effect. Inference in ODE-NLME models, however, is challenging from a numerical point of view, and requires specific methods and softwares. LIMs are a valuable intermediary option in terms of consistency, precision and complexity. The different approaches are compared in simulation and applied to HIV cohorts (the ANRS CO3 Aquitaine Cohort and the Swiss HIV Cohort Study).
△ Less
Submitted 20 November, 2015; v1 submitted 30 March, 2015;
originally announced March 2015.
-
Modeling the dynamics of biomarkers during primary HIV infection taking into account the uncertainty of infection date
Authors:
J. Drylewicz,
J. Guedj,
D. Commenges,
R. Thiébaut
Abstract:
During primary HIV infection, the kinetics of plasma virus concentrations and CD4+ cell counts is very complex. Parametric and nonparametric models have been suggested for fitting repeated measurements of these markers. Alternatively, mechanistic approaches based on ordinary differential equations have also been proposed. These latter models are constructed according to biological knowledge and ta…
▽ More
During primary HIV infection, the kinetics of plasma virus concentrations and CD4+ cell counts is very complex. Parametric and nonparametric models have been suggested for fitting repeated measurements of these markers. Alternatively, mechanistic approaches based on ordinary differential equations have also been proposed. These latter models are constructed according to biological knowledge and take into account the complex nonlinear interactions between viruses and cells. However, estimating the parameters of these models is difficult. A main difficulty in the context of primary HIV infection is that the date of infection is generally unknown. For some patients, the date of last negative HIV test is available in addition to the date of first positive HIV test (seroconverters). In this paper we propose a likelihood-based method for estimating the parameters of dynamical models using a population approach and taking into account the uncertainty of the infection date. We applied this method to a sample of 761 HIV-infected patients from the Concerted Action on SeroConversion to AIDS and Death in Europe (CASCADE).
△ Less
Submitted 6 January, 2011;
originally announced January 2011.
-
Extending The Range of Application of Permutation Tests: the Expected Permutation p-value Approach
Authors:
Daniel Commenges
Abstract:
The limitation of permutation tests is that they assume exchangeability. It is shown that in generalized linear models one can construct permutation tests from score statistics in particular cases. When under the null hypothesis the observations are not exchangeable, a representation in terms of Cox-Snell residuals allows to develop an approach based on an expected permutation p-value (Eppv); th…
▽ More
The limitation of permutation tests is that they assume exchangeability. It is shown that in generalized linear models one can construct permutation tests from score statistics in particular cases. When under the null hypothesis the observations are not exchangeable, a representation in terms of Cox-Snell residuals allows to develop an approach based on an expected permutation p-value (Eppv); this is applied to the logistic regression model. A small simulation study and an illustration with real data are given.
△ Less
Submitted 4 March, 2010;
originally announced March 2010.
-
Estimating a difference between Kullback-Leibler risks by a normalized difference of AIC
Authors:
D. Commenges,
A. Sayyareh,
L. Letenneur,
J. Guedj,
A. Bar-Hen
Abstract:
AIC is commonly used for model selection but the precise value of AIC has no direct interpretation. We are interested in quantifying a difference of risks between two models. This may be useful for both an explanatory point of view or for prediction, where a simpler model may be preferred if it does nearly as well as a more complex model. The difference of risks can be interpreted by linking the…
▽ More
AIC is commonly used for model selection but the precise value of AIC has no direct interpretation. We are interested in quantifying a difference of risks between two models. This may be useful for both an explanatory point of view or for prediction, where a simpler model may be preferred if it does nearly as well as a more complex model. The difference of risks can be interpreted by linking the risks with relative errors in the computation of probabilities and looking at the values obtained for simple models. A scale of values going from negligible to large is proposed. We propose a normalization of a difference of Akaike criteria for estimating the difference of expected Kullback-Leibler risks between maximum likelihood estimators of the distribution in two different models. The variability of this statistic can be estimated. Thus, an interval can be constructed which contains the true difference of expected Kullback-Leibler risks with a pre-specified probability. A simulation study shows that the method works and it is illustrated on two examples. The first is a study of the relationship between body-mass index and depression in elderly people. The second is the choice between models of HIV dynamics, where one model makes the distinction between activated CD4+ T lymphocytes and the other does not.
△ Less
Submitted 25 July, 2008;
originally announced July 2008.
-
Bivariate linear mixed models using SAS proc MIXED
Authors:
Rodolphe Thiébaut,
Hélène Jacqmin-Gadda,
Geneviève Chêne,
Catherine Leport,
Daniel Commenges
Abstract:
Bivariate linear mixed models are useful when analyzing longitudinal data of two associated markers. In this paper, we present a bivariate linear mixed model including random effects or first-order auto-regressive process and independent measurement error for both markers. Codes and tricks to fit these models using SAS Proc MIXED are provided. Limitations of this program are discussed and an exa…
▽ More
Bivariate linear mixed models are useful when analyzing longitudinal data of two associated markers. In this paper, we present a bivariate linear mixed model including random effects or first-order auto-regressive process and independent measurement error for both markers. Codes and tricks to fit these models using SAS Proc MIXED are provided. Limitations of this program are discussed and an example in the field of HIV infection is shown. Despite some limitations, SAS Proc MIXED is a useful tool that may be easily extendable to multivariate response in longitudinal studies.
△ Less
Submitted 4 May, 2007;
originally announced May 2007.