-
Estimating relapse time distribution from longitudinal biomarker trajectories using iterative regression and continuous time Markov processes
Authors:
Alice Cleynen,
Benoîte de Saporta,
Amélie Vernay
Abstract:
Biomarker measurements obtained by blood sampling are often used as a non-invasive means of monitoring tumour progression in cancer patients. Diseases evolve dynamically over time, and studying longitudinal observations of specific biomarkers can help to understand patients response to treatment and predict disease progression. We propose a novel iterative regression-based method to estimate chang…
▽ More
Biomarker measurements obtained by blood sampling are often used as a non-invasive means of monitoring tumour progression in cancer patients. Diseases evolve dynamically over time, and studying longitudinal observations of specific biomarkers can help to understand patients response to treatment and predict disease progression. We propose a novel iterative regression-based method to estimate changes in patients status within a cohort that includes censored patients, and illustrate it on clinical data from myeloma cases. We formulate the relapse time estimation problem in the framework of Piecewise Deterministic Markov processes (PDMP), where the Euclidean component is a surrogate biomarker for patient state. This approach enables continuous-time estimation of the status-change dates, which in turn allows for accurate inference of the relapse time distribution. A key challenge lies in the partial observability of the process, a complexity that has been rarely addressed in previous studies. . We evaluate the performance of our procedure through a simulation study and compare it with different approaches. This work is a proof of concept on biomarker trajectories with simple behaviour, but our method can easily be extended to more complex dynamics.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Bridging Impulse Control of Piecewise Deterministic Markov Processes and Markov Decision Processes: Frameworks, Extensions, and Open Challenges
Authors:
Alice Cleynen,
Benoîte de Saporta,
Orlane Rossini,
Régis Sabbadin,
Amélie Vernay
Abstract:
Control theory plays a pivotal role in understanding and optimizing the behavior of complex dynamical systems across various scientific and engineering disciplines. Two key frameworks that have emerged for modeling and solving control problems in stochastic systems are piecewise deterministic Markov processes (PDMPs) and Markov decision processes (MDPs). Each framework has its unique strengths, an…
▽ More
Control theory plays a pivotal role in understanding and optimizing the behavior of complex dynamical systems across various scientific and engineering disciplines. Two key frameworks that have emerged for modeling and solving control problems in stochastic systems are piecewise deterministic Markov processes (PDMPs) and Markov decision processes (MDPs). Each framework has its unique strengths, and their intersection offers promising opportunities for tackling a broad class of problems, particularly in the context of impulse controls and decision-making in complex systems.
The relationship between PDMPs and MDPs is a natural subject of exploration, as embedding impulse control problems for PDMPs into the MDP framework could open new avenues for their analysis and resolution. Specifically, this integration would allow leveraging the computational and theoretical tools developed for MDPs to address the challenges inherent in PDMPs. On the other hand, PDMPs can offer a versatile and simple paradigm to model continuous time problems that are often described as discrete-time MDPs parametrized by complex transition kernels. This transformation has the potential to bridge the gap between the two frameworks, enabling solutions to previously intractable problems and expanding the scope of both fields. This paper presents a comprehensive review of two research domains, illustrated through a recurring medical example. The example is revisited and progressively formalized within the framework of thevarious concepts and objects introduced
△ Less
Submitted 14 April, 2025; v1 submitted 7 January, 2025;
originally announced January 2025.
-
Estimating parameters of continuous-time multi-chain hidden Markov models for infectious diseases
Authors:
Ibrahim Bouzalmat,
Benoîte de Saporta,
Solym M. Manou-Abi
Abstract:
This study aims to estimate the parameters of a stochastic exposed-infected epidemiological model for the transmission dynamics of notifiable infectious diseases, based on observations related to isolated cases counts only. We use the setting of hidden multi-chain Markov models and adapt the Baum-Welch algorithm to the special structure of the multi-chain. From the estimated transition matrix, we…
▽ More
This study aims to estimate the parameters of a stochastic exposed-infected epidemiological model for the transmission dynamics of notifiable infectious diseases, based on observations related to isolated cases counts only. We use the setting of hidden multi-chain Markov models and adapt the Baum-Welch algorithm to the special structure of the multi-chain. From the estimated transition matrix, we retrieve the parameters of interest (contamination rates, incubation rate, and isolation rate) from analytical expressions of the moments and Monte Carlo simulations. The performance of this approach is investigated on synthetic data, together with an analysis of the impact of using a model with one less compartment to fit the data in order to help for model selection.
△ Less
Submitted 12 April, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Parameter estimation for a hidden linear birth and death process with immigration
Authors:
Ibrahim Bouzalmat,
Benoîte de Saporta,
Solym M. Manou-Abi
Abstract:
In this paper, we use a linear birth and death process with immigration to model infectious disease propagation when contamination stems from both person-to-person contact and contact with the environment. Our aim is to estimate the parameters of the process. The main originality and difficulty comes from the observation scheme. Counts of infected population are hidden. The only data available are…
▽ More
In this paper, we use a linear birth and death process with immigration to model infectious disease propagation when contamination stems from both person-to-person contact and contact with the environment. Our aim is to estimate the parameters of the process. The main originality and difficulty comes from the observation scheme. Counts of infected population are hidden. The only data available are periodic cumulated new retired counts. Although very common in epidemiology, this observation scheme is mathematically challenging even for such a standard stochastic process. We first derive an analytic expression of the unknown parameters as functions of well-chosen discrete time transition probabilities. Second, we extend and adapt the standard Baum-Welch algorithm in order to estimate the said discrete time transition probabilities in our hidden data framework. The performance of our estimators is illustrated both on synthetic data and real data of typhoid fever in Mayotte.
△ Less
Submitted 10 January, 2024; v1 submitted 1 March, 2023;
originally announced March 2023.
-
Investigation of asymmetry in E. coli growth rate
Authors:
Bernard Delyon,
Benoîte de Saporta,
Nathalie Krell,
Lydia Robert
Abstract:
The data we analyze derives from the observation of numerous cells of the bacterium Escherichia coli (E. coli) growing and dividing. Single cells grow and divide to give birth to two daughter cells, that in turn grow and divide. Thus, a colony of cells from a single ancestor is structured as a binary genealogical tree. At each node the measured data is the growth rate of the bacterium. In this pap…
▽ More
The data we analyze derives from the observation of numerous cells of the bacterium Escherichia coli (E. coli) growing and dividing. Single cells grow and divide to give birth to two daughter cells, that in turn grow and divide. Thus, a colony of cells from a single ancestor is structured as a binary genealogical tree. At each node the measured data is the growth rate of the bacterium. In this paper, we study two different data sets. One set corresponds to small complete trees, whereas the other one corresponds to long specific sub-trees. Our aim is to compare both sets. This paper is accessible to post graduate students and readers with advanced knowledge in statistics.
△ Less
Submitted 17 September, 2015;
originally announced September 2015.
-
Statistical study of asymmetry in cell lineage data
Authors:
Benoîte de Saporta,
Anne Gégout Petit,
Laurence Marsalle
Abstract:
A rigorous methodology is proposed to study cell division data consisting in several observed genealogical trees of possibly different shapes. The procedure takes into account missing observations, data from different trees, as well as the dependence structure within genealogical trees. Its main new feature is the joint use of all available information from several data sets instead of single data…
▽ More
A rigorous methodology is proposed to study cell division data consisting in several observed genealogical trees of possibly different shapes. The procedure takes into account missing observations, data from different trees, as well as the dependence structure within genealogical trees. Its main new feature is the joint use of all available information from several data sets instead of single data set estimation, to avoid the drawbacks of low accuracy for estimators or low power for tests on small single-trees. The data is modeled by an asymmetric bifurcating autoregressive process and possibly missing observations are taken into account by modeling the genealogies with a two-type Galton-Watson process. Least-squares estimators of the unknown parameters of the processes are given and symmetry tests are derived. Results are applied on real data of Escherichia coli division and an empirical study of the convergence rates of the estimators and power of the tests is conducted on simulated data.
△ Less
Submitted 12 April, 2013; v1 submitted 22 May, 2012;
originally announced May 2012.