-
Stochastic dynamics at the back of a gene drive propagation wave
Authors:
Léna Kläy,
Léo Girardin,
Florence Débarre,
Vincent Calvez
Abstract:
Gene drive alleles bias their own inheritance to offspring. They can fix in a wild-type population in spite of a fitness cost, and even lead to the eradication of the target population if the fitness cost is high. However, this outcome may be prevented or delayed if areas previously cleared by the drive are recolonised by wild-type individuals. Here, we investigate the conditions under which these…
▽ More
Gene drive alleles bias their own inheritance to offspring. They can fix in a wild-type population in spite of a fitness cost, and even lead to the eradication of the target population if the fitness cost is high. However, this outcome may be prevented or delayed if areas previously cleared by the drive are recolonised by wild-type individuals. Here, we investigate the conditions under which these stochastic wild-type recolonisation events are likely and when they are unlikely to occur in one spatial dimension. More precisely, we examine the conditions ensuring that the last individual carrying a wild-type allele is surrounded by a large enough number of drive homozygous individuals, resulting in a very low chance of wild-type recolonisation. To do so, we make a deterministic approximation of the distribution of drive alleles within the wave, and we split the distribution of wild-type alleles into a deterministic part and a stochastic part. Our analytical and numerical results suggest that the probability of wild-type recolonisation events increases with lower fitness of drive individuals, with smaller migration rate, and also with smaller local carrying capacity. Numerical simulations show that these results extend to two spatial dimensions. We also demonstrate that, if a wild-type recolonisation event were to occur, the probability of a following drive reinvasion event decreases with smaller values of the intrinsic growth rate of the population. Overall, our study paves the way for further analysis of wild-type recolonisation at the back of eradication traveling waves.
△ Less
Submitted 28 February, 2025;
originally announced February 2025.
-
No evidence of systematic proximity ascertainment bias in early COVID-19 cases in Wuhan Reply to Weissman (2024)
Authors:
Florence Débarre,
Michael Worobey
Abstract:
In a short text published as Letter to the Editor of the Journal of the Royal Statistical Society Series A, Weissman (2024) argues that the finding that early COVID-19 cases without an ascertained link to Wuhan's Huanan Seafood Wholesale market resided on average closer to the market than cases epidemiologically linked to it, reveals "major proximity ascertainment bias". Here we show that Weissman…
▽ More
In a short text published as Letter to the Editor of the Journal of the Royal Statistical Society Series A, Weissman (2024) argues that the finding that early COVID-19 cases without an ascertained link to Wuhan's Huanan Seafood Wholesale market resided on average closer to the market than cases epidemiologically linked to it, reveals "major proximity ascertainment bias". Here we show that Weissman's conclusion is based on a flawed premise, and that there is no such "internal evidence" of major bias. The pattern can indeed be explained by places of infection not being limited to residential neighbourhoods, and by stochasticity -- i.e., without requiring any ascertainment bias.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Confirmation of the centrality of the Huanan market among early COVID-19 cases
Authors:
Florence Débarre,
Michael Worobey
Abstract:
The centrality of Wuhan's Huanan market in maps of December 2019 COVID-19 case residential locations, established by Worobey et al. (2022a), has recently been challenged by Stoyan and Chiu (2024, SC2024). SC2024 proposed a statistical test based on the premise that the measure of central tendency (hereafter, "centre") of a sample of case locations must coincide with the exact point from which loca…
▽ More
The centrality of Wuhan's Huanan market in maps of December 2019 COVID-19 case residential locations, established by Worobey et al. (2022a), has recently been challenged by Stoyan and Chiu (2024, SC2024). SC2024 proposed a statistical test based on the premise that the measure of central tendency (hereafter, "centre") of a sample of case locations must coincide with the exact point from which local transmission began. Here we show that this premise is erroneous. SC2024 put forward two alternative centres (centroid and mode) to the centre-point which was used by Worobey et al. for some analyses, and proposed a bootstrapping method, based on their premise, to test whether a particular location is consistent with it being the point source of transmission. We show that SC2024's concerns about the use of centre-points are inconsequential, and that use of centroids for these data is inadvisable. The mode is an appropriate, even optimal, choice as centre; however, contrary to SC2024's results, we demonstrate that with proper implementation of their methods, the mode falls at the entrance of a parking lot at the market itself, and the 95% confidence region around the mode includes the market. Thus, the market cannot be rejected as central even by SC2024's overly stringent statistical test. Our results directly contradict SC2024's and -- together with myriad additional lines of evidence overlooked by SC2024, including crucial epidemiological information -- point to the Huanan market as the early epicentre of the COVID-19 pandemic.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Pulled, pushed or failed: the demographic impact of a gene drive can change the nature of its spatial spread
Authors:
Léna Kläy,
Léo Girardin,
Vincent Calvez,
Florence Débarre
Abstract:
Understanding the temporal spread of gene drive alleles -- alleles that bias their own transmission -- through modeling is essential before any field experiments. In this paper, we present a deterministic reaction-diffusion model describing the interplay between demographic and allelic dynamics, in a one-dimensional spatial context. We focused on the traveling wave solutions, and more specifically…
▽ More
Understanding the temporal spread of gene drive alleles -- alleles that bias their own transmission -- through modeling is essential before any field experiments. In this paper, we present a deterministic reaction-diffusion model describing the interplay between demographic and allelic dynamics, in a one-dimensional spatial context. We focused on the traveling wave solutions, and more specifically, on the speed of gene drive invasion (if successful). We considered various timings of gene conversion (in the zygote or in the germline) and different probabilities of gene conversion (instead of assuming 100$\%$ conversion as done in a previous work). We compared the types of propagation when the intrinsic growth rate of the population takes extreme values, either very large or very low. When it is infinitely large, the wave can be either successful or not, and, if successful, it can be either pulled or pushed, in agreement with previous studies (extended here to the case of partial conversion). In contrast, it cannot be pushed when the intrinsic growth rate is vanishing. In this case, analytical results are obtained through an insightful connection with an epidemiological SI model. We conducted extensive numerical simulations to bridge the gap between the two regimes of large and low growth rate. We conjecture that, if it is pulled in the two extreme regimes, then the wave is always pulled, and the wave speed is independent of the growth rate. This occurs for instance when the fitness cost is small enough, or when there is stable coexistence of the drive and the wild-type in the population after successful drive invasion. Our model helps delineate the conditions under which demographic dynamics can affect the spread of a gene drive.
△ Less
Submitted 31 October, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Demographic feedbacks can hamper the spatial spread of a gene drive
Authors:
Léo Girardin,
Florence Débarre
Abstract:
This paper is concerned with a reaction--diffusion system modeling the fixation and the invasion in a population of a gene drive (an allele biasing inheritance, increasing its own transmission to offspring). In our model, the gene drive has a negative effect on the fitness of individuals carrying it, and is therefore susceptible of decreasing the total carrying capacity of the population locally i…
▽ More
This paper is concerned with a reaction--diffusion system modeling the fixation and the invasion in a population of a gene drive (an allele biasing inheritance, increasing its own transmission to offspring). In our model, the gene drive has a negative effect on the fitness of individuals carrying it, and is therefore susceptible of decreasing the total carrying capacity of the population locally in space. This tends to generate an opposing demographic advection that the gene drive has to overcome in order to invade. While previous reaction--diffusion models neglected this aspect, here we focus on it and try to predict the sign of the traveling wave speed. It turns out to be an analytical challenge, only partial results being within reach, and we complete our theoretical analysis by numerical simulations. Our results indicate that taking into account the interplay between population dynamics and population genetics might actually be crucial, as it can effectively reverse the direction of the invasion and lead to failure. Our findings can be extended to other bistable systems, such as the spread of cytoplasmic incompatibilities caused by Wolbachia.
△ Less
Submitted 30 September, 2021; v1 submitted 27 January, 2021;
originally announced January 2021.
-
The stochastic dynamics of early epidemics: probability of establishment, initial growth rate, and infection cluster size at first detection
Authors:
Peter Czuppon,
Emmanuel Schertzer,
François Blanquart,
Florence Débarre
Abstract:
Emerging epidemics and local infection clusters are initially prone to stochastic effects that can substantially impact the epidemic trajectory. While numerous studies are devoted to the deterministic regime of an established epidemic, mathematical descriptions of the initial phase of epidemic growth are comparatively rarer. Here, we review existing mathematical results on the epidemic size over t…
▽ More
Emerging epidemics and local infection clusters are initially prone to stochastic effects that can substantially impact the epidemic trajectory. While numerous studies are devoted to the deterministic regime of an established epidemic, mathematical descriptions of the initial phase of epidemic growth are comparatively rarer. Here, we review existing mathematical results on the epidemic size over time, and derive new results to elucidate the early dynamics of an infection cluster started by a single infected individual. We show that the initial growth of epidemics that eventually take off is accelerated by stochasticity. These results are critical to improve early cluster detection and control. As an application, we compute the distribution of the first detection time of an infected individual in an infection cluster depending on the testing effort, and estimate that the SARS-CoV-2 variant of concern Alpha detected in September 2020 first appeared in the United Kingdom early August 2020. We also compute a minimal testing frequency to detect clusters before they exceed a given threshold size. These results improve our theoretical understanding of early epidemics and will be useful for the study and control of local infectious disease clusters.
△ Less
Submitted 17 September, 2021; v1 submitted 17 November, 2020;
originally announced November 2020.
-
The split-and-drift random graph, a null model for speciation
Authors:
François Bienvenu,
Florence Débarre,
Amaury Lambert
Abstract:
We introduce a new random graph model motivated by biological questions relating to speciation. This random graph is defined as the stationary distribution of a Markov chain on the space of graphs on $\{1, \ldots, n\}$. The dynamics of this Markov chain is governed by two types of events: vertex duplication, where at constant rate a pair of vertices is sampled uniformly and one of these vertices l…
▽ More
We introduce a new random graph model motivated by biological questions relating to speciation. This random graph is defined as the stationary distribution of a Markov chain on the space of graphs on $\{1, \ldots, n\}$. The dynamics of this Markov chain is governed by two types of events: vertex duplication, where at constant rate a pair of vertices is sampled uniformly and one of these vertices loses its incident edges and is rewired to the other vertex and its neighbors; and edge removal, where each edge disappears at constant rate. Besides the number of vertices $n$, the model has a single parameter $r_n$.
Using a coalescent approach, we obtain explicit formulas for the first moments of several graph invariants such as the number of edges or the number of complete subgraphs of order $k$. These are then used to identify five non-trivial regimes depending on the asymptotics of the parameter $r_n$. We derive an explicit expression for the degree distribution, and show that under appropriate rescaling it converges to classical distributions when the number of vertices goes to infinity. Finally, we give asymptotic bounds for the number of connected components, and show that in the sparse regime the number of edges is Poissonian.
△ Less
Submitted 17 March, 2018; v1 submitted 3 June, 2017;
originally announced June 2017.
-
The availability of research data declines rapidly with article age
Authors:
Timothy Vines,
Arianne Albert,
Rose Andrew,
Florence Debarré,
Dan Bock,
Michelle Franklin,
Kimberley Gilbert,
Jean-Sébastien Moore,
Sébastien Renaut,
Diana J. Rennison
Abstract:
Policies ensuring that research data are available on public archives are increasingly being implemented at the government [1], funding agency [2-4], and journal [5,6] level. These policies are predicated on the idea that authors are poor stewards of their data, particularly over the long term [7], and indeed many studies have found that authors are often unable or unwilling to share their data [8…
▽ More
Policies ensuring that research data are available on public archives are increasingly being implemented at the government [1], funding agency [2-4], and journal [5,6] level. These policies are predicated on the idea that authors are poor stewards of their data, particularly over the long term [7], and indeed many studies have found that authors are often unable or unwilling to share their data [8-11]. However, there are no systematic estimates of how the availability of research data changes with time since publication. We therefore requested datasets from a relatively homogenous set of 516 articles published between 2 and 22 years ago, and found that availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a dataset being extant fell by 17% per year. In addition, the odds that we could find a working email address for the first, last or corresponding author fell by 7% per year. Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.
△ Less
Submitted 19 December, 2013;
originally announced December 2013.