-
Large-sample analysis of cost functionals for inference under the coalescent
Authors:
Martina Favero,
Jere Koskela
Abstract:
The coalescent is a foundational model of latent genealogical trees under neutral evolution, but suffers from intractable sampling probabilities. Methods for approximating these sampling probabilities either introduce bias or fail to scale to large sample sizes. We show that a class of cost functionals of the coalescent with recurrent mutation and a finite number of alleles converge to tractable p…
▽ More
The coalescent is a foundational model of latent genealogical trees under neutral evolution, but suffers from intractable sampling probabilities. Methods for approximating these sampling probabilities either introduce bias or fail to scale to large sample sizes. We show that a class of cost functionals of the coalescent with recurrent mutation and a finite number of alleles converge to tractable processes in the infinite-sample limit. A particular choice of costs yields insight about importance sampling methods, which are a classical tool for coalescent sampling probability approximation. These insights reveal that the behaviour of coalescent importance sampling algorithms differs markedly from standard sequential importance samplers, with or without resampling. We conduct a simulation study to verify that our asymptotics are accurate for algorithms with finite (and moderate) sample sizes. Our results also facilitate the a priori optimisation of computational resource allocation for coalescent sequential importance sampling. We do not observe the same behaviour for importance sampling methods under the infinite sites model of mutation, which is regarded as a good and more tractable approximation of finite alleles mutation in most respects.
△ Less
Submitted 8 December, 2024;
originally announced December 2024.
-
Sideward contact tracing in an epidemic model with mixing groups
Authors:
Dongni Zhang,
Martina Favero
Abstract:
We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the…
▽ More
We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the early phase of the epidemic is approximated by a branching process with sibling dependencies. To address the challenges given by the dependencies, we consider sibling groups (individuals who become infected at the same event) as macro-individuals and define a macro-branching process. This allows us to derive an expression for the effective macro-reproduction number which corresponds to the effective individual reproduction number and represents a threshold for the behaviour of the epidemic. Through numerical examples, we show how the reproduction number varies with the distribution of the mixing event size, the mean size, the rate of diagnosis and the tracing probability.
△ Less
Submitted 26 March, 2025; v1 submitted 16 July, 2024;
originally announced July 2024.
-
Sampling probabilities, diffusions, ancestral graphs, and duality under strong selection
Authors:
Martina Favero,
Paul A. Jenkins
Abstract:
Wright-Fisher diffusions and their dual ancestral graphs occupy a central role in the study of allele frequency change and genealogical structure, and they provide expressions, explicit in some special cases but generally implicit, for the sampling probability, a crucial quantity in inference. Under a finite-allele mutation model, with possibly parent-dependent mutation, we consider the asymptotic…
▽ More
Wright-Fisher diffusions and their dual ancestral graphs occupy a central role in the study of allele frequency change and genealogical structure, and they provide expressions, explicit in some special cases but generally implicit, for the sampling probability, a crucial quantity in inference. Under a finite-allele mutation model, with possibly parent-dependent mutation, we consider the asymptotic regime where the selective advantage of one allele grows to infinity, while the other parameters remain fixed. In this regime, we show that the Wright-Fisher diffusion can be approximated either by a Gaussian process or by a process whose components are independent continuous-state branching processes with immigration, aligning with analogous results for Wright-Fisher models but employing different methods. While the first process becomes degenerate at stationarity, the latter does not and provides a simple, analytic approximation for the leading term of the sampling probability. Furthermore, using another approach based on a recursion formula, we characterise all remaining terms to provide a full asymptotic expansion for the sampling probability. Finally, we study the asymptotic behaviour of the rates of the block-counting process of the conditional ancestral selection graph and establish an asymptotic duality relationship between this and the diffusion.
△ Less
Submitted 13 March, 2025; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Modelling preventive measures and their effect on generation times in emerging epidemics
Authors:
Martina Favero,
Gianpaolo Scalia Tomba,
Tom Britton
Abstract:
We present a stochastic epidemic model to study the effect of various preventive measures, such as uniform reduction of contacts and transmission, vaccination, isolation, screening and contact tracing, on a disease outbreak in a homogeneously mixing community. The model is based on an infectivity process, which we define through stochastic contact and infectiousness processes, so that each individ…
▽ More
We present a stochastic epidemic model to study the effect of various preventive measures, such as uniform reduction of contacts and transmission, vaccination, isolation, screening and contact tracing, on a disease outbreak in a homogeneously mixing community. The model is based on an infectivity process, which we define through stochastic contact and infectiousness processes, so that each individual has an independent infectivity profile. In particular, we monitor variations of the reproduction number and of the distribution of generation times. We show that some interventions, i.e. uniform reduction and vaccination, affect the former while leaving the latter unchanged, whereas other interventions, i.e. isolation, screening and contact tracing, affect both quantities. We provide a theoretical analysis of the variation of these quantities, and we show that, in practice, the variation of the generation time distribution can be significant and that it can cause biases in the estimation of reproduction numbers. The framework, because of its general nature, captures the properties of many infectious diseases, but particular emphasis is on COVID-19, for which numerical results are provided.
△ Less
Submitted 7 July, 2022; v1 submitted 24 January, 2022;
originally announced January 2022.
-
Estimates of the proportion of SARS-CoV-2 infected individuals in Sweden
Authors:
Henrik Hult,
Martina Favero
Abstract:
In this paper a Bayesian SEIR model is studied to estimate the proportion of the population infected with SARS-CoV-2, the virus responsible for COVID-19. To capture heterogeneity in the population and the effect of interventions to reduce the rate of epidemic spread, the model uses a time-varying contact rate, whose logarithm has a Gaussian process prior. A Poisson point process is used to model t…
▽ More
In this paper a Bayesian SEIR model is studied to estimate the proportion of the population infected with SARS-CoV-2, the virus responsible for COVID-19. To capture heterogeneity in the population and the effect of interventions to reduce the rate of epidemic spread, the model uses a time-varying contact rate, whose logarithm has a Gaussian process prior. A Poisson point process is used to model the occurrence of deaths due to COVID-19 and the model is calibrated using data of daily death counts in combination with a snapshot of the the proportion of individuals with an active infection, performed in Stockholm in late March. The methodology is applied to regions in Sweden. The results show that the estimated proportion of the population who has been infected is around 13.5% in Stockholm, by 2020-05-15, and ranges between 2.5% - 15.6% in the other investigated regions. In Stockholm where the peak of daily death counts is likely behind us, parameter uncertainty does not heavily influence the expected daily number of deaths, nor the expected cumulative number of deaths. It does, however, impact the estimated cumulative number of infected individuals. In the other regions, where random sampling of the number of active infections is not available, parameter sharing is used to improve estimates, but the parameter uncertainty remains substantial.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
The oral tolerance as a complex network phenomenon
Authors:
Pedro J. Miranda,
Murilo Delgobo,
Giovanni M. Favero,
Kátia S. Paludo,
Murilo S. Baptista,
Sandro E. de S. Pinto
Abstract:
The phenomenon of oral tolerance refers to a local and systemic state of tolerance, induced in the gut associated lymphoid tissues, after its exposure to innocuous antigens, such as food proteins. While recent findings shed light in the cellular and molecular basis of oral tolerance, the network of interactions between the components mediating oral tolerance has not been investigated yet. Our work…
▽ More
The phenomenon of oral tolerance refers to a local and systemic state of tolerance, induced in the gut associated lymphoid tissues, after its exposure to innocuous antigens, such as food proteins. While recent findings shed light in the cellular and molecular basis of oral tolerance, the network of interactions between the components mediating oral tolerance has not been investigated yet. Our work brings a complex systems theory approach, aiming to identify the contribution of each element in an oral tolerance network. We also propose a model that allows dynamical plus topological quantifying which must encompass functional responses as the local host involved on the oral tolerance. To keep track of reality of our model, we test knockout (KO) of immunological components (i. e. silencing a vertex) and see how it diverges when the system is topologically health. The results from these simulated KO's are then compared to real molecular knock-outs. To infer from these processing we apply a new implementation of a random walk algorithm for directed graphs, which ultimately generate statistical quantities provided by the dynamical behaviour of the simulated KO's. It was observed that the a specifics KO caused the greatest impact on network standard flux. In a brief analysis, the results obtained correspond to biological data. Our model addresses both topological proprieties and dynamical relations. The construction of a qualitative dynamic model for oral tolerance could reflect empirical observations, through the standard flux results and relative error based on individual knockout.
△ Less
Submitted 13 November, 2013;
originally announced November 2013.