-
Limit theorems for the site frequency spectrum of neutral mutations in an exponentially growing population
Authors:
Einar Bjarki Gunnarsson,
Kevin Leder,
Xuanming Zhang
Abstract:
The site frequency spectrum (SFS) is a widely used summary statistic of genomic data. Motivated by recent evidence for the role of neutral evolution in cancer, we investigate the SFS of neutral mutations in an exponentially growing population. Using branching process techniques, we establish (first-order) almost sure convergence results for the SFS of a Galton-Watson process, evaluated either at a fixed time or at the stochastic time at which the population first reaches a certain size. We finally use our results to construct consistent estimators for the extinction probability and the effective mutation rate of a birth-death process.
Submitted 12 March, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Dynamics of advantageous mutant spread in spatial death-birth and birth-death Moran models
Authors:
Jasmine Foo,
Einar Bjarki Gunnarsson,
Kevin Leder,
David Sivakoff
Abstract:
The spread of an advantageous mutation through a population is of fundamental interest in population genetics. While the classical Moran model is formulated for a well-mixed population, it has long been recognized that in real-world applications, the population usually has an explicit spatial structure which can significantly influence the dynamics. In the context of cancer initiation in epithelial tissue, several recent works have analyzed the dynamics of advantageous mutant spread on integer lattices, using the biased voter model from particle systems theory. In this spatial version of the Moran model, individuals first reproduce according to their fitness and then replace a neighboring individual. From a biological standpoint, the opposite dynamics, where individuals first die and are then replaced by a neighboring individual according to its fitness, are equally relevant. Here, we investigate this death-birth analogue of the biased voter model. We construct the process mathematically, derive the associated dual process, establish bounds on the survival probability of a single mutant, and prove that the process has an asymptotic shape. We also briefly discuss alternative birth-death and death-birth dynamics, depending on how the mutant fitness advantage affects the dynamics. We show that birth-death and death-birth formulations of the biased voter model are equivalent when fitness affects the former event of each update of the model, whereas the birth-death model is fundamentally different from the death-birth model when fitness affects the latter event.
Submitted 23 September, 2022;
originally announced September 2022.
-
Exact site frequency spectra of neutrally evolving tumors: a transition between power laws reveals a signature of cell viability
Authors:
Einar Bjarki Gunnarsson,
Kevin Leder,
Jasmine Foo
Abstract:
The site frequency spectrum (SFS) is a popular summary statistic of genomic data. While the SFS of a constant-sized population undergoing neutral mutations has been extensively studied in population genetics, the rapidly growing amount of cancer genomic data has attracted interest in the spectrum of an exponentially growing population. Recent theoretical results have generally dealt with special or limiting cases, such as considering only cells with an infinite line of descent, assuming deterministic tumor growth, or taking large-time or large-population limits. In this work, we derive exact expressions for the expected SFS of a cell population that evolves according to a stochastic branching process, first for cells with an infinite line of descent and then for the total population, evaluated either at a fixed time (fixed-time spectrum) or at the stochastic time at which the population reaches a certain size (fixed-size spectrum). We find that while the rate of mutation scales the SFS of the total population linearly, the rates of cell birth and cell death change the shape of the spectrum at the small-frequency end, inducing a transition between a $1/j^2$ power-law spectrum and a $1/j$ spectrum as cell viability decreases. We show that this insight can in principle be used to estimate the ratio between the rate of cell death and cell birth, as well as the mutation rate, using the site frequency spectrum alone. Although the discussion is framed in terms of tumor dynamics, our results apply to any exponentially growing population of individuals undergoing neutral mutations.
Submitted 11 September, 2021; v1 submitted 23 February, 2021;
originally announced February 2021.
-
Spread of premalignant mutant clones and cancer initiation in multilayered tissue
Authors:
Jasmine Foo,
Einar Bjarki Gunnarsson,
Kevin Leder,
Kathleen Storey
Abstract:
Over 80% of human cancers originate from the epithelium, which covers the outer and inner surfaces of organs and blood vessels. In stratified epithelium, the bottom layers are occupied by stem and stem-like cells that continually divide and replenish the upper layers. In this work, we study the spread of premalignant mutant clones and cancer initiation in stratified epithelium using the biased voter model on stacked two-dimensional lattices. Our main result is an estimate of the propagation speed of a premalignant mutant clone, which is asymptotically precise in the cancer-relevant weak-selection limit. We use our main result to study cancer initiation under a two-step mutational model of cancer, which includes computing the distributions of the time of cancer initiation and the size of the premalignant clone giving rise to cancer. Our work quantifies the effect of epithelial tissue thickness on the process of carcinogenesis, thereby contributing to an emerging understanding of the spatial evolutionary dynamics of cancer.
Submitted 26 March, 2022; v1 submitted 7 July, 2020;
originally announced July 2020.