Showing 1–13 of 13 results for author: Baele, G

Searching in archive stat.
  1. arXiv:2412.06042

    q-bio.PE stat.ME

    Infinite Mixture Models for Improved Modeling of Across-Site Evolutionary Variation

    Authors: Mandev S. Gill, Guy Baele, Marc A. Suchard, Philippe Lemey

    Abstract: Scientific studies in many areas of biology routinely employ evolutionary analyses based on the probabilistic inference of phylogenetic trees from molecular sequence data. Evolutionary processes that act at the molecular level are highly variable, and properly accounting for heterogeneity in evolutionary processes is crucial for more accurate phylogenetic inference. Nucleotide substitution rates a…

    Submitted 8 December, 2024; originally announced December 2024.

  2. arXiv:2303.13642

    q-bio.PE stat.CO

    Random-effects substitution models for phylogenetics via scalable gradient approximations

    Authors: Andrew F. Magee, Andrew J. Holbrook, Jonathan E. Pekar, Itzue W. Caviedes-Solis, Frederick A. Matsen IV, Guy Baele, Joel O. Wertheim, Xiang Ji, Philippe Lemey, Marc A. Suchard

    Abstract: Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitut…

    Submitted 25 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  3. arXiv:2303.04390

    stat.CO q-bio.PE

    Many-core algorithms for high-dimensional gradients on phylogenetic trees

    Authors: Karthik Gangavarapu, Xiang Ji, Guy Baele, Mathieu Fourment, Philippe Lemey, Frederick A. Matsen IV, Marc A. Suchard

    Abstract: The rapid growth in genomic pathogen data spurs the need for efficient inference techniques, such as Hamiltonian Monte Carlo (HMC) in a Bayesian framework, to estimate parameters of these phylogenetic models where the dimensions of the parameters increase with the number of sequences $N$. HMC requires repeated calculation of the gradient of the data log-likelihood with respect to (wrt) all branch-…

    Submitted 8 March, 2023; originally announced March 2023.

  4. arXiv:2110.13298

    q-bio.PE stat.CO

    Scalable Bayesian divergence time estimation with ratio transformations

    Authors: Xiang Ji, Alexander A. Fisher, Shuo Su, Jeffrey L. Thorne, Barney Potter, Philippe Lemey, Guy Baele, Marc A. Suchard

    Abstract: Divergence time estimation is crucial to provide temporal signals for dating biologically important events, from species divergence to viral transmissions in space and time. With the advent of high-throughput sequencing, recent Bayesian phylogenetic studies have analyzed hundreds to thousands of sequences. Such large-scale analyses challenge divergence time reconstruction by requiring inference on…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 34 pages, 6 figures

  5. arXiv:2107.01246

    q-bio.PE stat.AP stat.ME

    Principled, practical, flexible, fast: a new approach to phylogenetic factor analysis

    Authors: Gabriel W. Hassler, Brigida Gallone, Leandro Aristide, William L. Allen, Max R. Tolkoff, Andrew J. Holbrook, Guy Baele, Philippe Lemey, Marc A. Suchard

    Abstract: Biological phenotypes are products of complex evolutionary processes in which selective forces influence multiple biological trait measurements in unknown ways. Phylogenetic factor analysis disentangles these relationships across the evolutionary history of a group of organisms. Scientists seeking to employ this modeling framework confront numerous modeling and implementation decisions, the detail…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 27 pages, 7 figures, 1 table

  6. arXiv:2003.10336

    stat.AP q-bio.PE

    Efficient Bayesian Inference of General Gaussian Models on Large Phylogenetic Trees

    Authors: Paul Bastide, Lam Si Tung Ho, Guy Baele, Philippe Lemey, Marc A Suchard

    Abstract: Phylogenetic comparative methods correct for shared evolutionary history among a set of non-independent organisms by modeling sample traits as arising from a diffusion process along the branches of a possibly unknown history. To incorporate such uncertainty, we present a scalable Bayesian inference framework under a general Gaussian trait evolution model that exploits Hamiltonian Monte Carlo (H…

    Submitted 29 September, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

  7. arXiv:2002.00245

    q-bio.PE stat.ME

    Online Bayesian phylodynamic inference in BEAST with application to epidemic reconstruction

    Authors: Mandev S. Gill, Philippe Lemey, Marc A. Suchard, Andrew Rambaut, Guy Baele

    Abstract: Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an 'online' fashion. Widely-used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees…

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: 20 pages, 3 figures

  8. arXiv:1906.05136

    q-bio.PE stat.ME

    Markov-modulated continuous-time Markov chains to identify site- and branch-specific evolutionary variation

    Authors: Guy Baele, Mandev S. Gill, Philippe Lemey, Marc A. Suchard

    Abstract: Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the pro…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 30 pages, 8 figures

  9. arXiv:1905.12146

    stat.CO q-bio.PE stat.ME

    Gradients do grow on trees: a linear-time $\mathcal{O}(N)$-dimensional gradient for statistical phylogenetics

    Authors: Xiang Ji, Zhenyu Zhang, Andrew Holbrook, Akihiko Nishimura, Guy Baele, Andrew Rambaut, Philippe Lemey, Marc A. Suchard

    Abstract: Calculation of the log-likelihood stands as the computational bottleneck for many statistical phylogenetic algorithms. Even worse is its gradient evaluation, often used to target regions of high probability. Order $\mathcal{O}(N)$-dimensional gradient calculations based on the standard pruning algorithm require $\mathcal{O}(N^2)$ operations where N…

    Submitted 28 May, 2019; originally announced May 2019.

  10. arXiv:1905.04582

    stat.CO

    Massive parallelization boosts big Bayesian multidimensional scaling

    Authors: Andrew Holbrook, Philippe Lemey, Guy Baele, Simon Dellicour, Dirk Brockmann, Andrew Rambaut, Marc Suchard

    Abstract: Big Bayes is the computationally intensive co-application of big data and large, expressive Bayesian models for the analysis of complex phenomena in scientific inference and statistical learning. Standing as an example, Bayesian multidimensional scaling (MDS) can help scientists learn viral trajectories through space-time, but its computational burden prevents its wider use. Crucial MDS model calc…

    Submitted 10 December, 2019; v1 submitted 11 May, 2019; originally announced May 2019.

  11. arXiv:1701.07496

    stat.ME stat.AP stat.CO

    Phylogenetic Factor Analysis

    Authors: Max R. Tolkoff, Michael L. Alfaro, Guy Baele, Philippe Lemey, Marc A. Suchard

    Abstract: Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting…

    Submitted 25 January, 2017; originally announced January 2017.

    Comments: 51 pages (42 main, 9 supplemental), 9 figures (5 main, 4 supplemental), 4 tables (2 main, 2 supplemental), submitted to Systematic Biology

  12. arXiv:1512.07948

    q-bio.PE stat.ME

    A Relaxed Drift Diffusion Model for Phylogenetic Trait Evolution

    Authors: Mandev S. Gill, Lam Si Tung Ho, Guy Baele, Philippe Lemey, Marc A. Suchard

    Abstract: Understanding the processes that give rise to quantitative measurements associated with molecular sequence data remains an important issue in statistical phylogenetics. Examples of such measurements include geographic coordinates in the context of phylogeography and phenotypic traits in the context of comparative studies. A popular approach is to model the evolution of continuously varying traits…

    Submitted 29 December, 2015; v1 submitted 24 December, 2015; originally announced December 2015.

    Comments: 35 pages, 3 figures, 5 tables. Changed from double-spaced to single-spaced

  13. arXiv:1309.3075

    q-bio.PE stat.CO

    Inferring Heterogeneous Evolutionary Processes Through Time: from sequence substitution to phylogeography

    Authors: Filip Bielejec, Philippe Lemey, Guy Baele, Andrew Rambaut, Marc A Suchard

    Abstract: Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous substitution processes. Motivated by computational convenience, this assumption sacrifices biological realism and offers little opportunity to uncover the temporal dynamics in evolutionary histories. Here, we extend and generalize an evolutionary approach that relaxes the time-homogeneous process assumptio…

    Submitted 12 September, 2013; originally announced September 2013.

    Comments: 30 pages, 6 figures, 3 tables