Skip to main content

Showing 1–16 of 16 results for author: Nye, T M W

.
  1. arXiv:2506.22135  [pdf, ps, other

    stat.ME

    Brownian motion, bridges and Bayesian inference in phylogenetic tree space

    Authors: William M. Woodman, Tom M. W. Nye

    Abstract: Billera-Holmes-Vogtmann (BHV) tree space is a geodesic metric space of edge-weighted phylogenetic trees with a fixed leaf set. Constructing parametric distributions on this space is challenging due to its non-Euclidean geometry and the intractability of normalizing constants. We address this by fitting Brownian motion transition kernels to tree-valued data via a non-Euclidean bridge construction.… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 21 pages plus appendix of 22 pages. 22 figures

    MSC Class: 60J65 (Primary) 92D15 (Secondary)

  2. arXiv:2407.03977  [pdf, other

    q-bio.PE math.ST

    Statistics for Phylogenetic Trees in the Presence of Stickiness

    Authors: Lars Lammers, Tom M. W. Nye, Stephan F. Huckemann

    Abstract: Samples of phylogenetic trees arise in a variety of evolutionary and biomedical applications, and the Fréchet mean in Billera-Holmes-Vogtmann tree space is a summary tree shown to have advantages over other mean or consensus trees. However, use of the Fréchet mean raises computational and statistical issues which we explore in this paper. The Fréchet sample mean is known often to contain fewer int… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 37 pages, 16 figures

  3. arXiv:2402.06410  [pdf, other

    stat.ME

    Manifold-valued models for analysis of EEG time series data

    Authors: Tao Ding, Tom M. W. Nye, Yujiang Wang

    Abstract: We propose a model for time series taking values on a Riemannian manifold and fit it to time series of covariance matrices derived from EEG data for patients suffering from epilepsy. The aim of the study is two-fold: to develop a model with interpretable parameters for different possible modes of EEG dynamics, and to explore the extent to which modelling results are affected by the choice of manif… ▽ More

    Submitted 12 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 22 pages and 9 figures. Supplementary material is appended

    MSC Class: 62M10; 62R30

  4. arXiv:2304.05025  [pdf, other

    math.ST

    Types of Stickiness in BHV Phylogenetic Tree Spaces and Their Degree

    Authors: Lars Lammers, Do Tran Van, Tom M. W. Nye, Stephan F. Huckemann

    Abstract: It has been observed that the sample mean of certain probability distributions in Billera-Holmes-Vogtmann (BHV) phylogenetic spaces is confined to a lower-dimensional subspace for large enough sample size. This non-standard behavior has been called stickiness and poses difficulties in statistical applications when comparing samples of sticky distributions. We extend previous results on stickiness… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 8 Pages, 1 Figure, conference submission to GSI 2023

    MSC Class: 62F03

  5. arXiv:2209.05332  [pdf, other

    math.ST math.DG

    Foundations of the Wald Space for Phylogenetic Trees

    Authors: Jonas Lueg, Maryam K. Garba, Tom M. W. Nye, Stephan F. Huckemann

    Abstract: Evolutionary relationships between species are represented by phylogenetic trees, but these relationships are subject to uncertainty due to the random nature of evolution. A geometry for the space of phylogenetic trees is necessary in order to properly quantify this uncertainty during the statistical analysis of collections of possible evolutionary trees inferred from biological data. Recently, th… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 42 pages, 15 figures

    MSC Class: 30L05; 57N80; 53A35

  6. arXiv:2107.00502  [pdf, other

    stat.AP q-bio.QM stat.ME

    A sparse Bayesian hierarchical vector autoregressive model for microbial dynamics in a wastewater treatment plant

    Authors: Naomi E. Hannaford, Sarah E. Heaps, Tom M. W. Nye, Thomas P. Curtis, Ben Allen, Andrew Golightly, Darren J. Wilkinson

    Abstract: Proper function of a wastewater treatment plant (WWTP) relies on maintaining a delicate balance between a multitude of competing microorganisms. Gaining a detailed understanding of the complex network of interactions therein is essential to maximising not only current operational efficiencies, but also for the effective design of new treatment technologies. Metagenomics offers an insight into thes… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 23 pages, 6 figures

  7. arXiv:2007.08511  [pdf, other

    q-bio.PE stat.AP

    Incorporating compositional heterogeneity into Lie Markov models for phylogenetic inference

    Authors: Naomi E. Hannaford, Sarah E. Heaps, Tom M. W. Nye, Tom A. Williams, T. Martin Embley

    Abstract: Phylogenetics uses alignments of molecular sequence data to learn about evolutionary trees. Substitutions in sequences are modelled through a continuous-time Markov process, characterised by an instantaneous rate matrix, which standard models assume is time-reversible and stationary. These assumptions are biologically questionable and induce a likelihood function which is invariant to a tree's roo… ▽ More

    Submitted 17 July, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

  8. arXiv:2003.13004  [pdf, other

    math.PR cs.IT math.DG q-bio.PE stat.ME

    Information geometry for phylogenetic trees

    Authors: Maryam K. Garba, Tom M. W. Nye, Jonas Lueg, Stephan F. Huckemann

    Abstract: We propose a new space of phylogenetic trees which we call wald space. The motivation is to develop a space suitable for statistical analysis of phylogenies, but with a geometry based on more biologically principled assumptions than existing spaces: in wald space, trees are close if they induce similar distributions on genetic sequence data. As a point set, wald space contains the previously devel… ▽ More

    Submitted 17 September, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    MSC Class: 92D15; 53A35; 94A17

  9. arXiv:1702.05972  [pdf, other

    stat.ME

    Generalising rate heterogeneity across sites in statistical phylogenetics

    Authors: Sarah E. Heaps, Tom M. W. Nye, Richard J. Boys, Tom A. Williams, Svetlana Cherlin, T. Martin Embley

    Abstract: Phylogenetics uses alignments of molecular sequence data to learn about evolutionary trees relating species. Along branches, sequence evolution is modelled using a continuous-time Markov process characterised by an instantaneous rate matrix. Early models assumed the same rate matrix governed substitutions at all sites of the alignment, ignoring variation in evolutionary pressures. Substantial impr… ▽ More

    Submitted 2 May, 2019; v1 submitted 20 February, 2017; originally announced February 2017.

    Comments: 13 figures, 1 accompanying file of supplementary material

  10. arXiv:1609.03045  [pdf, other

    stat.ME

    Principal component analysis and the locus of the Frechet mean in the space of phylogenetic trees

    Authors: Tom M. W. Nye, Xiaoxian Tang, Grady Weyenberg, Ruriko Yoshida

    Abstract: Most biological data are multidimensional, posing a major challenge to human comprehension and computational analysis. Principal component analysis is the most popular approach to rendering two- or three-dimensional representations of the major trends in such multidimensional data. The problem of multidimensionality is acute in the rapidly growing area of phylogenomics. Evolutionary relationships… ▽ More

    Submitted 10 September, 2016; originally announced September 2016.

    Comments: 26 pages, 5 figures

    MSC Class: 60D05 (Primary) 62H25; 92D15 (Secondary)

  11. arXiv:1508.02906  [pdf, other

    q-bio.PE math.PR

    Convergence of random walks to Brownian motion on cubical complexes

    Authors: Tom M. W. Nye

    Abstract: Cubical complexes are metric spaces constructed by gluing together unit cubes in an analogous way to the construction of simplicial complexes. We construct Brownian motion on such spaces, define random walks, and prove that the transition kernels of the random walks converge to that for Brownian motion. The proof involves pulling back onto the complex the distribution of Brownian sample paths on t… ▽ More

    Submitted 22 May, 2019; v1 submitted 12 August, 2015; originally announced August 2015.

    Comments: 14 pages, 2 figures. The results in the original submission have been changed substantially. In particular, the main theorem has been generalized to apply to a wide class of cubical complexes rather than Billera-Holmes-Vogtmann tree space alone. This simplifies some parts of the proof, although the main ideas are the same. Tree space is now dealt with as a special example in Section 5

    MSC Class: 92D15 (Primary); 60J65 (Secondary)

  12. arXiv:1505.08009  [pdf, other

    q-bio.PE

    The effect of non-reversibility on inferring rooted phylogenies

    Authors: S. Cherlin, T. M. W. Nye, S. E. Heaps, R. J. Boys, T. A. Williams, T. M. Embley

    Abstract: Most phylogenetic models assume that the evolutionary process is stationary and reversible. As a result, the root of the tree cannot be inferred as part of the analysis because the likelihood of the data does not depend on the position of the root. Yet defining the root of a phylogenetic tree is a key component of phylogenetic inference because it provides a point of reference for polarising ances… ▽ More

    Submitted 20 February, 2017; v1 submitted 29 May, 2015; originally announced May 2015.

    Comments: 8 figures, 6 tables

  13. An algorithm for constructing principal geodesics in phylogenetic treespace

    Authors: Tom M. W. Nye

    Abstract: Most phylogenetic analyses result in a sample of trees, but summarizing and visualizing these samples can be challenging. Consensus trees often provide limited information about a sample, and so methods such as consensus networks, clustering and multidimensional scaling have been developed and applied to tree samples. This paper describes a stochastic algorithm for constructing a principal geodesi… ▽ More

    Submitted 2 September, 2014; originally announced September 2014.

    Comments: 6 figures, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 11, No. 2, 2014

  14. arXiv:1202.5132  [pdf, ps, other

    math.ST q-bio.PE

    Principal components analysis in the space of phylogenetic trees

    Authors: Tom M. W. Nye

    Abstract: Phylogenetic analysis of DNA or other data commonly gives rise to a collection or sample of inferred evolutionary trees. Principal Components Analysis (PCA) cannot be applied directly to collections of trees since the space of evolutionary trees on a fixed set of taxa is not a vector space. This paper describes a novel geometrical approach to PCA in tree-space that constructs the first principal p… ▽ More

    Submitted 23 February, 2012; originally announced February 2012.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOS915 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS915

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2716-2739

  15. arXiv:hep-th/0311215  [pdf, ps, other

    hep-th

    The Geometry of Calorons

    Authors: Tom M. W. Nye

    Abstract: Calorons (periodic instantons) are anti-self-dual (ASD) connections on S^1 \times R^3 and form an intermediate case between instantons and monopoles. The ADHM and Nahm constructions of instantons and monopoles can be regarded as generalizations of a correspondence between ASD connections on the 4-torus, often referred to as the Nahm transform. This thesis describes how the Nahm transform can be… ▽ More

    Submitted 22 November, 2003; originally announced November 2003.

    Comments: PhD Thesis, University of Edinburgh 2001, supervised by Michael Singer

  16. arXiv:math/0009144  [pdf, ps, other

    math.DG hep-th

    An L^2-Index Theorem for Dirac Operators on S^1 * R^3

    Authors: Tom M. W. Nye, Michael A. Singer

    Abstract: An expression is found for the $L^2$-index of a Dirac operator coupled to a connection on a $U_n$ vector bundle over $S^1\times{\mathbb R}^3$. Boundary conditions for the connection are given which ensure the coupled Dirac operator is Fredholm. Callias' index theorem is used to calculate the index when the connection is independent of the coordinate on $S^1$. An excision theorem due to Gromov, L… ▽ More

    Submitted 14 September, 2000; originally announced September 2000.

    Comments: 14 pages, Latex, to appear in the Journal of Functional Analysis