Skip to main content

Showing 1–17 of 17 results for author: Wood, A T A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.09853  [pdf, other

    stat.ME

    Principal Subsimplex Analysis

    Authors: Hyeon Lee, Kassel Liam Hingee, Janice L. Scealy, Andrew T. A. Wood, Eric Grunsky, J. S. Marron

    Abstract: Compositional data, also referred to as simplicial data, naturally arise in many scientific domains such as geochemistry, microbiology, and economics. In such domains, obtaining sensible lower-dimensional representations and modes of variation plays an important role. A typical approach to the problem is applying a log-ratio transformation followed by principal component analysis (PCA). However, t… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  2. arXiv:2504.04622  [pdf, ps, other

    stat.ME

    Regularization and Selection in A Directed Network Model with Nodal Homophily and Nodal Effects

    Authors: Zhaoyu Xing, Y. X. Rachel Wang, Andrew T. A. Wood, Tao Zou

    Abstract: This article introduces a regularization and selection methods for directed networks with nodal homophily and nodal effects. The proposed approach not only preserves the statistical efficiency of the resulting estimator, but also ensures that the selection of nodal homophily and nodal effects is scalable with large-scale network data and multiple nodal features. In particular, we propose a directe… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 26 pages

    MSC Class: 62J07; 62F12; 05C82

  3. arXiv:2503.24003  [pdf, other

    stat.ME

    A Robust Extrinsic Single-index Model for Spherical Data

    Authors: Houren Hong, Janice L. Scealy, Andrew T. A. Wood, Yanrong Yang

    Abstract: Regression with a spherical response is challenging due to the absence of linear structure, making standard regression models inadequate. Existing methods, mainly parametric, lack the flexibility to capture the complex relationship induced by spherical curvature, while methods based on techniques from Riemannian geometry often suffer from computational difficulties. The non-Euclidean structure fur… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

  4. arXiv:2501.01657  [pdf, other

    stat.ME

    Change Point Detection for Random Objects with Possibly Periodic Behavior

    Authors: Jiazhen Xu, Andrew T. A. Wood, Tao Zou

    Abstract: Time-varying random objects have been increasingly encountered in modern data analysis. Moreover, in a substantial number of these applications, periodic behavior of the random objects has been observed. We introduce a new, powerful scan statistic and corresponding test for the precise identification and localization of abrupt changes in the distribution of non-Euclidean random objects with possib… ▽ More

    Submitted 3 January, 2025; originally announced January 2025.

    Comments: arXiv admin note: text overlap with arXiv:2311.16025 by other authors

  5. arXiv:2412.18818  [pdf, other

    math.ST stat.CO stat.ME

    Empirical likelihood for Fréchet means on open books

    Authors: Karthik Bharath, Huiling Le, Andrew T A Wood, Xi Yan

    Abstract: Empirical Likelihood (EL) is a type of nonparametric likelihood that is useful in many statistical inference problems, including confidence region construction and $k$-sample problems. It enjoys some remarkable theoretical properties, notably Bartlett correctability. One area where EL has potential but is under-developed is in non-Euclidean statistics where the Fréchet mean is the population chara… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  6. arXiv:2312.07741  [pdf, other

    stat.ME math.ST stat.AP

    Robust Functional Principal Component Analysis for Non-Euclidean Random Objects

    Authors: Jiazhen Xu, Andrew T. A. Wood, Tao Zou

    Abstract: Functional data analysis offers a diverse toolkit of statistical methods tailored for analyzing samples of real-valued random functions. Recently, samples of time-varying random objects, such as time-varying networks, have been increasingly encountered in modern data analysis. These data structures represent elements within general metric spaces that lack local or global linear structures, renderi… ▽ More

    Submitted 6 March, 2025; v1 submitted 28 November, 2023; originally announced December 2023.

  7. arXiv:2305.07434  [pdf, other

    stat.CO math.PR

    A branch cut approach to the probability density and distribution functions of a linear combination of central and non-central Chi-square random variables

    Authors: Alfred Kume, Tomonari Sei, Andrew T. A. Wood

    Abstract: The paper considers the distribution of a general linear combination of central and non-central chi-square random variables by exploring the branch cut regions that appear in the standard Laplace inversion process. Due to the original interest from the directional statistics, the focus of this paper is on the density function of such distributions and not on their cumulative distribution function.… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  8. arXiv:2305.07349  [pdf, other

    stat.ME math.ST stat.AP stat.CO

    Robust score matching for compositional data

    Authors: Janice L. Scealy, Kassel L. Hingee, John T. Kent, Andrew T. A. Wood

    Abstract: The restricted polynomially-tilted pairwise interaction (RPPI) distribution gives a flexible model for compositional data. It is particularly well-suited to situations where some of the marginal distributions of the components of a composition are concentrated near zero, possibly with right skewness. This article develops a method of tractable robust estimation for the model by combining two ideas… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  9. arXiv:2303.08987  [pdf, other

    stat.ME math.ST stat.AP stat.CO

    Generalized Score Matching

    Authors: Jiazhen Xu, Janice L. Scealy, Andrew T. A. Wood, Tao Zou

    Abstract: Score matching is an estimation procedure that has been developed for statistical models whose probability density function is known up to proportionality but whose normalizing constant is intractable, so that maximum likelihood is difficult or impossible to implement. To date, applications of score matching have focused more on continuous IID models. Motivated by various data modelling problems,… ▽ More

    Submitted 21 April, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.09864

  10. arXiv:2211.03181  [pdf, other

    stat.ME

    Cauchy robust principal component analysis with applications to high-deimensional data sets

    Authors: Ayisha Fayomi, Yannis Pantazis, Michail Tsagris, Andrew T. A. Wood

    Abstract: Principal component analysis (PCA) is a standard dimensionality reduction technique used in various research and applied fields. From an algorithmic point of view, classical PCA can be formulated in terms of operations on a multivariate Gaussian likelihood. As a consequence of the implied Gaussian formulation, the principal components are not robust to outliers. In this paper, we propose a modifie… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  11. arXiv:2203.09864  [pdf, other

    math.ST stat.ME

    Generalized Score Matching for Regression

    Authors: Jiazhen Xu, Janice L. Scealy, Andrew T. A. Wood, Tao Zou

    Abstract: Many probabilistic models that have an intractable normalizing constant may be extended to contain covariates. Since the evaluation of the exact likelihood is difficult or even impossible for these models, score matching was proposed to avoid explicit computation of the normalizing constant. In the literature, score matching has so far only been developed for models in which the observations are i… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  12. arXiv:2012.12461  [pdf, other

    stat.ME stat.CO

    Score matching for compositional distributions

    Authors: Janice L. Scealy, Andrew T. A. Wood

    Abstract: Compositional data and multivariate count data with known totals are challenging to analyse due to the non-negativity and sum-to-one constraints on the sample space. It is often the case that many of the compositional components are highly right-skewed, with large numbers of zeros. A major limitation of currently available estimators for compositional models is that they either cannot handle many… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    MSC Class: 62H12

  13. Gaussian asymptotic limits for the $α$-transformation in the analysis of compositional data

    Authors: Yannis Pantazis, Michail Tsagris, Andrew T. A. Wood

    Abstract: Compositional data consists of vectors of proportions whose components sum to 1. Such vectors lie in the standard simplex, which is a manifold with boundary. One issue that has been rather controversial within the field of compositional data analysis is the choice of metric on the simplex. One popular possibility has been to use the metric implied by logtransforming the data, as proposed by Aitchi… ▽ More

    Submitted 21 February, 2019; v1 submitted 29 November, 2018; originally announced December 2018.

    Comments: This is a preprint of the original publication that is available at https://link.springer.com/article/10.1007/s13171-018-00160-1

    MSC Class: 62E20; 62H12

  14. arXiv:1711.02774  [pdf, ps, other

    stat.ME

    The extended power distribution: A new distribution on $(0, 1)$

    Authors: Chibueze E. Ogbonnaya, Simon P. Preston, Andrew T. A. Wood

    Abstract: We propose a two-parameter bounded probability distribution called the extended power distribution. This distribution on $(0, 1)$ is similar to the beta distribution, however there are some advantages which we explore. We define the moments and quantiles of this distribution and show that it is possible to give an $r$-parameter extension of this distribution ($r>2$). We also consider its complemen… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: 22 pages, 19 figures, 5 tables

    MSC Class: 62E15; 60E05

  15. arXiv:1607.07974  [pdf, ps, other

    stat.ME

    Nonparametric hypothesis testing for equality of means on the simplex

    Authors: Michail Tsagris, Simon Preston, Andrew T. A. Wood

    Abstract: In the context of data that lie on the simplex, we investigate use of empirical and exponential empirical likelihood, and Hotelling and James statistics, to test the null hypothesis of equal population means based on two independent samples. We perform an extensive numerical study using data simulated from various distributions on the simplex. The results, taken together with practical considerati… ▽ More

    Submitted 4 August, 2016; v1 submitted 27 July, 2016; originally announced July 2016.

    Comments: This is a preprint of the article to be published by Taylor & Francis Group in Journal of Statistical Computation and Simulation

  16. arXiv:1506.04976  [pdf, ps, other

    stat.ME

    Improved classification for compositional data using the $α$-transformation

    Authors: Michail Tsagris, Simon Preston, Andrew T. A. Wood

    Abstract: In compositional data analysis an observation is a vector containing non-negative values, only the relative sizes of which are considered to be of interest. Without loss of generality, a compositional vector can be taken to be a vector of proportions that sum to one. Data of this type arise in many areas including geology, archaeology, biology, economics and political science. In this paper we inv… ▽ More

    Submitted 17 June, 2015; v1 submitted 16 June, 2015; originally announced June 2015.

    Comments: This is a 17-page preprint and has been accepted for publication at the Journal of Classification

    MSC Class: 62H30

  17. arXiv:1106.1451  [pdf, ps, other

    stat.ME

    A data-based power transformation for compositional data

    Authors: Michail T. Tsagris, Simon Preston, Andrew T. A. Wood

    Abstract: Compositional data analysis is carried out either by neglecting the compositional constraint and applying standard multivariate data analysis, or by transforming the data using the logs of the ratios of the components. In this work we examine a more general transformation which includes both approaches as special cases. It is a power transformation and involves a single parameter, α. The transform… ▽ More

    Submitted 16 June, 2011; v1 submitted 7 June, 2011; originally announced June 2011.

    Comments: Published in the proceddings of the 4th international workshop on Compositional Data Analysis. http://congress.cimne.com/codawork11/frontal/default.asp

    Journal ref: Proceedings of CoDaWork'11: 4th international workshop on Compositional Data Analysis, Egozcue, J.J., Tolosana-Delgado, R. and Ortego, M.I. (eds.) 2011. ISBN: 978-84-87867-76-7