Extrinsic Principal Component Analysis
Authors:
Ka Chun Wong,
Vic Patrangenaru,
Robert L. Paige,
Mihaela Pricop Jeckstadt
Abstract:
One develops a fast computational methodology for principal component analysis on manifolds. Instead of estimating intrinsic principal components on an object space with a Riemannian structure, one embeds the object space in a numerical space, and the resulting chord distance is used. This method helps us analyzing high, theoretically even infinite dimensional data, from a new perspective. We defi…
▽ More
One develops a fast computational methodology for principal component analysis on manifolds. Instead of estimating intrinsic principal components on an object space with a Riemannian structure, one embeds the object space in a numerical space, and the resulting chord distance is used. This method helps us analyzing high, theoretically even infinite dimensional data, from a new perspective. We define the extrinsic principal sub-manifolds of a random object on a Hilbert manifold embedded in a Hilbert space, and the sample counterparts. The resulting extrinsic principal components are useful for dimension data reduction. For application, one retains a very small number of such extrinsic principal components for a shape of contour data sample, extracted from imaging data.
△ Less
Submitted 3 October, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
Topological Data Analysis for Object Data
Authors:
Vic Patrangenaru,
Peter Bubenik,
Robert L. Paige,
Daniel Osborne
Abstract:
Statistical analysis on object data presents many challenges. Basic summaries such as means and variances are difficult to compute. We apply ideas from topology to study object data. We present a framework for using persistence landscapes to vectorize object data and perform statistical analysis. We apply to this pipeline to some biological images that were previously shown to be challenging to st…
▽ More
Statistical analysis on object data presents many challenges. Basic summaries such as means and variances are difficult to compute. We apply ideas from topology to study object data. We present a framework for using persistence landscapes to vectorize object data and perform statistical analysis. We apply to this pipeline to some biological images that were previously shown to be challenging to study using shape theory. Surprisingly, the most persistent features are shown to be "topological noise" and the statistical analysis depends on the less persistent features which we refer to as the "geometric signal". We also describe the first steps to a new approach to using topology for object data analysis, which applies topology to distributions on object spaces.
△ Less
Submitted 26 April, 2018;
originally announced April 2018.