-
Tracking the topology of neural manifolds across populations
Authors:
Iris H. R. Yoon,
Gregory Henselman-Petrusek,
Yiyi Yu,
Robert Ghrist,
Spencer LaVere Smith,
Chad Giusti
Abstract:
Neural manifolds summarize the intrinsic structure of the information encoded by a population of neurons. Advances in experimental techniques have made simultaneous recordings from multiple brain regions increasingly commonplace, raising the possibility of studying how these manifolds relate across populations. However, when the manifolds are nonlinear and possibly code for multiple unknown variab…
▽ More
Neural manifolds summarize the intrinsic structure of the information encoded by a population of neurons. Advances in experimental techniques have made simultaneous recordings from multiple brain regions increasingly commonplace, raising the possibility of studying how these manifolds relate across populations. However, when the manifolds are nonlinear and possibly code for multiple unknown variables, it is challenging to extract robust and falsifiable information about their relationships. We introduce a framework, called the method of analogous cycles, for matching topological features of neural manifolds using only observed dissimilarity matrices within and between neural populations. We demonstrate via analysis of simulations and \emph{in vivo} experimental data that this method can be used to correctly identify multiple shared circular coordinate systems across both stimuli and inferred neural manifolds. Conversely, the method rejects matching features that are not intrinsic to one of the systems. Further, as this method is deterministic and does not rely on dimensionality reduction or optimization methods, it is amenable to direct mathematical investigation and interpretation in terms of the underlying neural activity. We thus propose the method of analogous cycles as a suitable foundation for a theory of cross-population analysis via neural manifolds.
△ Less
Submitted 26 March, 2025;
originally announced March 2025.
-
Interval Decomposition of Persistence Modules over a Principal Ideal Domain
Authors:
Jiajie Luo,
Gregory Henselman-Petrusek
Abstract:
The study of persistent homology has contributed new insights and perspectives into a variety of interesting problems in science and engineering. Work in this domain relies on the result that any finitely-indexed persistence module of finite-dimensional vector spaces admits an interval decomposition -- that is, a decomposition as a direct sum of simpler components called interval modules. This res…
▽ More
The study of persistent homology has contributed new insights and perspectives into a variety of interesting problems in science and engineering. Work in this domain relies on the result that any finitely-indexed persistence module of finite-dimensional vector spaces admits an interval decomposition -- that is, a decomposition as a direct sum of simpler components called interval modules. This result fails if we replace vector spaces with modules over more general coefficient rings.
We introduce an algorithm to determine whether a persistence module of pointwise free and finitely-generated modules over a principal ideal domain (PID) splits as a direct sum of interval submodules. If one exists, our algorithm outputs an interval decomposition. When considering persistence modules with coefficients in $\Z$ or $\Q[x]$, our algorithm computes an interval decomposition in polynomial time. This is the first algorithm with these properties of which we are aware.
We also show that a persistence module of pointwise free and finitely-generated modules over a PID splits as a direct sum of interval submodules if and only if the cokernel of every structure map is free. This result underpins the formulation of our algorithm. It also complements prior findings by Obayashi and Yoshiwaki regarding persistent homology, including a criterion for field independence and an algorithm to decompose persistence homology modules of simplex-wise filtrations.
△ Less
Submitted 14 April, 2025; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Stability for Inference with Persistent Homology Rank Functions
Authors:
Qiquan Wang,
Inés García-Redondo,
Pierre Faugère,
Gregory Henselman-Petrusek,
Anthea Monod
Abstract:
Persistent homology barcodes and diagrams are a cornerstone of topological data analysis that capture the "shape" of a wide range of complex data structures, such as point clouds, networks, and functions. However, their use in statistical settings is challenging due to their complex geometric structure. In this paper, we revisit the persistent homology rank function, which is mathematically equiva…
▽ More
Persistent homology barcodes and diagrams are a cornerstone of topological data analysis that capture the "shape" of a wide range of complex data structures, such as point clouds, networks, and functions. However, their use in statistical settings is challenging due to their complex geometric structure. In this paper, we revisit the persistent homology rank function, which is mathematically equivalent to a barcode and persistence diagram, as a tool for statistics and machine learning. Rank functions, being functions, enable the direct application of the statistical theory of functional data analysis (FDA)-a domain of statistics adapted for data in the form of functions. A key challenge they present over barcodes in practice, however, is their lack of stability-a property that is crucial to validate their use as a faithful representation of the data and therefore a viable summary statistic. In this paper, we fill this gap by deriving two stability results for persistent homology rank functions under a suitable metric for FDA integration. We then study the performance of rank functions in functional inferential statistics and machine learning on real data applications, in both single and multiparameter persistent homology. We find that the use of persistent homology captured by rank functions offers a clear improvement over existing non-persistence-based approaches.
△ Less
Submitted 22 September, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Saecular persistence
Authors:
Robert Ghrist,
Gregory Henselman-Petrusek
Abstract:
A persistence module is a functor $f: \mathbf{I} \to \mathsf{E}$, where $\mathbf{I}$ is the poset category of a totally ordered set. This work introduces saecular decomposition: a categorically natural method to decompose $f$ into simple parts, called interval modules. Saecular decomposition exists under generic conditions, e.g., when $\mathbf{I}$ is well ordered and $\mathsf{E}$ is a category of…
▽ More
A persistence module is a functor $f: \mathbf{I} \to \mathsf{E}$, where $\mathbf{I}$ is the poset category of a totally ordered set. This work introduces saecular decomposition: a categorically natural method to decompose $f$ into simple parts, called interval modules. Saecular decomposition exists under generic conditions, e.g., when $\mathbf{I}$ is well ordered and $\mathsf{E}$ is a category of modules or groups. This represents a substantial generalization of existing factorizations of 1-parameter persistence modules, leading to, among other things, persistence diagrams not only in homology, but in homotopy.
Applications of saecular decomposition include inverse and extension problems involving filtered topological spaces, the 1-parameter generalized persistence diagram, and the Leray-Serre spectral sequence. Several examples -- including cycle representatives for generalized barcodes -- hold special significance for scientific applications.
The key tools in this approach are modular and distributive order lattices, combined with Puppe exact categories.
△ Less
Submitted 12 December, 2021; v1 submitted 9 December, 2021;
originally announced December 2021.
-
Algorithmic Reconstruction of the Fiber of Persistent Homology on Cell Complexes
Authors:
Jacob Leygonie,
Gregory Henselman-Petrusek
Abstract:
Let $K$ be a finite simplicial, cubical, delta or CW complex. The persistence map $\mathrm{PH}$ takes a filter $f:K \rightarrow \mathbb{R}$ as input and returns the barcodes $\mathrm{PH}(f)$ of the associated sublevel set persistent homology modules. We address the inverse problem: given a target barcode $D$, computing the fiber $\mathrm{PH}^{-1}(D)$. For this, we use the fact that…
▽ More
Let $K$ be a finite simplicial, cubical, delta or CW complex. The persistence map $\mathrm{PH}$ takes a filter $f:K \rightarrow \mathbb{R}$ as input and returns the barcodes $\mathrm{PH}(f)$ of the associated sublevel set persistent homology modules. We address the inverse problem: given a target barcode $D$, computing the fiber $\mathrm{PH}^{-1}(D)$. For this, we use the fact that $\mathrm{PH}^{-1}(D)$ decomposes as complex of polyhedra when $K$ is a simplicial complex, and we generalise this result to arbitrary based chain complexes. We then design and implement a depth first search algorithm that recovers the polyhedra forming the fiber $\mathrm{PH}^{-1}(D)$. As an application, we solve a corpus of 120 sample problems, providing a first insight into the statistical structure of these fibers, for general CW complexes.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
U-match factorization: sparse homological algebra, lazy cycle representatives, and dualities in persistent (co)homology
Authors:
Haibin Hang,
Chad Giusti,
Lori Ziegelmeier,
Gregory Henselman-Petrusek
Abstract:
Persistent homology is a leading tool in topological data analysis (TDA). Many problems in TDA can be solved via homological -- and indeed, linear -- algebra. However, matrices in this domain are typically large, with rows and columns numbered in billions. Low-rank approximation of such arrays typically destroys essential information; thus, new mathematical and computational paradigms are needed f…
▽ More
Persistent homology is a leading tool in topological data analysis (TDA). Many problems in TDA can be solved via homological -- and indeed, linear -- algebra. However, matrices in this domain are typically large, with rows and columns numbered in billions. Low-rank approximation of such arrays typically destroys essential information; thus, new mathematical and computational paradigms are needed for very large, sparse matrices.
We present the U-match matrix factorization scheme to address this challenge. U-match has two desirable features. First, it admits a compressed storage format that reduces the number of nonzero entries held in computer memory by one or more orders of magnitude over other common factorizations. Second, it permits direct solution of diverse problems in linear and homological algebra, without decompressing matrices stored in memory. These problems include look-up and retrieval of rows and columns; evaluation of birth/death times, and extraction of generators in persistent (co)homology; and, calculation of bases for boundary and cycle subspaces of filtered chain complexes. Such bases are key to unlocking a range of other topological techniques for use in TDA, and U-match factorization is designed to make such calculations broadly accessible to practitioners.
As an application, we show that individual cycle representatives in persistent homology can be retrieved at time and memory costs orders of magnitude below current state of the art, via global duality. Moreover, the algebraic machinery needed to achieve this computation already exists in many modern solvers.
△ Less
Submitted 20 August, 2021; v1 submitted 19 August, 2021;
originally announced August 2021.
-
Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User's Guide
Authors:
Lu Li,
Connor Thompson,
Gregory Henselman-Petrusek,
Chad Giusti,
Lori Ziegelmeier
Abstract:
Cycle representatives of persistent homology classes can be used to provide descriptions of topological features in data. However, the non-uniqueness of these representatives creates ambiguity and can lead to many different interpretations of the same set of classes. One approach to solving this problem is to optimize the choice of representative against some measure that is meaningful in the cont…
▽ More
Cycle representatives of persistent homology classes can be used to provide descriptions of topological features in data. However, the non-uniqueness of these representatives creates ambiguity and can lead to many different interpretations of the same set of classes. One approach to solving this problem is to optimize the choice of representative against some measure that is meaningful in the context of the data. In this work, we provide a study of the effectiveness and computational cost of several $\ell_1$-minimization optimization procedures for constructing homological cycle bases for persistent homology with rational coefficients in dimension one, including uniform-weighted and length-weighted edge-loss algorithms as well as uniform-weighted and area-weighted triangle-loss algorithms. We conduct these optimizations via standard linear programming methods, applying general-purpose solvers to optimize over column bases of simplicial boundary matrices.
Our key findings are: (i) optimization is effective in reducing the size of cycle representatives, (ii) the computational cost of optimizing a basis of cycle representatives exceeds the cost of computing such a basis in most data sets we consider, (iii) the choice of linear solvers matters a lot to the computation time of optimizing cycles, (iv) the computation time of solving an integer program is not significantly longer than the computation time of solving a linear program for most of the cycle representatives, using the Gurobi linear solver, (v) strikingly, whether requiring integer solutions or not, we almost always obtain a solution with the same cost and almost all solutions found have entries in {-1, 0, 1} and therefore, are also solutions to a restricted $\ell_0$ optimization problem, and (vi) we obtain qualitatively different results for generators in Erdős-Rényi random clique complexes.
△ Less
Submitted 17 October, 2021; v1 submitted 14 May, 2021;
originally announced May 2021.
-
Matroids and Canonical Forms: Theory and Applications
Authors:
Gregory Henselman-Petrusek
Abstract:
This thesis proposes a combinatorial generalization of a nilpotent operator on a vector space. The resulting object is highly natural, with basic connections to a variety of fields in pure mathematics, engineering, and the sciences. For the purpose of exposition we focus the discussion of applications on homological algebra and computation, with additional remarks in lattice theory, linear algebra…
▽ More
This thesis proposes a combinatorial generalization of a nilpotent operator on a vector space. The resulting object is highly natural, with basic connections to a variety of fields in pure mathematics, engineering, and the sciences. For the purpose of exposition we focus the discussion of applications on homological algebra and computation, with additional remarks in lattice theory, linear algebra, and abelian categories. For motivation, we recall that the methods of algebraic topology have driven remarkable progress in the qualitative study of large, noisy bodies of data over the past 15 years. A primary tool in Topological Data Analysis [TDA] is the homological persistence module, which leverages categorical structure to compare algebraic shape descriptors across multiple scales of measurement. Our principle application to computation is a novel algorithm to calculate persistent homology which, in certain cases, improves the state of the art by several orders of magnitude. Included are novel results in discrete, spectral, and algebraic Morse theory, and on the strong maps of matroid theory. The defining theme throughout is interplay between the combinatorial theory matroids and the algebraic theory of categories. The nature of these interactions is remarkably simple, but their consequences in homological algebra, quiver theory, and combinatorial optimization represent new and widely open fields for interaction between the disciplines.
△ Less
Submitted 19 April, 2020; v1 submitted 17 October, 2017;
originally announced October 2017.