-
A topological data analysis based classification method for multiple measurements
Authors:
Henri Riihimäki,
Wojciech Chachólski,
Jakob Theorell,
Jan Hillert,
Ryan Ramanujam
Abstract:
Machine learning models for repeated measurements are limited. Using topological data analysis (TDA), we present a classifier for repeated measurements which samples from the data space and builds a network graph based on the data topology. When applying this to two case studies, accuracy exceeds alternative models with additional benefits such as reporting data subsets with high purity along with…
▽ More
Machine learning models for repeated measurements are limited. Using topological data analysis (TDA), we present a classifier for repeated measurements which samples from the data space and builds a network graph based on the data topology. When applying this to two case studies, accuracy exceeds alternative models with additional benefits such as reporting data subsets with high purity along with feature values. For 300 examples of 3 tree species, the accuracy reached 80% after 30 datapoints, which was improved to 90% after increased sampling to 400 datapoints. Using data from 100 examples of each of 6 point processes, the classifier achieved 96.8% accuracy. In both datasets, the TDA classifier outperformed an alternative model. This algorithm and software can be beneficial for repeated measurement data common in biological sciences, as both an accurate classifier and a feature selection tool.
△ Less
Submitted 5 April, 2019;
originally announced April 2019.
-
Generalized persistence analysis based on stable rank invariant
Authors:
Henri Riihimäki,
Wojciech Chacholski
Abstract:
We believe three ingredients are needed for further progress in persistence and its use: invariants not relying on decomposition theorems to go beyond 1-dimension, outcomes suitable for statistical analysis and a setup adopted for supervised and machine learning. Stable rank, a continuous invariant for multidimensional persistence, was introduced in W. Chacholski et al. - Multidimensional persiste…
▽ More
We believe three ingredients are needed for further progress in persistence and its use: invariants not relying on decomposition theorems to go beyond 1-dimension, outcomes suitable for statistical analysis and a setup adopted for supervised and machine learning. Stable rank, a continuous invariant for multidimensional persistence, was introduced in W. Chacholski et al. - Multidimensional persistence and noise, 2017. In the current paper we continue this work by demonstrating how one builds an efficient computational pipeline around this invariant and uses it in inference in case of one parameter. We demonstrate some computational evidence of the statistical stability of stable rank. We also show how our framework can be used in supervised learning.
△ Less
Submitted 13 June, 2018;
originally announced July 2018.
-
Multidimensional Persistence and Noise
Authors:
Martina Scolamiero,
Wojciech Chachólski,
Anders Lundman,
Ryan Ramanujam,
Sebastian Öberg
Abstract:
In this paper we study multidimensional persistence modules [5,13] via what we call tame functors and noise systems. A noise system leads to a pseudo-metric topology on the category of tame functors. We show how this pseudo-metric can be used to identify persistent features of compact multidimensional persistence modules. To count such features we introduce the feature counting invariant and prove…
▽ More
In this paper we study multidimensional persistence modules [5,13] via what we call tame functors and noise systems. A noise system leads to a pseudo-metric topology on the category of tame functors. We show how this pseudo-metric can be used to identify persistent features of compact multidimensional persistence modules. To count such features we introduce the feature counting invariant and prove that assigning this invariant to compact tame functors is a 1-Lipschitz operation. For 1-dimensional persistence, we explain how, by choosing an appropriate noise system, the feature counting invariant identifies the same persistent features as the classical barcode construction.
△ Less
Submitted 15 August, 2016; v1 submitted 26 May, 2015;
originally announced May 2015.
-
Combinatorial presentation of multidimensional persistent homology
Authors:
Wojciech Chacholski,
Martina Scolamiero,
Francesco Vaccarino
Abstract:
A multifiltration is a functor indexed by $\mathbb{N}^r$ that maps any morphism to a monomorphism. The goal of this paper is to describe in an explicit and combinatorial way the natural $\mathbb{N}^r$-graded $R[x_1,\ldots, x_r]$-module structure on the homology of a multifiltration of simplicial complexes. To do that we study multifiltrations of sets and vector spaces. We prove in particular that…
▽ More
A multifiltration is a functor indexed by $\mathbb{N}^r$ that maps any morphism to a monomorphism. The goal of this paper is to describe in an explicit and combinatorial way the natural $\mathbb{N}^r$-graded $R[x_1,\ldots, x_r]$-module structure on the homology of a multifiltration of simplicial complexes. To do that we study multifiltrations of sets and vector spaces. We prove in particular that the $\mathbb{N}^r$-graded $R[x_1,\ldots, x_r]$-modules that can occur as $R$-spans of multifiltrations of sets are the direct sums of monomial ideals.
△ Less
Submitted 28 September, 2014;
originally announced September 2014.