-
Decomposing the Persistent Homology Transform of Star-Shaped Objects
Authors:
Shreya Arya,
Barbara Giunti,
Abigail Hickok,
Lida Kanari,
Sarah McGuire,
Katharine Turner
Abstract:
In this paper, we study the geometric decomposition of the degree-$0$ Persistent Homology Transform (PHT), viewed as a persistence diagram bundle. We focus on star-shaped objects because they can be segmented into smaller, simpler regions known as ``sectors''. Algebraically, we demonstrate that the degree-$0$ persistence diagram of a star-shaped object in $\mathbb{R}^2$ can be derived from the degree-$0$ persistence diagrams of its sectors. Using this, we then establish sufficient conditions for star-shaped objects in $\mathbb{R}^2$ to have ``trivial geometric monodromy''. Consequently, the PHT of such a shape can be decomposed as a union of curves parameterized by $S^1$, where each curve traces the continuous movement of one point in the persistence diagrams as the direction varies over $S^1$. Finally, we discuss the current challenges of generalizing these results to higher dimensions.
Submitted 27 August, 2024;
originally announced August 2024.
-
Do Neural Networks Trained with Topological Features Learn Different Internal Representations?
Authors:
Sarah McGuire,
Shane Jackson,
Tegan Emerson,
Henry Kvinge
Abstract:
There is a growing body of work that leverages features extracted via topological data analysis to train machine learning models. While this field, sometimes known as topological machine learning (TML), has seen some notable successes, an understanding of how the process of learning from topological features differs from the process of learning from raw data is still limited. In this work, we begin to address one component of this larger issue by asking whether a model trained with topological features learns internal representations of data that are fundamentally different from those learned by a model trained with the original raw data. To quantify ``different'', we exploit two popular metrics that can be used to measure the similarity of the hidden representations of data within neural networks: neural stitching and centered kernel alignment. From these we draw a range of conclusions about how training with topological features does and does not change the representations that a model learns. Perhaps unsurprisingly, we find that structurally, the hidden representations of models trained and evaluated on topological features differ substantially from those of models trained and evaluated on the corresponding raw data. On the other hand, our experiments show that in some cases, these representations can be reconciled (at least to the degree required to solve the corresponding task) using a simple affine transformation. We conjecture that this means that neural networks trained on raw data may extract some limited topological features in the process of making predictions.
Submitted 14 November, 2022;
originally announced November 2022.
-
Log-rank and lifting for AND-functions
Authors:
Alexander Knop,
Shachar Lovett,
Sam McGuire,
Weiqiang Yuan
Abstract:
Let $f: \{0,1\}^n \to \{0, 1\}$ be a boolean function, and let $f_\land (x, y) = f(x \land y)$ denote the AND-function of $f$, where $x \land y$ denotes bit-wise AND. We study the deterministic communication complexity of $f_\land$ and show that, up to a $\log n$ factor, it is bounded by a polynomial in the logarithm of the real rank of the communication matrix of $f_\land$. This comes within a $\log n$ factor of establishing the log-rank conjecture for AND-functions with no assumptions on $f$. Our result stands in contrast with previous results on special cases of the log-rank conjecture, which needed significant restrictions on $f$ such as monotonicity or low $\mathbb{F}_2$-degree. Our techniques can also be used to prove (within a $\log n$ factor) a lifting theorem for AND-functions, stating that the deterministic communication complexity of $f_\land$ is polynomially related to the AND-decision tree complexity of $f$.
The results rely on a new structural result regarding boolean functions $f:\{0, 1\}^n \to \{0, 1\}$ with a sparse polynomial representation, which may be of independent interest. We show that if the polynomial computing $f$ has few monomials then the set system of the monomials has a small hitting set, of size poly-logarithmic in its sparsity. We also establish extensions of this result to multi-linear polynomials $f:\{0,1\}^n \to \mathbb{R}$ with a larger range.
Submitted 22 October, 2020; v1 submitted 18 October, 2020;
originally announced October 2020.
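As an illustration of the objects named in the abstract above (not code from the paper), the following minimal sketch builds the communication matrix of $f_\land$ for a small $f$ and compares its real rank with the number of monomials in the multilinear polynomial computing $f$. The two agree because $f(x \land y) = \sum_S a_S \prod_{i \in S} x_i y_i$ writes the matrix as a sum of rank-one terms, one per nonzero coefficient $a_S$. The helper names `and_matrix`, `monomial_coefficients`, and `rank`, and the choice of example functions, are assumptions made for this sketch.

```python
from fractions import Fraction
from itertools import product


def and_matrix(f, n):
    """Communication matrix of f_and: rows x, columns y, entry f(x AND y)."""
    points = list(product((0, 1), repeat=n))
    return [[f(tuple(a & b for a, b in zip(x, y))) for y in points]
            for x in points]


def monomial_coefficients(f, n):
    """Nonzero coefficients a_S of the multilinear polynomial computing f.

    Uses Moebius inversion over subsets: a_S = sum_{T <= S} (-1)^{|S|-|T|} f(T),
    where S and T are given as 0/1 indicator tuples.
    """
    coeffs = {}
    for s in product((0, 1), repeat=n):
        total = 0
        # Enumerate every t that is coordinatewise below s.
        for t in product(*[(0, 1) if bit else (0,) for bit in s]):
            total += (-1) ** (sum(s) - sum(t)) * f(t)
        if total != 0:
            coeffs[s] = total
    return coeffs


def rank(matrix):
    """Rank over the rationals via exact Gaussian elimination."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            if rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def or2(x):
    """OR on two bits: x1 + x2 - x1*x2 as a polynomial (3 monomials)."""
    return x[0] | x[1]


def maj3(x):
    """Majority on three bits: x1x2 + x1x3 + x2x3 - 2*x1x2x3 (4 monomials)."""
    return 1 if sum(x) >= 2 else 0


if __name__ == "__main__":
    for name, f, n in (("or2", or2, 2), ("maj3", maj3, 3)):
        print(name, rank(and_matrix(f, n)), len(monomial_coefficients(f, n)))
```

The matrix has $2^n$ rows and columns, so this brute-force check is only practical for very small $n$; it is meant to make the definitions concrete, not to reflect the techniques of the paper.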