-
Bi-Filtration and Stability of TDA Mapper for Point Cloud Data
Authors:
Wako Bungula,
Isabel Darcy
Abstract:
Carlsson, Singh and Memoli's TDA mapper takes a point cloud dataset and outputs a graph that depends on several parameter choices. Dey, Memoli, and Wang developed Multiscale Mapper for abstract topological spaces so that parameter choices can be analyzed via persistent homology. However, when applied to actual data, one does not always obtain filtrations of mapper graphs. DBSCAN, one of the most common clustering algorithms used in the TDA mapper software, has two parameters, $ε$ and MinPts. If MinPts = 1 then DBSCAN is equivalent to single linkage clustering with cutting height $ε$. We show that if DBSCAN clustering is used with MinPts > 2, a filtration of mapper graphs may not exist except in the absence of free-border points; but such filtrations exist if DBSCAN clustering is used with MinPts = 1 or 2 as the cover size increases, $ε$ increases, and/or MinPts decreases. However, the 1-dimensional filtration is unstable. If one adds noise to a data set so that each data point has been perturbed by a distance at most $δ$, the persistent homology of the mapper graph of the perturbed data set can be significantly different from that of the original data set. We show that we can obtain stability by increasing both the cover size and $ε$ at the same time. In particular, we show that the bi-filtrations of the homology groups with respect to cover size and $ε$ between these two datasets are $2δ$-interleaved.
Submitted 25 September, 2024;
originally announced September 2024.
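The abstract above states that DBSCAN with MinPts = 1 is equivalent to single linkage clustering with cutting height $ε$. The following is a minimal sketch of that claim on a toy point cloud, not the authors' code. It assumes the common convention that a point counts as its own neighbor and that distances equal to $ε$ count as "within $ε$"; the function names `dbscan`, `single_linkage`, and `as_partition`, and the sample points, are hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import dist


def dbscan(points, eps, min_pts):
    """Textbook DBSCAN. Returns one label per point; -1 marks noise."""
    n = len(points)
    # eps-neighborhoods; each point is counted as its own neighbor (assumption).
    nbrs = [[j for j in range(n) if dist(points[i], points[j]) <= eps]
            for i in range(n)]
    core = [len(nb) >= min_pts for nb in nbrs]
    labels = [-1] * n
    cluster = 0
    for i in range(n):
        if labels[i] != -1 or not core[i]:
            continue
        # Grow a new cluster outward from the unlabeled core point i.
        labels[i] = cluster
        stack = [i]
        while stack:
            p = stack.pop()
            if not core[p]:
                continue  # border points join a cluster but do not expand it
            for q in nbrs[p]:
                if labels[q] == -1:
                    labels[q] = cluster
                    stack.append(q)
        cluster += 1
    return labels


def single_linkage(points, eps):
    """Single linkage cut at height eps, computed as the connected
    components of the graph joining points at distance <= eps."""
    parent = list(range(len(points)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in combinations(range(len(points)), 2):
        if dist(points[i], points[j]) <= eps:
            parent[find(i)] = find(j)
    return [find(i) for i in range(len(points))]


def as_partition(labels):
    """Turn a label list into a set of index sets, ignoring label names."""
    groups = {}
    for idx, lab in enumerate(labels):
        groups.setdefault(lab, set()).add(idx)
    return {frozenset(g) for g in groups.values()}


# Hypothetical toy data: three well-separated groups on a line.
pts = [(0, 0), (1, 0), (2, 0), (5, 0), (6, 0), (10, 0)]
same = as_partition(dbscan(pts, 1.5, 1)) == as_partition(single_linkage(pts, 1.5))
print(same)
```

With MinPts = 1 every point is a core point, so no point is labeled noise and both procedures reduce to taking connected components of the $ε$-neighborhood graph. Raising `min_pts` turns some points into border or noise points, which is the regime where the abstract reports that filtrations can fail.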
-
Modeling knotted proteins with tangles
Authors:
Isabel K. Darcy,
Garrett Jones,
Puttipong Pongtanapaisan
Abstract:
Although rare, an increasing number of proteins have been observed to contain entanglements in their native structures. To gain more insight into the significance of protein knotting, researchers have been investigating protein knot formation using both experimental and theoretical methods. Motivated by the hypothesized folding pathway of the $α$-haloacid dehalogenase (DehI) protein, Flapan, He, and Wong proposed a theory of how protein knots form, which includes existing folding pathways described by Taylor and Bölinger et al. as special cases. In their topological descriptions, two loops in an unknotted open protein chain containing at most two twists each come close together, and one end of the protein eventually passes through the two loops. In this paper, we build on Flapan, He, and Wong's theory by paying attention to the crossing signs of the threading process and assuming that the unknotted protein chain may arrange itself into a more complicated configuration before threading occurs. We then apply tangle calculus, originally developed by Ernst and Sumners to analyze the action of specific proteins on DNA, to give all possible knots or knotoids that may be discovered in the future according to our model and to give recipes for engineering specific knots in proteins from simpler pieces. We show why twist knots are the most likely knots to occur in proteins. We use chirality to show that the most likely knots to occur in proteins via Taylor's twisted hairpin model are the knots $+3_1$, $4_1$, and $-5_2$.
Submitted 7 November, 2022;
originally announced November 2022.
-
Rational tangle surgery and Xer recombination on catenanes
Authors:
Isabel K. Darcy,
Kai Ishihara,
Ram K. Medikonduri,
Koya Shimokawa
Abstract:
The protein recombinase can change the knot type of circular DNA. The action of a recombinase converting one knot into another knot is normally mathematically modeled by band surgery. Band surgeries on a 2-bridge knot N((4mn-1)/(2m)) yielding a (2,2k)-torus link are characterized. We apply this and other rational tangle surgery results to analyze Xer recombination on DNA catenanes using the tangle model for protein-bound DNA.
Submitted 2 August, 2011;
originally announced August 2011.
-
Tangle analysis of difference topology experiments: applications to a Mu protein-DNA complex
Authors:
Isabel K. Darcy,
John Luecke,
Mariel Vazquez
Abstract:
We develop topological methods for analyzing difference topology experiments involving 3-string tangles. Difference topology is a novel technique used to unveil the structure of stable protein-DNA complexes involving two or more DNA segments. We analyze such experiments for the Mu protein-DNA complex. We characterize the solutions to the corresponding tangle equations by certain knotted graphs. By investigating planarity conditions on these graphs, we show that there is a unique biologically relevant solution. That is, we show there is a unique rational tangle solution, which is also the unique solution with small crossing number.
Submitted 22 October, 2007;
originally announced October 2007.
-
Coloring $n$-String Tangles
Authors:
Isabel K. Darcy,
Junalyn Navarra-Madsen
Abstract:
This expository paper describes how the knot invariant Fox coloring can be applied to tangles.
Submitted 20 September, 2006;
originally announced September 2006.