-
A topological selection of folding pathways from native states of knotted proteins
Authors:
Agnese Barbensi,
Naya Yerolemou,
Oliver Vipond,
Barbara I. Mahler,
Pawel Dabrowski-Tumanski,
Dimos Goundaroulis
Abstract:
Understanding the biological function of knots in proteins and their folding process is an open and challenging question in biology. Recent studies classify the topology and geometry of knotted proteins by analysing the distribution of a protein's planar projections using topological objects called knotoids. We approach the analysis of proteins with the same topology by introducing a topologically inspired statistical metric between their knotoid distributions. We detect geometric differences between trefoil proteins by characterising their entanglement, and we recover a clustering by sequence similarity. By looking directly at the geometry and topology of their native states, we are able to probe different folding pathways for proteins forming open-ended trefoil knots. Interestingly, our pipeline reveals that the folding pathway of shallowly knotted Carbonic Anhydrases involves the creation of a double-looped structure, unlike what was previously observed for deeply knotted trefoil proteins. We validate this with Molecular Dynamics simulations.
Submitted 21 April, 2021;
originally announced April 2021.
-
Local Equivalence of Metrics for Multiparameter Persistence Modules
Authors:
Oliver Vipond
Abstract:
An ideal invariant for multiparameter persistence would be discriminative, computable and stable. In this work we analyse the discriminative power of a stable, computable invariant of multiparameter persistence modules: the fibered bar code. The fibered bar code is equivalent to the rank invariant and encodes the bar codes of the 1-parameter submodules of a multiparameter module. This invariant is well known to be globally incomplete. However, we show that the fibered bar code is locally complete for finitely presented modules by establishing a local equivalence of metrics between the interleaving distance (which is complete on finitely presented modules) and the matching distance on fibered bar codes. More precisely, we show that for a finitely presented multiparameter module $M$ there is a neighbourhood of $M$ in the interleaving distance $d_I$ on which the matching distance $d_0$ satisfies the bi-Lipschitz inequalities $\frac{1}{34}d_I(M,N) \leq d_0(M,N) \leq d_I(M,N)$ for all $N$ in this neighbourhood. As a consequence, no other module in this neighbourhood has the same fibered bar code as $M$.
Submitted 24 April, 2020;
originally announced April 2020.
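The final claim of the abstract above follows from the stated inequalities in one step. The following is a sketch of that reasoning rather than the paper's own proof; it assumes that $N$ is finitely presented and lies in the neighbourhood of $M$, and that two modules with the same fibered bar code have matching distance zero:

$$
d_0(M,N) = 0 \;\Longrightarrow\; d_I(M,N) \leq 34\, d_0(M,N) = 0 \;\Longrightarrow\; M \cong N,
$$

where the last implication uses the completeness of $d_I$ on finitely presented modules, as stated in the abstract.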
-
Random Čech Complexes on Manifolds with Boundary
Authors:
Henry-Louis de Kergorlay,
Ulrike Tillmann,
Oliver Vipond
Abstract:
Let $M$ be a compact, unit volume, Riemannian manifold with boundary. In this paper we study the homology of a random Čech-complex generated by a homogeneous Poisson process in $M$. Our main results are two asymptotic threshold formulas, an upper threshold above which the Čech complex recovers the $k$-th homology of $M$ with high probability, and a lower threshold below which it almost certainly does not. These thresholds are close together in the sense that they have the same leading term. Here $k$ is positive and strictly less than the dimension $d$ of the manifold.
This extends work of Bobrowski and Weinberger [BW17] and Bobrowski and Oliveira [BO19], who establish similar formulas when $M$ is a torus and, more generally, is closed and has no boundary. We note that the cases with and without boundary lead to different answers: the corresponding common leading terms for the upper and lower thresholds differ, being $\log (n)$ when $M$ is closed and $(2-2/d)\log (n)$ when $M$ has boundary; here $n$ is the expected number of sample points. Our analysis identifies a special type of homological cycle, which we call a $\Theta$-like-cycle, that occurs close to the boundary, and establishes that the first order term of the lower threshold is $(2-2/d)\log (n)$.
Submitted 18 June, 2019;
originally announced June 2019.
-
Multiparameter Persistence Landscapes
Authors:
Oliver Vipond
Abstract:
An important problem in the field of Topological Data Analysis is defining topological summaries which can be combined with traditional data analytic tools. In recent work, Bubenik introduced the persistence landscape, a stable representation of persistence diagrams amenable to statistical analysis and machine learning tools. In this paper we generalise the persistence landscape to multiparameter persistence modules, providing a stable representation of the rank invariant. We show that multiparameter landscapes are stable with respect to the interleaving distance and persistence weighted Wasserstein distance, and that the collection of multiparameter landscapes faithfully represents the rank invariant. Finally, we provide example calculations and statistical tests to demonstrate a range of potential applications and how one can interpret the landscapes associated to a multiparameter module.
Submitted 24 December, 2018;
originally announced December 2018.
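For orientation, the single-parameter persistence landscape that the abstract above generalises can be sketched in a few lines. This is a minimal illustration of the standard 1-parameter definition (the $k$-th largest "tent" function over the birth-death pairs of a diagram), not the multiparameter construction of the paper; the function names `tent` and `landscape` are hypothetical and chosen only for this example.

```python
def tent(birth, death, t):
    """Height at t of the tent function for one (birth, death) pair."""
    return max(0.0, min(t - birth, death - t))


def landscape(pairs, k, ts):
    """Evaluate the k-th landscape function (k = 1, 2, ...) at each t in ts.

    pairs: iterable of (birth, death) tuples from a persistence diagram.
    Returns a list with the k-th largest tent height at each t,
    or 0.0 where fewer than k pairs exist.
    """
    pairs = list(pairs)
    values = []
    for t in ts:
        # Sort the tent heights at t from largest to smallest.
        heights = sorted((tent(b, d, t) for b, d in pairs), reverse=True)
        values.append(heights[k - 1] if k <= len(heights) else 0.0)
    return values


if __name__ == "__main__":
    diagram = [(0, 4), (1, 3)]  # toy diagram, made up for illustration
    grid = [0, 1, 2, 3, 4]
    for k in (1, 2, 3):
        print(k, landscape(diagram, k, grid))
```

Because each landscape is an ordinary real-valued function sampled on a grid, the outputs can be averaged or compared pointwise, which is what makes this kind of summary convenient for statistical tools.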