-
The Kinetic Hourglass Data Structure for Computing the Bottleneck Distance of Dynamic Data
Authors:
Elizabeth Munch,
Elena Xinyi Wang,
Carola Wenk
Abstract:
The kinetic data structure (KDS) framework is a powerful tool for maintaining various geometric configurations of continuously moving objects. In this work, we introduce the kinetic hourglass, a novel KDS implementation designed to compute the bottleneck distance for geometric matching problems. We detail the events and updates required for handling general graphs, accompanied by a complexity anal…
▽ More
The kinetic data structure (KDS) framework is a powerful tool for maintaining various geometric configurations of continuously moving objects. In this work, we introduce the kinetic hourglass, a novel KDS implementation designed to compute the bottleneck distance for geometric matching problems. We detail the events and updates required for handling general graphs, accompanied by a complexity analysis. Furthermore, we demonstrate the utility of the kinetic hourglass by applying it to compute the bottleneck distance between two persistent homology transforms (PHTs) derived from shapes in $\mathbb{R}^2$, which are topological summaries obtained by computing persistent homology from every direction in $\mathbb{S}^1$.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
The Patterns of Life Human Mobility Simulation
Authors:
Hossein Amiri,
Will Kohn,
Shiyang Ruan,
Joon-Seok Kim,
Hamdi Kavak,
Andrew Crooks,
Dieter Pfoser,
Carola Wenk,
Andreas Zufle
Abstract:
We demonstrate the Patterns of Life Simulation to create realistic simulations of human mobility in a city. This simulation has recently been used to generate massive amounts of trajectory and check-in data. Our demonstration focuses on using the simulation twofold: (1) using the graphical user interface (GUI), and (2) running the simulation headless by disabling the GUI for faster data generation…
▽ More
We demonstrate the Patterns of Life Simulation to create realistic simulations of human mobility in a city. This simulation has recently been used to generate massive amounts of trajectory and check-in data. Our demonstration focuses on using the simulation twofold: (1) using the graphical user interface (GUI), and (2) running the simulation headless by disabling the GUI for faster data generation. We further demonstrate how the Patterns of Life simulation can be used to simulate any region on Earth by using publicly available data from OpenStreetMap. Finally, we also demonstrate recent improvements to the scalability of the simulation allows simulating up to 100,000 individual agents for years of simulation time. During our demonstration, as well as offline using our guides on GitHub, participants will learn: (1) The theories of human behavior driving the Patters of Life simulation, (2) how to simulate to generate massive amounts of synthetic yet realistic trajectory data, (3) running the simulation for a region of interest chosen by participants using OSM data, (4) learn the scalability of the simulation and understand the properties of generated data, and (5) manage thousands of parallel simulation instances running concurrently.
△ Less
Submitted 11 October, 2024; v1 submitted 30 September, 2024;
originally announced October 2024.
-
Rapid and Precise Topological Comparison with Merge Tree Neural Networks
Authors:
Yu Qin,
Brittany Terese Fasy,
Carola Wenk,
Brian Summa
Abstract:
Merge trees are a valuable tool in the scientific visualization of scalar fields; however, current methods for merge tree comparisons are computationally expensive, primarily due to the exhaustive matching between tree nodes. To address this challenge, we introduce the Merge Tree Neural Network (MTNN), a learned neural network model designed for merge tree comparison. The MTNN enables rapid and hi…
▽ More
Merge trees are a valuable tool in the scientific visualization of scalar fields; however, current methods for merge tree comparisons are computationally expensive, primarily due to the exhaustive matching between tree nodes. To address this challenge, we introduce the Merge Tree Neural Network (MTNN), a learned neural network model designed for merge tree comparison. The MTNN enables rapid and high-quality similarity computation. We first demonstrate how to train graph neural networks, which emerged as effective encoders for graphs, in order to produce embeddings of merge trees in vector spaces for efficient similarity comparison. Next, we formulate the novel MTNN model that further improves the similarity comparisons by integrating the tree and node embeddings with a new topological attention mechanism. We demonstrate the effectiveness of our model on real-world data in different domains and examine our model's generalizability across various datasets. Our experimental analysis demonstrates our approach's superiority in accuracy and efficiency. In particular, we speed up the prior state-of-the-art by more than $100\times$ on the benchmark datasets while maintaining an error rate below $0.1\%$.
△ Less
Submitted 4 October, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Realizability of Free Spaces of Curves
Authors:
Hugo A. Akitaya,
Maike Buchin,
Majid Mirzanezhad,
Leonie Ryvkin,
Carola Wenk
Abstract:
The free space diagram is a popular tool to compute the well-known Fréchet distance. As the Fréchet distance is used in many different fields, many variants have been established to cover the specific needs of these applications. Often, the question arises whether a certain pattern in the free space diagram is "realizable", i.e., whether there exists a pair of polygonal chains whose free space dia…
▽ More
The free space diagram is a popular tool to compute the well-known Fréchet distance. As the Fréchet distance is used in many different fields, many variants have been established to cover the specific needs of these applications. Often, the question arises whether a certain pattern in the free space diagram is "realizable", i.e., whether there exists a pair of polygonal chains whose free space diagram corresponds to it. The answer to this question may help in deciding the computational complexity of these distance measures, as well as allowing to design more efficient algorithms for restricted input classes that avoid certain free space patterns. Therefore, we study the inverse problem: Given a potential free space diagram, do there exist curves that generate this diagram?
Our problem of interest is closely tied to the classic Distance Geometry problem. We settle the complexity of Distance Geometry in $\mathbb{R}^{> 2}$, showing $\exists\mathbb{R}$-hardness. We use this to show that for curves in $\mathbb{R}^{\ge 2}$, the realizability problem is $\exists\mathbb{R}$-complete, both for continuous and for discrete Fréchet distance. We prove that the continuous case in $\mathbb{R}^1$ is only weakly NP-hard, and we provide a pseudo-polynomial time algorithm and show that it is fixed-parameter tractable. Interestingly, for the discrete case in $\mathbb{R}^1$, we show that the problem becomes solvable in polynomial time.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Visualizing Topological Importance: A Class-Driven Approach
Authors:
Yu Qin,
Brittany Terese Fasy,
Carola Wenk,
Brian Summa
Abstract:
This paper presents the first approach to visualize the importance of topological features that define classes of data. Topological features, with their ability to abstract the fundamental structure of complex data, are an integral component of visualization and analysis pipelines. Although not all topological features present in data are of equal importance. To date, the default definition of fea…
▽ More
This paper presents the first approach to visualize the importance of topological features that define classes of data. Topological features, with their ability to abstract the fundamental structure of complex data, are an integral component of visualization and analysis pipelines. Although not all topological features present in data are of equal importance. To date, the default definition of feature importance is often assumed and fixed. This work shows how proven explainable deep learning approaches can be adapted for use in topological classification. In doing so, it provides the first technique that illuminates what topological structures are important in each dataset in regards to their class label. In particular, the approach uses a learned metric classifier with a density estimator of the points of a persistence diagram as input. This metric learns how to reweigh this density such that classification accuracy is high. By extracting this weight, an importance field on persistent point density can be created. This provides an intuitive representation of persistence point importance that can be used to drive new visualizations. This work provides two examples: Visualization on each diagram directly and, in the case of sublevel set filtrations on images, directly on the images themselves. This work highlights real-world examples of this approach visualizing the important topological features in graph, 3D shape, and medical image data.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
From Curves to Words and Back Again: Geometric Computation of Minimum-Area Homotopy
Authors:
Hsien-Chih Chang,
Brittany Terese Fasy,
Bradley McCoy,
David L. Millman,
Carola Wenk
Abstract:
Let $γ$ be a generic closed curve in the plane. Samuel Blank, in his 1967 Ph.D. thesis, determined if $γ$ is self-overlapping by geometrically constructing a combinatorial word from $γ$. More recently, Zipei Nie, in an unpublished manuscript, computed the minimum homotopy area of $γ$ by constructing a combinatorial word algebraically. We provide a unified framework for working with both words and…
▽ More
Let $γ$ be a generic closed curve in the plane. Samuel Blank, in his 1967 Ph.D. thesis, determined if $γ$ is self-overlapping by geometrically constructing a combinatorial word from $γ$. More recently, Zipei Nie, in an unpublished manuscript, computed the minimum homotopy area of $γ$ by constructing a combinatorial word algebraically. We provide a unified framework for working with both words and determine the settings under which Blank's word and Nie's word are equivalent. Using this equivalence, we give a new geometric proof for the correctness of Nie's algorithm. Unlike previous work, our proof is constructive which allows us to naturally compute the actual homotopy that realizes the minimum area. Furthermore, we contribute to the theory of self-overlapping curves by providing the first polynomial-time algorithm to compute a self-overlapping decomposition of any closed curve $γ$ with minimum area.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Metric and Path-Connectedness Properties of the Frechet Distance for Paths and Graphs
Authors:
Erin Chambers,
Brittany Fasy,
Benjamin Holmgren,
Sushovan Majhi,
Carola Wenk
Abstract:
The Frechet distance is often used to measure distances between paths, with applications in areas ranging from map matching to GPS trajectory analysis to handwriting recognition. More recently, the Frechet distance has been generalized to a distance between two copies of the same graph embedded or immersed in a metric space; this more general setting opens up a wide range of more complex applicati…
▽ More
The Frechet distance is often used to measure distances between paths, with applications in areas ranging from map matching to GPS trajectory analysis to handwriting recognition. More recently, the Frechet distance has been generalized to a distance between two copies of the same graph embedded or immersed in a metric space; this more general setting opens up a wide range of more complex applications in graph analysis. In this paper, we initiate a study of some of the fundamental topological properties of spaces of paths and of graphs mapped to R^n under the Frechet distance, in an effort to lay the theoretical groundwork for understanding how these distances can be used in practice. In particular, we prove whether or not these spaces, and the metric balls therein, are path-connected.
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
Towards Mobility Data Science (Vision Paper)
Authors:
Mohamed Mokbel,
Mahmoud Sakr,
Li Xiong,
Andreas Züfle,
Jussara Almeida,
Taylor Anderson,
Walid Aref,
Gennady Andrienko,
Natalia Andrienko,
Yang Cao,
Sanjay Chawla,
Reynold Cheng,
Panos Chrysanthis,
Xiqi Fei,
Gabriel Ghinita,
Anita Graser,
Dimitrios Gunopulos,
Christian Jensen,
Joon-Seok Kim,
Kyoung-Sook Kim,
Peer Kröger,
John Krumm,
Johannes Lauer,
Amr Magdy,
Mario Nascimento
, et al. (23 additional authors not shown)
Abstract:
Mobility data captures the locations of moving objects such as humans, animals, and cars. With the availability of GPS-equipped mobile devices and other inexpensive location-tracking technologies, mobility data is collected ubiquitously. In recent years, the use of mobility data has demonstrated significant impact in various domains including traffic management, urban planning, and health sciences…
▽ More
Mobility data captures the locations of moving objects such as humans, animals, and cars. With the availability of GPS-equipped mobile devices and other inexpensive location-tracking technologies, mobility data is collected ubiquitously. In recent years, the use of mobility data has demonstrated significant impact in various domains including traffic management, urban planning, and health sciences. In this paper, we present the emerging domain of mobility data science. Towards a unified approach to mobility data science, we envision a pipeline having the following components: mobility data collection, cleaning, analysis, management, and privacy. For each of these components, we explain how mobility data science differs from general data science, we survey the current state of the art and describe open challenges for the research community in the coming years.
△ Less
Submitted 7 March, 2024; v1 submitted 21 June, 2023;
originally announced July 2023.
-
Distance Measures for Geometric Graphs
Authors:
Sushovan Majhi,
Carola Wenk
Abstract:
A geometric graph is a combinatorial graph, endowed with a geometry that is inherited from its embedding in a Euclidean space. Formulation of a meaningful measure of (dis-)similarity in both the combinatorial and geometric structures of two such geometric graphs is a challenging problem in pattern recognition. We study two notions of distance measures for geometric graphs, called the geometric edi…
▽ More
A geometric graph is a combinatorial graph, endowed with a geometry that is inherited from its embedding in a Euclidean space. Formulation of a meaningful measure of (dis-)similarity in both the combinatorial and geometric structures of two such geometric graphs is a challenging problem in pattern recognition. We study two notions of distance measures for geometric graphs, called the geometric edit distance (GED) and geometric graph distance (GGD). While the former is based on the idea of editing one graph to transform it into the other graph, the latter is inspired by inexact matching of the graphs. For decades, both notions have been lending themselves well as measures of similarity between attributed graphs. If used without any modification, however, they fail to provide a meaningful distance measure for geometric graphs -- even cease to be a metric. We have curated their associated cost functions for the context of geometric graphs. Alongside studying the metric properties of GED and GGD, we investigate how the two notions compare. We further our understanding of the computational aspects of GGD by showing that the distance is $\mathcal{NP}$-hard to compute, even if the graphs are planar and arbitrary cost coefficients are allowed.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Minimum-Complexity Graph Simplification under Fréchet-Like Distances
Authors:
Omrit Filtser,
Majid Mirzanezhad,
Carola Wenk
Abstract:
Simplifying graphs is a very applicable problem in numerous domains, especially in computational geometry. Given a geometric graph and a threshold, the minimum-complexity graph simplification asks for computing an alternative graph of minimum complexity so that the distance between the two graphs remains at most the threshold. In this paper, we propose several NP-hardness and algorithmic results d…
▽ More
Simplifying graphs is a very applicable problem in numerous domains, especially in computational geometry. Given a geometric graph and a threshold, the minimum-complexity graph simplification asks for computing an alternative graph of minimum complexity so that the distance between the two graphs remains at most the threshold. In this paper, we propose several NP-hardness and algorithmic results depending on the type of input and simplified graphs, the vertex placement of the simplified graph, and the distance measures between them (graph and traversal distances [1,2]). In general, we show that for arbitrary input and output graphs, the problem is NP-hard under some specific vertex-placement of the simplified graph. When the input and output are trees, and the graph distance is applied from the simplified tree to the input tree, we give an $O(kn^5)$ time algorithm, where $k$ is the number of the leaves of the two trees that are identical and $n$ is the number of vertices of the input.
△ Less
Submitted 3 November, 2021;
originally announced November 2021.
-
A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering
Authors:
Yu Qin,
Brittany Terese Fasy,
Carola Wenk,
Brian Summa
Abstract:
Persistence diagrams have been widely used to quantify the underlying features of filtered topological spaces in data visualization. In many applications, computing distances between diagrams is essential; however, computing these distances has been challenging due to the computational cost. In this paper, we propose a persistence diagram hashing framework that learns a binary code representation…
▽ More
Persistence diagrams have been widely used to quantify the underlying features of filtered topological spaces in data visualization. In many applications, computing distances between diagrams is essential; however, computing these distances has been challenging due to the computational cost. In this paper, we propose a persistence diagram hashing framework that learns a binary code representation of persistence diagrams, which allows for fast computation of distances. This framework is built upon a generative adversarial network (GAN) with a diagram distance loss function to steer the learning process. Instead of using standard representations, we hash diagrams into binary codes, which have natural advantages in large-scale tasks. The training of this model is domain-oblivious in that it can be computed purely from synthetic, randomly created diagrams. As a consequence, our proposed method is directly applicable to various datasets without the need for retraining the model. These binary codes, when compared using fast Hamming distance, better maintain topological similarity properties between datasets than other vectorized representations. To evaluate this method, we apply our framework to the problem of diagram clustering and we compare the quality and performance of our approach to the state-of-the-art. In addition, we show the scalability of our approach on a dataset with 10k persistence diagrams, which is not possible with current techniques. Moreover, our experimental results demonstrate that our method is significantly faster with the potential of less memory usage, while retaining comparable or better quality comparisons.
△ Less
Submitted 10 August, 2021; v1 submitted 25 May, 2021;
originally announced May 2021.
-
Combinatorial Properties of Self-Overlapping Curves and Interior Boundaries
Authors:
Parker Evans,
Carola Wenk
Abstract:
We study the interplay between the recently defined concept of minimum homotopy area and the classical topic of self-overlapping curves. The latter are plane curves which are the image of the boundary of an immersed disk. Our first contribution is to prove new sufficient combinatorial conditions for a curve to be self-overlapping. We show that a curve $γ$ with Whitney index 1 and without any self-…
▽ More
We study the interplay between the recently defined concept of minimum homotopy area and the classical topic of self-overlapping curves. The latter are plane curves which are the image of the boundary of an immersed disk. Our first contribution is to prove new sufficient combinatorial conditions for a curve to be self-overlapping. We show that a curve $γ$ with Whitney index 1 and without any self-overlapping subcurves is self-overlapping. As a corollary, we obtain sufficient conditions for self-overlappingness solely in terms of the Whitney index of the curve and its subcurves. These results follow from our second contribution, which shows that any plane curve $γ$, modulo a basepoint condition, is transformed into an interior boundary by wrapping around $γ$ with Jordan curves. Equivalently, the minimum homotopy area of $γ$ is reduced to the minimal possible threshold, namely the winding area, through wrapping. In fact, we show that $n+1$ wraps suffice, where $γ$ has $n$ vertices. Our third contribution is to prove the equivalence of various definitions of self-overlapping curves and interior boundaries, often implicit in the literature. We also introduce and characterize zero-obstinance curves, further generalizations of interior boundaries defined by optimality in minimum homotopy area.
△ Less
Submitted 30 March, 2020;
originally announced March 2020.
-
Simplification of Indoor Space Footprints
Authors:
Joon-Seok Kim,
Carola Wenk
Abstract:
Simplification is one of the fundamental operations used in geoinformation science (GIS) to reduce size or representation complexity of geometric objects. Although different simplification methods can be applied depending on one's purpose, a simplification that many applications employ is designed to preserve their spatial properties after simplification. This article addresses one of the 2D simpl…
▽ More
Simplification is one of the fundamental operations used in geoinformation science (GIS) to reduce size or representation complexity of geometric objects. Although different simplification methods can be applied depending on one's purpose, a simplification that many applications employ is designed to preserve their spatial properties after simplification. This article addresses one of the 2D simplification methods, especially working well on human-made structures such as 2D footprints of buildings and indoor spaces. The method simplifies polygons in an iterative manner. The simplification is segment-wise and takes account of intrusion, extrusion, offset, and corner portions of 2D structures preserving its dominant frame.
△ Less
Submitted 15 January, 2020;
originally announced January 2020.
-
Approximating Gromov-Hausdorff Distance in Euclidean Space
Authors:
Sushovan Majhi,
Jeffrey Vitter,
Carola Wenk
Abstract:
The Gromov-Hausdorff distance $(d_{GH})$ proves to be a useful distance measure between shapes. In order to approximate $d_{GH}$ for compact subsets $X,Y\subset\mathbb{R}^d$, we look into its relationship with $d_{H,iso}$, the infimum Hausdorff distance under Euclidean isometries. As already known for dimension $d\geq 2$, the $d_{H,iso}$ cannot be bounded above by a constant factor times $d_{GH}$.…
▽ More
The Gromov-Hausdorff distance $(d_{GH})$ proves to be a useful distance measure between shapes. In order to approximate $d_{GH}$ for compact subsets $X,Y\subset\mathbb{R}^d$, we look into its relationship with $d_{H,iso}$, the infimum Hausdorff distance under Euclidean isometries. As already known for dimension $d\geq 2$, the $d_{H,iso}$ cannot be bounded above by a constant factor times $d_{GH}$. For $d=1$, however, we prove that $d_{H,iso}\leq\frac{5}{4}d_{GH}$. We also show that the bound is tight. In effect, this gives rise to an $O(n\log{n})$-time algorithm to approximate $d_{GH}$ with an approximation factor of $\left(1+\frac{1}{4}\right)$.
△ Less
Submitted 7 June, 2022; v1 submitted 30 December, 2019;
originally announced December 2019.
-
Topological and Geometric Reconstruction of Metric Graphs in $\mathbb{R}^n$
Authors:
Brittany Terese Fasy,
Rafal Komendarczyk,
Sushovan Majhi,
Carola Wenk
Abstract:
We propose an algorithm to estimate the topology of an embedded metric graph from a well-sampled finite subset of the underlying graph.
We propose an algorithm to estimate the topology of an embedded metric graph from a well-sampled finite subset of the underlying graph.
△ Less
Submitted 6 December, 2019;
originally announced December 2019.
-
Threshold-Based Graph Reconstruction Using Discrete Morse Theory
Authors:
Brittany Terese Fasy,
Sushovan Majhi,
Carola Wenk
Abstract:
Discrete Morse theory has recently been applied in metric graph reconstruction from a given density function concentrated around an (unknown) underlying embedded graph. We propose a new noise model for the density function to reconstruct a connected graph both topologically and geometrically.
Discrete Morse theory has recently been applied in metric graph reconstruction from a given density function concentrated around an (unknown) underlying embedded graph. We propose a new noise model for the density function to reconstruct a connected graph both topologically and geometrically.
△ Less
Submitted 28 November, 2019;
originally announced November 2019.
-
Distance Measures for Embedded Graphs
Authors:
Hugo A. Akitaya,
Maike Buchin,
Bernhard Kilgus,
Stef Sijben,
Carola Wenk
Abstract:
We introduce new distance measures for comparing straight-line embedded graphs based on the Fréchet distance and the weak Fréchet distance. These graph distances are defined using continuous mappings and thus take the combinatorial structure as well as the geometric embeddings of the graphs into account. We present a general algorithmic approach for computing these graph distances. Although we sho…
▽ More
We introduce new distance measures for comparing straight-line embedded graphs based on the Fréchet distance and the weak Fréchet distance. These graph distances are defined using continuous mappings and thus take the combinatorial structure as well as the geometric embeddings of the graphs into account. We present a general algorithmic approach for computing these graph distances. Although we show that deciding the distances is NP-hard for general embedded graphs, we prove that our approach yields polynomial time algorithms if the graphs are trees, and for the distance based on the weak Fréchet distance if the graphs are planar embedded. Moreover, we prove that deciding the distances based on the Fréchet distance remains NP-hard for planar embedded graphs and show how our general algorithmic approach yields an exponential time algorithm and a polynomial time approximation algorithm for this case.
△ Less
Submitted 11 September, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
On the Reconstruction of Geodesic Subspaces of $\mathbb{R}^N$
Authors:
Brittany Terese Fasy,
Rafal Komendarczyk,
Sushovan Majhi,
Carola Wenk
Abstract:
We consider the topological and geometric reconstruction of a geodesic subspace of $\mathbb{R}^N$ both from the Čech and Vietoris-Rips filtrations on a finite, Hausdorff-close, Euclidean sample. Our reconstruction technique leverages the intrinsic length metric induced by the geodesics on the subspace. We consider the distortion and convexity radius as our sampling parameters for a successful reco…
▽ More
We consider the topological and geometric reconstruction of a geodesic subspace of $\mathbb{R}^N$ both from the Čech and Vietoris-Rips filtrations on a finite, Hausdorff-close, Euclidean sample. Our reconstruction technique leverages the intrinsic length metric induced by the geodesics on the subspace. We consider the distortion and convexity radius as our sampling parameters for a successful reconstruction. For a geodesic subspace with finite distortion and positive convexity radius, we guarantee a correct computation of its homotopy and homology groups from the sample. For geodesic subspaces of $\mathbb{R}^2$, we also devise an algorithm to output a homotopy equivalent geometric complex that has a very small Hausdorff distance to the unknown shape of interest.
△ Less
Submitted 23 September, 2022; v1 submitted 23 October, 2018;
originally announced October 2018.
-
Global Curve Simplification
Authors:
Mees van de Kerkhof,
Irina Kostitsyna,
Maarten Löffler,
Majid Mirzanezhad,
Carola Wenk
Abstract:
Due to its many applications, \emph{curve simplification} is a long-studied problem in computational geometry and adjacent disciplines, such as graphics, geographical information science, etc. Given a polygonal curve $P$ with $n$ vertices, the goal is to find another polygonal curve $P'$ with a smaller number of vertices such that $P'$ is sufficiently similar to $P$. Quality guarantees of a simpli…
▽ More
Due to its many applications, \emph{curve simplification} is a long-studied problem in computational geometry and adjacent disciplines, such as graphics, geographical information science, etc. Given a polygonal curve $P$ with $n$ vertices, the goal is to find another polygonal curve $P'$ with a smaller number of vertices such that $P'$ is sufficiently similar to $P$. Quality guarantees of a simplification are usually given in a \emph{local} sense, bounding the distance between a shortcut and its corresponding section of the curve. In this work, we aim to provide a systematic overview of curve simplification problems under \emph{global} distance measures that bound the distance between $P$ and $P'$. We consider six different curve distance measures: three variants of the \emph{Hausdorff} distance and three variants of the \emph{Fréchet} distance. And we study different restrictions on the choice of vertices for $P'$. We provide polynomial-time algorithms for some variants of the global curve simplification problem and show NP-hardness for other variants. Through this systematic study we observe, for the first time, some surprising patterns, and suggest directions for future research in this important area.
△ Less
Submitted 22 January, 2020; v1 submitted 26 September, 2018;
originally announced September 2018.
-
Fast Frechet Distance Between Curves With Long Edges
Authors:
Joachim Gudmundsson,
Majid Mirzanezhad,
Ali Mohades,
Carola Wenk
Abstract:
Computing the Fréchet distance between two polygonal curves takes roughly quadratic time. In this paper, we show that for a special class of curves the Fréchet distance computations become easier. Let $P$ and $Q$ be two polygonal curves in $\mathbb{R}^d$ with $n$ and $m$ vertices, respectively. We prove four results for the case when all edges of both curves are long compared to the Fréchet distan…
▽ More
Computing the Fréchet distance between two polygonal curves takes roughly quadratic time. In this paper, we show that for a special class of curves the Fréchet distance computations become easier. Let $P$ and $Q$ be two polygonal curves in $\mathbb{R}^d$ with $n$ and $m$ vertices, respectively. We prove four results for the case when all edges of both curves are long compared to the Fréchet distance between them: (1) a linear-time algorithm for deciding the Fréchet distance between two curves, (2) an algorithm that computes the Fréchet distance in $O((n+m)\log (n+m))$ time, (3) a linear-time $\sqrt{d}$-approximation algorithm, and (4) a data structure that supports $O(m\log^2 n)$-time decision queries, where $m$ is the number of vertices of the query curve and $n$ the number of vertices of the preprocessed curve.
△ Less
Submitted 26 August, 2019; v1 submitted 28 October, 2017;
originally announced October 2017.
-
On Minimum Area Homotopies of Normal Curves in the Plane
Authors:
Brittany Terese Fasy,
Selcuk Karakoc,
Carola Wenk
Abstract:
In this paper, we study the problem of computing a homotopy from a planar curve $C$ to a point that minimizes the area swept. The existence of such a minimum homotopy is a direct result of the solution of Plateau's problem. Chambers and Wang studied the special case that $C$ is the concatenation of two simple curves, and they gave a polynomial-time algorithm for computing a minimum homotopy in thi…
▽ More
In this paper, we study the problem of computing a homotopy from a planar curve $C$ to a point that minimizes the area swept. The existence of such a minimum homotopy is a direct result of the solution of Plateau's problem. Chambers and Wang studied the special case that $C$ is the concatenation of two simple curves, and they gave a polynomial-time algorithm for computing a minimum homotopy in this setting. We study the general case of a normal curve $C$ in the plane, and provide structural properties of minimum homotopies that lead to an algorithm. In particular, we prove that for any normal curve there exists a minimum homotopy that consists entirely of contractions of self-overlapping sub-curves (i.e., consists of contracting a collection of boundaries of immersed disks).
△ Less
Submitted 7 July, 2017;
originally announced July 2017.
-
Fréchet Distance for Curves, Revisited
Authors:
Boris Aronov,
Sariel Har-Peled,
Christian Knauer,
Yusu Wang,
Carola Wenk
Abstract:
$\renewcommand{\Re}{{\rm I\!\hspace{-0.025em} R}} \newcommand{\eps}{\varepsilon} \newcommand{\SetX}{\mathsf{X}} \newcommand{\VorX}[1]{\mathcal{V} \pth{#1}} \newcommand{\Polygon}{\mathsf{P}} \newcommand{\Space}{\overline{\mathsf{m}}} \newcommand{\pth}[2][\!]{#1\left({#2}\right)}$ We revisit the problem of computing Fréchet distance between polygonal curves under $L_1$, $L_2$, and $L_\infty…
▽ More
$\renewcommand{\Re}{{\rm I\!\hspace{-0.025em} R}} \newcommand{\eps}{\varepsilon} \newcommand{\SetX}{\mathsf{X}} \newcommand{\VorX}[1]{\mathcal{V} \pth{#1}} \newcommand{\Polygon}{\mathsf{P}} \newcommand{\Space}{\overline{\mathsf{m}}} \newcommand{\pth}[2][\!]{#1\left({#2}\right)}$ We revisit the problem of computing Fréchet distance between polygonal curves under $L_1$, $L_2$, and $L_\infty$ norms, focusing on discrete Fréchet distance, where only distance between vertices is considered. We develop efficient algorithms for two natural classes of curves. In particular, given two polygonal curves of $n$ vertices each, a $\eps$-approximation of their discrete Fréchet distance can be computed in roughly $O(nκ^3\log n/\eps^3)$ time in three dimensions, if one of the curves is \emph{$κ$-bounded}. Previously, only a $κ$-approximation algorithm was known. If both curves are the so-called \emph{\backbone~curves}, which are widely used to model protein backbones in molecular biology, we can $\eps$-approximate their Fréchet distance in near linear time in two dimensions, and in roughly $O(n^{4/3}\log nm)$ time in three dimensions. In the second part, we propose a pseudo--output-sensitive algorithm for computing Fréchet distance exactly. The complexity of the algorithm is a function of a quantity we call the \emph{\bwnumber{}}, which is quadratic in the worst case, but tends to be much smaller in practice.
△ Less
Submitted 28 April, 2015;
originally announced April 2015.
-
A Comparison and Evaluation of Map Construction Algorithms
Authors:
Mahmuda Ahmed,
Sophia Karagiorgou,
Dieter Pfoser,
Carola Wenk
Abstract:
Map construction methods automatically produce and/or update road network datasets using vehicle tracking data. Enabled by the ubiquitous generation of georeferenced tracking data, there has been a recent surge in map construction algorithms coming from different computer science domains. A cross-comparison of the various algorithms is still very rare, since (i) algorithms and constructed maps are…
▽ More
Map construction methods automatically produce and/or update road network datasets using vehicle tracking data. Enabled by the ubiquitous generation of georeferenced tracking data, there has been a recent surge in map construction algorithms coming from different computer science domains. A cross-comparison of the various algorithms is still very rare, since (i) algorithms and constructed maps are generally not publicly available and (ii) there is no standard approach to assess the result quality, given the lack of benchmark data and quantitative evaluation methods. This work represents a first comprehensive attempt to benchmark map construction algorithms. We provide an evaluation and comparison of seven algorithms using four datasets and four different evaluation measures. In addition to this comprehensive comparison, we make our datasets, source code of map construction algorithms and evaluation measures publicly available on mapconstruction.org. This site has been established as a repository for map con- struction data and algorithms and we invite other researchers to contribute by uploading code and benchmark data supporting their contributions to map construction algorithms.
△ Less
Submitted 12 June, 2014; v1 submitted 19 February, 2014;
originally announced February 2014.
-
Path-Based Distance for Street Map Comparison
Authors:
Mahmuda Ahmed,
Brittany Terese Fasy,
Kyle S. Hickmann,
Carola Wenk
Abstract:
Comparing two geometric graphs embedded in space is important in the field of transportation network analysis. Given street maps of the same city collected from different sources, researchers often need to know how and where they differ. However, the majority of current graph comparison algorithms are based on structural properties of graphs, such as their degree distribution or their local connec…
▽ More
Comparing two geometric graphs embedded in space is important in the field of transportation network analysis. Given street maps of the same city collected from different sources, researchers often need to know how and where they differ. However, the majority of current graph comparison algorithms are based on structural properties of graphs, such as their degree distribution or their local connectivity properties, and do not consider their spatial embedding. This ignores a key property of road networks since similarity of travel over two road networks is intimately tied to the specific spatial embedding. Likewise, many current street map comparison algorithms focus on the spatial embeddings only and do not take structural properties into account, which makes these algorithms insensitive to local connectivity properties and shortest path similarities. We propose a new path-based distance measure to compare two planar geometric graphs embedded in the plane. Our distance measure takes structural as well as spatial properties into account by imposing a distance measure between two road networks based on the Hausdorff distance between the two sets of travel paths they represent. We show that this distance can be approximated in polynomial time and that it preserves structural and spatial properties of the graphs.
△ Less
Submitted 13 February, 2015; v1 submitted 24 September, 2013;
originally announced September 2013.
-
Simple Curve Embedding
Authors:
Jessica Sherette,
Carola Wenk
Abstract:
Given a curve f and a surface S, how hard is it to find a simple curve f' in S that is the most similar to f?
We introduce and study this simple curve embedding problem for piecewise linear curves and surfaces in R^2 and R^3, under Hausdorff distance, weak Frechet distance, and Frechet distance as similarity measures for curves. Surprisingly, while several variants of the problem turn out to hav…
▽ More
Given a curve f and a surface S, how hard is it to find a simple curve f' in S that is the most similar to f?
We introduce and study this simple curve embedding problem for piecewise linear curves and surfaces in R^2 and R^3, under Hausdorff distance, weak Frechet distance, and Frechet distance as similarity measures for curves. Surprisingly, while several variants of the problem turn out to have polynomial-time solutions, we show that in R^3 the simple curve embedding problem is NP-hard under Frechet distance even if S is a plane, as well as under weak Frechet distance if S is a terrain. Additionally, these results give insight into the difficulty of computing the Frechet distance between surfaces, and they imply that the partial Frechet distance between non-planar surfaces is NP-hard as well.
△ Less
Submitted 4 March, 2013;
originally announced March 2013.
-
Computing the Fréchet Distance Between Folded Polygons
Authors:
Atlas F. Cook IV,
Anne Driemel,
Sariel Har-Peled,
Jessica Sherette,
Carola Wenk
Abstract:
Computing the Fréchet distance for surfaces is a surprisingly hard problem and the only known algorithm is limited to computing it between flat surfaces. We adapt this algorithm to create one for computing the Fréchet distance for a class of non-flat surfaces which we call folded polygons. Unfortunately, the original algorithm cannot be extended directly. We present three different methods to adap…
▽ More
Computing the Fréchet distance for surfaces is a surprisingly hard problem and the only known algorithm is limited to computing it between flat surfaces. We adapt this algorithm to create one for computing the Fréchet distance for a class of non-flat surfaces which we call folded polygons. Unfortunately, the original algorithm cannot be extended directly. We present three different methods to adapt it. The first of which is a fixed-parameter tractable algorithm. The second is a polynomial-time approximation algorithm. Finally, we present a restricted class of folded polygons for which we can compute the Fréchet distance in polynomial time.
△ Less
Submitted 15 March, 2011;
originally announced March 2011.
-
Approximating the Fréchet Distance for Realistic Curves in Near Linear Time
Authors:
Anne Driemel,
Sariel Har-Peled,
Carola Wenk
Abstract:
We present simple and practical $(1+\eps)$-approximation algorithm for the Frechet distance between curves. To analyze this algorithm we introduce a new realistic family of curves, $c$-packed curves, that is closed under simplification. We believe the notion of $c$-packed curves to be of independent interest. We show that our algorithm has near linear running time for $c$-packed curves, and show s…
▽ More
We present simple and practical $(1+\eps)$-approximation algorithm for the Frechet distance between curves. To analyze this algorithm we introduce a new realistic family of curves, $c$-packed curves, that is closed under simplification. We believe the notion of $c$-packed curves to be of independent interest. We show that our algorithm has near linear running time for $c$-packed curves, and show similar results for other input models.
△ Less
Submitted 29 January, 2012; v1 submitted 1 March, 2010;
originally announced March 2010.
-
Geodesic Fréchet Distance Inside a Simple Polygon
Authors:
Atlas F. Cook IV,
Carola Wenk
Abstract:
We unveil an alluring alternative to parametric search that applies to both the non-geodesic and geodesic Fréchet optimization problems. This randomized approach is based on a variant of red-blue intersections and is appealing due to its elegance and practical efficiency when compared to parametric search. We present the first algorithm for the geodesic Fréchet distance between two polygonal cur…
▽ More
We unveil an alluring alternative to parametric search that applies to both the non-geodesic and geodesic Fréchet optimization problems. This randomized approach is based on a variant of red-blue intersections and is appealing due to its elegance and practical efficiency when compared to parametric search. We present the first algorithm for the geodesic Fréchet distance between two polygonal curves $A$ and $B$ inside a simple bounding polygon $P$. The geodesic Fréchet decision problem is solved almost as fast as its non-geodesic sibling and requires $O(N^{2\log k)$ time and $O(k+N)$ space after $O(k)$ preprocessing, where $N$ is the larger of the complexities of $A$ and $B$ and $k$ is the complexity of $P$. The geodesic Fréchet optimization problem is solved by a randomized approach in $O(k+N^{2\log kN\log N)$ expected time and $O(k+N^{2)$ space. This runtime is only a logarithmic factor larger than the standard non-geodesic Fréchet algorithm (Alt and Godau 1995). Results are also presented for the geodesic Fréchet distance in a polygonal domain with obstacles and the geodesic Hausdorff distance for sets of points or sets of line segments inside a simple polygon $P$.
△ Less
Submitted 20 February, 2008;
originally announced February 2008.