-
Efficient Trajectory Inference in Wasserstein Space Using Consecutive Averaging
Authors:
Amartya Banerjee,
Harlin Lee,
Nir Sharon,
Caroline Moosmüller
Abstract:
Capturing data from dynamic processes through cross-sectional measurements is seen in many fields, such as computational biology. Trajectory inference deals with the challenge of reconstructing continuous processes from such observations. In this work, we propose methods for B-spline approximation and interpolation of point clouds through consecutive averaging that is intrinsic to the Wasserstein…
▽ More
Capturing data from dynamic processes through cross-sectional measurements is seen in many fields, such as computational biology. Trajectory inference deals with the challenge of reconstructing continuous processes from such observations. In this work, we propose methods for B-spline approximation and interpolation of point clouds through consecutive averaging that is intrinsic to the Wasserstein space. Combining subdivision schemes with optimal transport-based geodesic, our methods carry out trajectory inference at a chosen level of precision and smoothness, and can automatically handle scenarios where particles undergo division over time. We prove linear convergence rates and rigorously evaluate our method on cell data characterized by bifurcations, merges, and trajectory splitting scenarios like $supercells$, comparing its performance against state-of-the-art trajectory inference and interpolation methods. The results not only underscore the effectiveness of our method in inferring trajectories but also highlight the benefit of performing interpolation and approximation that respect the inherent geometric properties of the data.
△ Less
Submitted 10 March, 2025; v1 submitted 30 May, 2024;
originally announced May 2024.
-
Manifold learning in Wasserstein space
Authors:
Keaton Hamm,
Caroline Moosmüller,
Bernhard Schmitzer,
Matthew Thorpe
Abstract:
This paper aims at building the theoretical foundations for manifold learning algorithms in the space of absolutely continuous probability measures $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ with $Ω$ a compact and convex subset of $\mathbb{R}^d$, metrized with the Wasserstein-2 distance $\mathbb{W}$. We begin by introducing a construction of submanifolds $Λ$ in $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ equipped with…
▽ More
This paper aims at building the theoretical foundations for manifold learning algorithms in the space of absolutely continuous probability measures $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ with $Ω$ a compact and convex subset of $\mathbb{R}^d$, metrized with the Wasserstein-2 distance $\mathbb{W}$. We begin by introducing a construction of submanifolds $Λ$ in $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ equipped with metric $\mathbb{W}_Λ$, the geodesic restriction of $\mathbb{W}$ to $Λ$. In contrast to other constructions, these submanifolds are not necessarily flat, but still allow for local linearizations in a similar fashion to Riemannian submanifolds of $\mathbb{R}^d$. We then show how the latent manifold structure of $(Λ,\mathbb{W}_Λ)$ can be learned from samples $\{λ_i\}_{i=1}^N$ of $Λ$ and pairwise extrinsic Wasserstein distances $\mathbb{W}$ on $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ only. In particular, we show that the metric space $(Λ,\mathbb{W}_Λ)$ can be asymptotically recovered in the sense of Gromov--Wasserstein from a graph with nodes $\{λ_i\}_{i=1}^N$ and edge weights $W(λ_i,λ_j)$. In addition, we demonstrate how the tangent space at a sample $λ$ can be asymptotically recovered via spectral analysis of a suitable ``covariance operator'' using optimal transport maps from $λ$ to sufficiently close and diverse samples $\{λ_i\}_{i=1}^N$. The paper closes with some explicit constructions of submanifolds $Λ$ and numerical examples on the recovery of tangent spaces through spectral analysis.
△ Less
Submitted 28 March, 2025; v1 submitted 14 November, 2023;
originally announced November 2023.
-
Approximation properties of slice-matching operators
Authors:
Shiying Li,
Caroline Moosmueller
Abstract:
Iterative slice-matching procedures are efficient schemes for transferring a source measure to a target measure, especially in high dimensions. These schemes have been successfully used in applications such as color transfer and shape retrieval, and are guaranteed to converge under regularity assumptions. In this paper, we explore approximation properties related to a single step of such iterative…
▽ More
Iterative slice-matching procedures are efficient schemes for transferring a source measure to a target measure, especially in high dimensions. These schemes have been successfully used in applications such as color transfer and shape retrieval, and are guaranteed to converge under regularity assumptions. In this paper, we explore approximation properties related to a single step of such iterative schemes by examining an associated slice-matching operator, depending on a source measure, a target measure, and slicing directions. In particular, we demonstrate an invariance property with respect to the source measure, an equivariance property with respect to the target measure, and Lipschitz continuity concerning the slicing directions. We furthermore establish error bounds corresponding to approximating the target measure by one step of the slice-matching scheme and characterize situations in which the slice-matching operator recovers the optimal transport map between two measures. We also investigate connections to affine registration problems with respect to (sliced) Wasserstein distances. These connections can be also be viewed as extensions to the invariance and equivariance properties of the slice-matching operator and illustrate the extent to which slice-matching schemes incorporate affine effects.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Measure transfer via stochastic slicing and matching
Authors:
Shiying Li,
Caroline Moosmueller
Abstract:
This paper studies iterative schemes for measure transfer and approximation problems, which are defined through a slicing-and-matching procedure. Similar to the sliced Wasserstein distance, these schemes benefit from the availability of closed-form solutions for the one-dimensional optimal transport problem and the associated computational advantages. While such schemes have already been successfu…
▽ More
This paper studies iterative schemes for measure transfer and approximation problems, which are defined through a slicing-and-matching procedure. Similar to the sliced Wasserstein distance, these schemes benefit from the availability of closed-form solutions for the one-dimensional optimal transport problem and the associated computational advantages. While such schemes have already been successfully utilized in data science applications, not too many results on their convergence are available. The main contribution of this paper is an almost sure convergence proof for stochastic slicing-and-matching schemes. The proof builds on an interpretation as a stochastic gradient descent scheme on the Wasserstein space. Numerical examples on step-wise image morphing are demonstrated as well.
△ Less
Submitted 12 January, 2025; v1 submitted 11 July, 2023;
originally announced July 2023.
-
Linearized Wasserstein dimensionality reduction with approximation guarantees
Authors:
Alexander Cloninger,
Keaton Hamm,
Varun Khurana,
Caroline Moosmüller
Abstract:
We introduce LOT Wassmap, a computationally feasible algorithm to uncover low-dimensional structures in the Wasserstein space. The algorithm is motivated by the observation that many datasets are naturally interpreted as probability measures rather than points in $\mathbb{R}^n$, and that finding low-dimensional descriptions of such datasets requires manifold learning algorithms in the Wasserstein…
▽ More
We introduce LOT Wassmap, a computationally feasible algorithm to uncover low-dimensional structures in the Wasserstein space. The algorithm is motivated by the observation that many datasets are naturally interpreted as probability measures rather than points in $\mathbb{R}^n$, and that finding low-dimensional descriptions of such datasets requires manifold learning algorithms in the Wasserstein space. Most available algorithms are based on computing the pairwise Wasserstein distance matrix, which can be computationally challenging for large datasets in high dimensions. Our algorithm leverages approximation schemes such as Sinkhorn distances and linearized optimal transport to speed-up computations, and in particular, avoids computing a pairwise distance matrix. We provide guarantees on the embedding quality under such approximations, including when explicit descriptions of the probability measures are not available and one must deal with finite samples instead. Experiments demonstrate that LOT Wassmap attains correct embeddings and that the quality improves with increased sample size. We also show how LOT Wassmap significantly reduces the computational cost when compared to algorithms that depend on pairwise distance computations.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Supervised learning of sheared distributions using linearized optimal transport
Authors:
Varun Khurana,
Harish Kannan,
Alexander Cloninger,
Caroline Moosmüller
Abstract:
In this paper we study supervised learning tasks on the space of probability measures. We approach this problem by embedding the space of probability measures into $L^2$ spaces using the optimal transport framework. In the embedding spaces, regular machine learning techniques are used to achieve linear separability. This idea has proved successful in applications and when the classes to be separat…
▽ More
In this paper we study supervised learning tasks on the space of probability measures. We approach this problem by embedding the space of probability measures into $L^2$ spaces using the optimal transport framework. In the embedding spaces, regular machine learning techniques are used to achieve linear separability. This idea has proved successful in applications and when the classes to be separated are generated by shifts and scalings of a fixed measure. This paper extends the class of elementary transformations suitable for the framework to families of shearings, describing conditions under which two classes of sheared distributions can be linearly separated. We furthermore give necessary bounds on the transformations to achieve a pre-specified separation level, and show how multiple embeddings can be used to allow for larger families of transformations. We demonstrate our results on image classification tasks.
△ Less
Submitted 25 January, 2022;
originally announced January 2022.
-
Hermite multiwavelets for manifold-valued data
Authors:
Mariantonia Cotronei,
Caroline Moosmüller,
Tomas Sauer,
Nada Sissouno
Abstract:
In this paper we present a construction of interpolatory Hermite multiwavelets for functions that take values in nonlinear geometries such as Riemannian manifolds or Lie groups. We rely on the strong connection between wavelets and subdivision schemes to define a prediction-correction approach based on Hermite subdivision schemes that operate on manifold-valued data. The main result concerns the d…
▽ More
In this paper we present a construction of interpolatory Hermite multiwavelets for functions that take values in nonlinear geometries such as Riemannian manifolds or Lie groups. We rely on the strong connection between wavelets and subdivision schemes to define a prediction-correction approach based on Hermite subdivision schemes that operate on manifold-valued data. The main result concerns the decay of the wavelet coefficients: We show that our manifold-valued construction essentially admits the same coefficient decay as linear Hermite wavelets, which also generalizes results on manifold-valued scalar wavelets.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Linear Optimal Transport Embedding: Provable Wasserstein classification for certain rigid transformations and perturbations
Authors:
Caroline Moosmüller,
Alexander Cloninger
Abstract:
Discriminating between distributions is an important problem in a number of scientific fields. This motivated the introduction of Linear Optimal Transportation (LOT), which embeds the space of distributions into an $L^2$-space. The transform is defined by computing the optimal transport of each distribution to a fixed reference distribution, and has a number of benefits when it comes to speed of c…
▽ More
Discriminating between distributions is an important problem in a number of scientific fields. This motivated the introduction of Linear Optimal Transportation (LOT), which embeds the space of distributions into an $L^2$-space. The transform is defined by computing the optimal transport of each distribution to a fixed reference distribution, and has a number of benefits when it comes to speed of computation and to determining classification boundaries. In this paper, we characterize a number of settings in which LOT embeds families of distributions into a space in which they are linearly separable. This is true in arbitrary dimension, and for families of distributions generated through perturbations of shifts and scalings of a fixed distribution.We also prove conditions under which the $L^2$ distance of the LOT embedding between two distributions in arbitrary dimension is nearly isometric to Wasserstein-2 distance between those distributions. This is of significant computational benefit, as one must only compute $N$ optimal transport maps to define the $N^2$ pairwise distances between $N$ distributions. We demonstrate the benefits of LOT on a number of distribution classification problems.
△ Less
Submitted 25 May, 2021; v1 submitted 20 August, 2020;
originally announced August 2020.
-
Polynomial overreproduction by Hermite subdivision operators, and $p$-Cauchy numbers
Authors:
Caroline Moosmüller,
Tomas Sauer
Abstract:
We study the case of Hermite subdivision operators satisfying a spectral condition of order greater than their size. We show that this can be characterized by operator factorizations involving Taylor operators and difference factorizations of a rank one vector scheme. Giving explicit expressions for the factorization operators, we put into evidence that the factorization only depends on the order…
▽ More
We study the case of Hermite subdivision operators satisfying a spectral condition of order greater than their size. We show that this can be characterized by operator factorizations involving Taylor operators and difference factorizations of a rank one vector scheme. Giving explicit expressions for the factorization operators, we put into evidence that the factorization only depends on the order of the spectral condition but not on the polynomials that define it. We further show that the derivation of these operators is based on an interplay between Stirling numbers and $p$-Cauchy numbers (or generalized Gregory coefficients).
△ Less
Submitted 20 June, 2020; v1 submitted 24 April, 2019;
originally announced April 2019.
-
A note on spectral properties of Hermite subdivision operators
Authors:
Caroline Moosmüller
Abstract:
In this paper we study the connection between the spectral condition of an Hermite subdivision operator and polynomial reproduction properties of the associated subdivision scheme. While it is known that in general the spectral condition does not imply the reproduction of polynomials, we here prove that a special spectral condition (defined by shifted monomials) is actually equivalent to the repro…
▽ More
In this paper we study the connection between the spectral condition of an Hermite subdivision operator and polynomial reproduction properties of the associated subdivision scheme. While it is known that in general the spectral condition does not imply the reproduction of polynomials, we here prove that a special spectral condition (defined by shifted monomials) is actually equivalent to the reproduction of polynomials. We further put into evidence that the sum rule of order $\ell>d$ associated with an Hermite subdivision operator of order $d$ does not imply that the spectral condition of order $\ell$ is satisfied, while it is known that these two concepts are equivalent in the case $\ell=d$.
△ Less
Submitted 24 April, 2018;
originally announced April 2018.
-
Stirling numbers and Gregory coefficients for the factorization of Hermite subdivision operators
Authors:
Caroline Moosmüller,
Svenja Hüning,
Costanza Conti
Abstract:
In this paper we present a factorization framework for Hermite subdivision schemes refining function values and first derivatives, which satisfy a spectral condition of high order. In particular we show that spectral order $d$ allows for $d$ factorizations of the subdivision operator with respect to the Gregory operators: A new sequence of operators we define using Stirling numbers and Gregory coe…
▽ More
In this paper we present a factorization framework for Hermite subdivision schemes refining function values and first derivatives, which satisfy a spectral condition of high order. In particular we show that spectral order $d$ allows for $d$ factorizations of the subdivision operator with respect to the Gregory operators: A new sequence of operators we define using Stirling numbers and Gregory coefficients. We further prove that the $d$-th factorization provides a ``convergence from contractivity'' method for showing $C^d$-convergence of the associated Hermite subdivision scheme. The power of our factorization framework lies in the reduction of computational effort for large $d$: In order to prove $C^d$-convergence, up to now, $d$ factorization steps were needed, while our method requires only one step, independently of $d$. Furthermore, in this paper, we show by an example that the spectral condition is not equivalent to the reproduction of polynomials.
△ Less
Submitted 22 July, 2019; v1 submitted 17 April, 2018;
originally announced April 2018.
-
Level-dependent interpolatory Hermite subdivision schemes and wavelets
Authors:
Mariantonia Cotronei,
Caroline Moosmüller,
Tomas Sauer,
Nada Sissouno
Abstract:
We study many properties of level-dependent Hermite subdivision, focusing on schemes preserving polynomial and exponential data. We specifically consider interpolatory schemes, which give rise to level-dependent multiresolution analyses through a prediction-correction approach. A result on the decay of the associated multiwavelet coefficients, corresponding to a uniformly continuous and differenti…
▽ More
We study many properties of level-dependent Hermite subdivision, focusing on schemes preserving polynomial and exponential data. We specifically consider interpolatory schemes, which give rise to level-dependent multiresolution analyses through a prediction-correction approach. A result on the decay of the associated multiwavelet coefficients, corresponding to a uniformly continuous and differentiable function, is derived. It makes use of the approximation of any such function with a generalized Taylor formula expressed in terms of polynomials and exponentials.
△ Less
Submitted 9 January, 2018;
originally announced January 2018.
-
Increasing the smoothness of vector and Hermite subdivision schemes
Authors:
Caroline Moosmüller,
Nira Dyn
Abstract:
In this paper we suggest a method for transforming a vector subdivision scheme generating $C^{\ell}$ limits to another such scheme of the same dimension, generating $C^{\ell+1}$ limits. In scalar subdivision, it is well known that a scheme generating $C^{\ell}$ limit curves can be transformed to a new scheme producing $C^{\ell+1}$ limit curves by multiplying the scheme's symbol with the smoothing…
▽ More
In this paper we suggest a method for transforming a vector subdivision scheme generating $C^{\ell}$ limits to another such scheme of the same dimension, generating $C^{\ell+1}$ limits. In scalar subdivision, it is well known that a scheme generating $C^{\ell}$ limit curves can be transformed to a new scheme producing $C^{\ell+1}$ limit curves by multiplying the scheme's symbol with the smoothing factor $\tfrac{z+1}{2}$. We extend this approach to vector and Hermite subdivision schemes, by manipulating symbols. The algorithms presented in this paper allow to construct vector (Hermite) subdivision schemes of arbitrarily high regularity from a convergent vector scheme (from a Hermite scheme whose Taylor scheme is convergent with limit functions of vanishing first component).
△ Less
Submitted 4 March, 2018; v1 submitted 17 October, 2017;
originally announced October 2017.