-
An Efficient Transport-Based Dissimilarity Measure for Time Series Classification under Warping Distortions
Authors:
Akram Aldroubi,
Rocío Díaz Martín,
Ivan Medri,
Kristofor E. Pas,
Gustavo K. Rohde,
Abu Hasnat Mohammad Rubaiyat
Abstract:
Time Series Classification (TSC) is an important problem with numerous applications in science and technology. Dissimilarity-based approaches, such as Dynamic Time Warping (DTW), are classical methods for distinguishing time series when time deformations are confounding information. In this paper, starting from a deformation-based model for signal classes, we define a problem statement for time series classification. We show that, under theoretically ideal conditions, a continuous version of the classic 1NN-DTW method can solve the stated problem, even when only one training sample is available. In addition, we propose an alternative dissimilarity measure based on Optimal Transport and show that it can also solve the aforementioned problem statement at a significantly reduced computational cost. Finally, we demonstrate the application of the newly proposed approach on simulated and real time series classification data, showing the efficacy of the method.
Submitted 14 May, 2025; v1 submitted 8 May, 2025;
originally announced May 2025.
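For readers unfamiliar with the baseline named in this abstract, the following is a minimal sketch of the classical discrete DTW dissimilarity combined with a one-sample-per-class nearest-neighbor rule. It is not the continuous 1NN-DTW variant or the transport-based measure from the paper; the function names `dtw_distance` and `classify_1nn`, the squared-difference local cost, and the toy templates are assumptions chosen for illustration.

```python
import math


def dtw_distance(a, b):
    """Classical DTW between two numeric sequences (squared-difference local cost)."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = (a[i - 1] - b[j - 1]) ** 2
            # extend the cheapest of the three admissible predecessor alignments
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return math.sqrt(cost[n][m])


def classify_1nn(query, templates):
    """Return the label whose single template is closest to `query` under DTW.

    `templates` maps a class label to one training sequence (hypothetical setup).
    """
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))


# Toy usage: a time-warped ramp is still matched to the ramp template.
templates = {"ramp": [0, 1, 2, 3], "flat": [1, 1, 1, 1]}
print(classify_1nn([0, 0, 1, 2, 3, 3], templates))
```

The dynamic program costs O(nm) per pair of sequences, which is the kind of cost the abstract contrasts with its transport-based alternative.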
-
Reconstructing Graph Signals from Noisy Dynamical Samples
Authors:
Akram Aldroubi,
Victor Bailey,
Ilya Krishtal,
Brendan Miller,
Armenak Petrosyan
Abstract:
We investigate the dynamical sampling space-time trade-off problem within a graph setting. Specifically, we derive necessary and sufficient conditions for space-time sampling that enable the reconstruction of an initial band-limited signal on a graph. Additionally, we develop and test numerical algorithms for approximating the optimal placement of sensors on the graph to minimize the mean squared error when recovering signals from time-space measurements corrupted by i.i.d. additive noise. Our numerical experiments demonstrate that our approach outperforms previously proposed algorithms for related problems.
Submitted 19 November, 2024;
originally announced November 2024.
-
Expected Sliced Transport Plans
Authors:
Xinran Liu,
Rocío Díaz Martín,
Yikun Bai,
Ashkan Shahbazi,
Matthew Thorpe,
Akram Aldroubi,
Soheil Kolouri
Abstract:
The optimal transport (OT) problem has gained significant traction in modern machine learning for its ability to: (1) provide versatile metrics, such as Wasserstein distances and their variants, and (2) determine optimal couplings between probability measures. To reduce the computational complexity of OT solvers, methods like entropic regularization and sliced optimal transport have been proposed. The sliced OT framework improves efficiency by comparing one-dimensional projections (slices) of high-dimensional distributions. However, despite their computational efficiency, sliced-Wasserstein approaches lack a transportation plan between the input measures, limiting their use in scenarios requiring explicit coupling. In this paper, we address two key questions: Can a transportation plan be constructed between two probability measures using the sliced transport framework? If so, can this plan be used to define a metric between the measures? We propose a "lifting" operation to extend one-dimensional optimal transport plans back to the original space of the measures. By computing the expectation of these lifted plans, we derive a new transportation plan, termed expected sliced transport (EST) plans. We prove that using the EST plan to weight the sum of the individual Euclidean costs for moving from one point to another results in a valid metric between the input discrete probability measures. We demonstrate the connection between our approach and the recently proposed min-SWGG, along with illustrative numerical examples that support our theoretical findings.
Submitted 17 October, 2024; v1 submitted 15 October, 2024;
originally announced October 2024.
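The slicing idea summarized in this abstract (comparing one-dimensional projections of high-dimensional distributions) can be sketched as follows. This is a plain Monte Carlo sliced 2-Wasserstein distance between two equal-size, uniformly weighted point clouds, not the expected sliced transport (EST) plan proposed in the paper; the function names, the number of slices, and the fixed seed are assumptions for illustration.

```python
import math
import random


def wasserstein2_1d(u, v):
    """W2 between two equal-size, uniformly weighted 1-D samples: sort and match."""
    us, vs = sorted(u), sorted(v)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(us, vs)) / len(us))


def sliced_wasserstein2(xs, ys, n_slices=100, seed=0):
    """Monte Carlo sliced W2 between point clouds xs and ys (lists of equal-length tuples)."""
    rng = random.Random(seed)
    dim = len(xs[0])
    total = 0.0
    for _ in range(n_slices):
        # Draw a random direction on the unit sphere by normalizing a Gaussian vector.
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(c * c for c in g))
        theta = [c / norm for c in g]
        # Project both clouds onto the direction and compare the 1-D projections.
        px = [sum(p[k] * theta[k] for k in range(dim)) for p in xs]
        py = [sum(q[k] * theta[k] for k in range(dim)) for q in ys]
        total += wasserstein2_1d(px, py) ** 2
    return math.sqrt(total / n_slices)


# Toy usage: a cloud compared with a translated copy of itself.
cloud = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
shifted = [(x + 3.0, y + 4.0) for x, y in cloud]
print(sliced_wasserstein2(cloud, shifted))
```

Each slice only requires a sort, which is where the efficiency comes from; as the abstract notes, the resulting number alone carries no coupling between the original points.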
-
Signed Cumulative Distribution Transform for Parameter Estimation of 1-D Signals
Authors:
Sumati Thareja,
Gustavo Rohde,
Rocio Diaz Martin,
Ivan Medri,
Akram Aldroubi
Abstract:
We describe a method for signal parameter estimation using the signed cumulative distribution transform (SCDT), a recently introduced signal representation tool based on optimal transport theory. The method builds upon signal estimation using the cumulative distribution transform (CDT) originally introduced for positive distributions. Specifically, we show that Wasserstein-type distance minimization can be performed simply using linear least squares techniques in SCDT space for arbitrary signal classes, thus providing a global minimizer for the estimation problem even when the underlying signal is a nonlinear function of the unknown parameters. Comparisons to current signal estimation methods using $L_p$ minimization show the advantage of the method.
Submitted 16 July, 2022;
originally announced July 2022.
-
Predictive algorithms in dynamical sampling for burst-like forcing terms
Authors:
Akram Aldroubi,
Longxiu Huang,
Keri Kornelson,
Ilya Krishtal
Abstract:
In this paper, we consider the problem of recovery of a burst-like forcing term in an initial value problem (IVP) in the framework of dynamical sampling. We introduce an idea of using two particular classes of samplers that allow one to predict the solution of the IVP over a time interval without a burst. This leads to two different algorithms that stably and accurately approximate the burst-like forcing term even in the presence of a measurement acquisition error and a large background source.
Submitted 1 September, 2021;
originally announced September 2021.
-
The Signed Cumulative Distribution Transform for 1-D Signal Analysis and Classification
Authors:
Akram Aldroubi,
Rocio Diaz Martin,
Ivan Medri,
Gustavo K. Rohde,
Sumati Thareja
Abstract:
This paper presents a new mathematical signal transform that is especially suitable for decoding information related to non-rigid signal displacements. We provide a measure theoretic framework to extend the existing Cumulative Distribution Transform [ACHA 45 (2018), no. 3, 616-641] to arbitrary (signed) signals on $\overline{\mathbb{R}}$. We present both forward (analysis) and inverse (synthesis) formulas for the transform, and describe several of its properties including translation, scaling, convexity, linear separability and others. Finally, we describe a metric in transform space, and demonstrate the application of the transform in classifying (detecting) signals under random displacements.
Submitted 3 June, 2021;
originally announced June 2021.
-
Partitioning signal classes using transport transforms for data analysis and machine learning
Authors:
Akram Aldroubi,
Shiying Li,
Gustavo K. Rohde
Abstract:
A relatively new set of transport-based transforms (CDT, R-CDT, LOT) have shown their strength and great potential in various image and data processing tasks such as parametric signal estimation, classification, and cancer detection, among many others. It is hence worthwhile to elucidate some of the mathematical properties that explain the successes of these transforms when they are used as tools in data analysis, signal processing or data classification. In particular, we give conditions under which classes of signals that are created by algebraic generative models are transformed into convex sets by the transport transforms. Such convexification of the classes simplifies classification and other data analysis and processing problems when viewed in the transform domain. More specifically, we study the extent and limitation of the convexification ability of these transforms under an algebraic generative modeling framework. We hope that this paper will serve as an introduction to these transforms and will encourage mathematicians and other researchers to further explore the theoretical underpinnings and algorithmic tools that will help understand the successes of these transforms and lay the groundwork for further successful applications.
Submitted 24 February, 2021; v1 submitted 8 August, 2020;
originally announced August 2020.
-
Radon cumulative distribution transform subspace modeling for image classification
Authors:
Mohammad Shifat-E-Rabbi,
Xuwang Yin,
Abu Hasnat Mohammad Rubaiyat,
Shiying Li,
Soheil Kolouri,
Akram Aldroubi,
Jonathan M. Nichols,
Gustavo K. Rohde
Abstract:
We present a new supervised image classification method applicable to a broad class of image deformation models. The method makes use of the previously described Radon Cumulative Distribution Transform (R-CDT) for image data, whose mathematical properties are exploited to express the image data in a form that is more suitable for machine learning. While certain operations such as translation, scaling, and higher-order transformations are challenging to model in native image space, we show the R-CDT can capture some of these variations and thus render the associated image classification problems easier to solve. The method -- utilizing a nearest-subspace algorithm in R-CDT space -- is simple to implement, non-iterative, has no hyper-parameters to tune, is computationally efficient, label efficient, and provides competitive accuracies to state-of-the-art neural networks for many types of classification problems. In addition to the test accuracy performances, we show improvements (with respect to neural network-based methods) in terms of computational efficiency (it can be implemented without the use of GPUs), number of training samples needed for training, as well as out-of-distribution generalization. The Python code for reproducing our results is available at https://github.com/rohdelab/rcdt_ns_classifier.
Submitted 2 March, 2022; v1 submitted 7 April, 2020;
originally announced April 2020.
-
CUR Decompositions, Similarity Matrices, and Subspace Clustering
Authors:
Akram Aldroubi,
Keaton Hamm,
Ahmet Bugra Koku,
Ali Sekmen
Abstract:
A general framework for solving the subspace clustering problem using the CUR decomposition is presented. The CUR decomposition provides a natural way to construct similarity matrices for data that come from a union of unknown subspaces $\mathscr{U}=\underset{i=1}{\overset{M}\bigcup}S_i$. The similarity matrices thus constructed give the exact clustering in the noise-free case. Additionally, this decomposition gives rise to many distinct similarity matrices from a given set of data, which allow enough flexibility to perform accurate clustering of noisy data. We also show that two known methods for subspace clustering can be derived from the CUR decomposition. An algorithm based on the theoretical construction of similarity matrices is presented, and experiments on synthetic and real data are presented to test the method.
Additionally, an adaptation of our CUR based similarity matrices is utilized to provide a heuristic algorithm for subspace clustering; this algorithm yields the best overall performance to date for clustering the Hopkins155 motion segmentation dataset.
Submitted 11 December, 2018; v1 submitted 11 November, 2017;
originally announced November 2017.
-
Phaseless Reconstruction from Space-Time Samples
Authors:
Akram Aldroubi,
Ilya Krishtal,
Sui Tang
Abstract:
Phaseless reconstruction from space-time samples is a nonlinear problem of recovering a function $x$ in a Hilbert space $\mathcal{H}$ from the modulus of linear measurements $\{\lvert \langle x, φ_i\rangle \rvert$, $ \ldots$, $\lvert \langle A^{L_i}x, φ_i \rangle \rvert : i \in\mathscr I\}$, where $\{φ_i; i \in\mathscr I\}\subset \mathcal{H}$ is a set of functionals on $\mathcal{H}$, and $A$ is a bounded operator on $\mathcal{H}$ that acts as an evolution operator. In this paper, we provide various sufficient or necessary conditions for solving this problem, which has connections to $X$-ray crystallography, the scattering transform, and deep learning.
Submitted 16 June, 2017;
originally announced June 2017.
-
Krylov Subspace Methods in Dynamical Sampling
Authors:
Akram Aldroubi,
Ilya Krishtal
Abstract:
Let $B$ be an unknown linear evolution process on $\mathbb C^d\simeq l^2(\mathbb Z_d)$ driving an unknown initial state $x$ and producing the states $\{B^\ell x, \ell = 0,1,\ldots\}$ at different time levels. The problem under consideration in this paper is to find as much information as possible about $B$ and $x$ from the measurements $Y=\{x(i)$, $Bx(i)$, $\dots$, $B^{\ell_i}x(i): i \in Ω\subset \mathbb Z^d\}$. If $B$ is a "low-pass" convolution operator, we show that we can recover both $B$ and $x$, almost surely, as long as we double the amount of temporal samples needed in \cite{ADK13} to recover the signal propagated by a known operator $B$. For a general operator $B$, we can recover parts or even all of its spectrum from $Y$. As a special case of our method, we derive the centuries old Prony's method \cite{BDVMC08, P795, PP13} which recovers a vector with an $s$-sparse Fourier transform from $2s$ of its consecutive components.
Submitted 3 December, 2014;
originally announced December 2014.
-
Exact Reconstruction of Spatially Undersampled Signals in Evolutionary Systems
Authors:
Akram Aldroubi,
Jacqueline Davis,
Ilya Krishtal
Abstract:
We consider the problem of spatiotemporal sampling in which an initial state $f$ of an evolution process $f_t=A_tf$ is to be recovered from a combined set of coarse samples from varying time levels $\{t_1,\dots,t_N\}$. This new way of sampling, which we call dynamical sampling, differs from standard sampling since at any fixed time $t_i$ there are not enough samples to recover the function $f$ or the state $f_{t_i}$. Although dynamical sampling is an inverse problem, it differs from the typical inverse problems in which $f$ is to be recovered from $A_Tf$ for a single time $T$. In this paper, we consider signals that are modeled by $\ell^2(\mathbb Z)$ or a shift invariant space $V\subset L^2(\mathbb R)$.
Submitted 4 December, 2013;
originally announced December 2013.
-
Nearness to Local Subspace Algorithm for Subspace and Motion Segmentation
Authors:
Akram Aldroubi,
Ali Sekmen
Abstract:
There is a growing interest in computer science, engineering, and mathematics in modeling signals in terms of unions of subspaces and manifolds. Subspace segmentation and clustering of high dimensional data drawn from a union of subspaces are especially important, with many practical applications in computer vision, image and signal processing, communications, and information theory. This paper presents a clustering algorithm for high dimensional data that comes from a union of lower dimensional subspaces of equal and known dimensions. Such cases occur in many data clustering problems, such as motion segmentation and face recognition. The algorithm is reliable in the presence of noise, and applied to the Hopkins 155 Dataset, it generates the best results to date for motion segmentation. The two motion, three motion, and overall segmentation rates for the video sequences are 99.43%, 98.69%, and 99.24%, respectively.
Submitted 14 May, 2012; v1 submitted 11 October, 2010;
originally announced October 2010.
-
Uncertainty Principles and Balian-Low type Theorems in Principal Shift-Invariant Spaces
Authors:
Akram Aldroubi,
Qiyu Sun,
Haichao Wang
Abstract:
In this paper, we consider the time-frequency localization of the generator of a principal shift-invariant space on the real line which has additional shift-invariance. We prove that if a principal shift-invariant space on the real line is translation-invariant then any of its orthonormal (or Riesz) generators is non-integrable. However, for any $n\ge2$, there exist principal shift-invariant spaces on the real line that are also $\frac{1}{n}\mathbb{Z}$-invariant with an integrable orthonormal (or a Riesz) generator $φ$, but $φ$ satisfies $\int_{\mathbb R} |φ(x)|^2 |x|^{1+ε} dx=\infty$ for any $ε>0$ and its Fourier transform $\hatφ$ cannot decay as fast as $ (1+|ξ|)^{-r}$ for any $r>1/2$. Examples are constructed to demonstrate that the above decay properties for the orthonormal generator in the time domain and in the frequency domain are optimal.
Submitted 25 August, 2010;
originally announced August 2010.
-
A Unified Approach to Sparse Signal Processing
Authors:
F. Marvasti,
A. Amini,
F. Haddadi,
M. Soltanolkotabi,
B. H. Khalaj,
A. Aldroubi,
S. Holm,
S. Sanei,
J. Chambers
Abstract:
A unified view of sparse signal processing is presented in tutorial form by bringing together various fields. For each of these fields, various algorithms and techniques, which have been developed to leverage sparsity, are described succinctly. The common benefits of significant reduction in sampling rate and processing manipulations are revealed.
The key applications of sparse signal processing are sampling, coding, spectral estimation, array processing, component analysis, and multipath channel estimation. In terms of reconstruction algorithms, linkages are made with random sampling, compressed sensing and rate of innovation. The redundancy introduced by channel coding in finite/real Galois fields is then related to sampling with similar reconstruction algorithms. The methods of Prony, Pisarenko, and MUSIC are next discussed for sparse frequency domain representations. Specifically, the relations of the approach of Prony to an annihilating filter and Error Locator Polynomials in coding are emphasized; the Pisarenko and MUSIC methods are further improvements of the Prony method. Such spectral estimation methods are then related to multi-source location and DOA estimation in array processing. The notions of sparse array beamforming and sparse sensor networks are also introduced. Sparsity in unobservable source signals is also shown to facilitate source separation in SCA; the algorithms developed in this area are also widely used in compressed sensing. Finally, the multipath channel estimation problem is shown to have a sparse formulation; algorithms similar to sampling and coding are used to estimate OFDM channels.
Submitted 11 February, 2009;
originally announced February 2009.
-
Sequential adaptive compressed sampling via Huffman codes
Authors:
Akram Aldroubi,
Haichao Wang,
Kourosh Zarringhalam
Abstract:
There are two main approaches in compressed sensing: the geometric approach and the combinatorial approach. In this paper we introduce an information theoretic approach and use results from the theory of Huffman codes to construct a sequence of binary sampling vectors to determine a sparse signal. Unlike other approaches, our approach is adaptive in the sense that each sampling vector depends on the previous sample. The number of measurements we need for a k-sparse vector in n-dimensional space is no more than O(k log n) and the reconstruction is O(k).
Submitted 25 June, 2009; v1 submitted 27 October, 2008;
originally announced October 2008.