-
Beyond R-barycenters: an effective averaging method on Stiefel and Grassmann manifolds
Authors:
Florent Bouchard,
Nils Laurent,
Salem Said,
Nicolas Le Bihan
Abstract:
In this paper, the issue of averaging data on a manifold is addressed. While the Fréchet mean resulting from Riemannian geometry appears ideal, it is unfortunately not always available and often computationally very expensive. To overcome this, R-barycenters have been proposed and successfully applied to Stiefel and Grassmann manifolds. However, R-barycenters still suffer severe limitations as they rely on iterative algorithms and complicated operators. We propose simpler, yet efficient, barycenters that we call RL-barycenters. We show that, in the setting relevant to most applications, our framework yields astonishingly simple barycenters: arithmetic means projected onto the manifold. We apply this approach to the Stiefel and Grassmann manifolds. On simulated data, our approach is competitive with respect to existing averaging methods, while computationally cheaper.
Submitted 20 January, 2025;
originally announced January 2025.
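The abstract above describes the resulting barycenters as arithmetic means projected onto the manifold. A minimal sketch of that idea follows, assuming NumPy. The specific projections used here (the polar factor for the Stiefel manifold, the leading eigenvectors of the averaged projector for the Grassmann manifold) and all function names are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def project_to_stiefel(M):
    """Return the polar factor of M: the closest matrix (in Frobenius norm)
    with orthonormal columns. Assumes M has full column rank."""
    U, _, Vt = np.linalg.svd(M, full_matrices=False)
    return U @ Vt


def projected_mean_stiefel(samples):
    """Arithmetic mean of the samples, projected back onto the Stiefel manifold."""
    return project_to_stiefel(np.mean(samples, axis=0))


def projected_mean_grassmann(samples, k):
    """Average the orthogonal projectors X X^T (an assumed representation of
    each subspace), then keep the k leading eigenvectors as a basis."""
    P = np.mean([X @ X.T for X in samples], axis=0)
    _, V = np.linalg.eigh(P)  # eigenvalues are returned in ascending order
    return V[:, -k:]


# Hypothetical usage: ten noisy copies of a base point with 5 rows and 2 columns.
rng = np.random.default_rng(0)
base = project_to_stiefel(rng.standard_normal((5, 2)))
samples = [project_to_stiefel(base + 0.1 * rng.standard_normal((5, 2)))
           for _ in range(10)]
G_stiefel = projected_mean_stiefel(samples)
G_grassmann = projected_mean_grassmann(samples, k=2)
```

Neither function iterates: each costs one SVD or one eigendecomposition, which is consistent with the abstract's claim of a computationally cheaper average.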
-
Two-way kernel matrix puncturing: towards resource-efficient PCA and spectral clustering
Authors:
Romain Couillet,
Florent Chatelain,
Nicolas Le Bihan
Abstract:
The article introduces an elementary cost and storage reduction method for spectral clustering and principal component analysis. The method consists in randomly "puncturing" both the data matrix $X\in\mathbb{C}^{p\times n}$ (or $\mathbb{R}^{p\times n}$) and its corresponding kernel (Gram) matrix $K$ through Bernoulli masks: $S\in\{0,1\}^{p\times n}$ for $X$ and $B\in\{0,1\}^{n\times n}$ for $K$. The resulting "two-way punctured" kernel is thus given by $K=\frac{1}{p}[(X \odot S)^{\sf H} (X \odot S)] \odot B$. We demonstrate that, for $X$ composed of independent columns drawn from a Gaussian mixture model, as $n,p\to\infty$ with $p/n\to c_0\in(0,\infty)$, the spectral behavior of $K$ -- its limiting eigenvalue distribution, as well as its isolated eigenvalues and eigenvectors -- is fully tractable and exhibits a series of counter-intuitive phenomena. We notably prove, and empirically confirm on GAN-generated image databases, that it is possible to drastically puncture the data, thereby providing possibly huge computational and storage gains, for virtually constant (clustering or PCA) performance. This preliminary study thus opens the path towards rethinking, from a large dimensional standpoint, computational and storage costs in elementary machine learning models.
Submitted 17 May, 2021; v1 submitted 24 February, 2021;
originally announced February 2021.
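The two-way punctured kernel formula in the abstract above, $K=\frac{1}{p}[(X \odot S)^{\sf H} (X \odot S)] \odot B$, can be sketched as follows, assuming NumPy. The function name, the parameter names `eps_S` and `eps_B` (the Bernoulli keep-probabilities), and the choice of a symmetric mask $B$ with an all-ones diagonal are assumptions made for illustration. The abstract only states that the masks are Bernoulli.

```python
import numpy as np


def two_way_punctured_kernel(X, eps_S, eps_B, rng):
    """Build K = (1/p) [(X * S)^H (X * S)] * B with Bernoulli masks S and B.

    X is a p x n data matrix (real or complex). eps_S and eps_B are the
    probabilities of keeping an entry of X and of the Gram matrix.
    """
    p, n = X.shape
    # Data mask S: each entry of X is kept independently with probability eps_S.
    S = rng.random((p, n)) < eps_S
    # Kernel mask B: assumed symmetric with ones on the diagonal, so that K
    # stays Hermitian and its diagonal entries are never discarded.
    upper = np.triu(rng.random((n, n)) < eps_B, k=1)
    B = upper | upper.T | np.eye(n, dtype=bool)
    XS = X * S
    return (XS.conj().T @ XS) * B / p


# Hypothetical usage: keep 30% of the data entries and 50% of the kernel entries.
rng = np.random.default_rng(0)
X = rng.standard_normal((20, 30))
K = two_way_punctured_kernel(X, eps_S=0.3, eps_B=0.5, rng=rng)
```

The sketch uses dense arrays for readability. The storage and computational gains discussed in the abstract would in practice require sparse storage of the masked matrices.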
-
SALZA: Soft algorithmic complexity estimates for clustering and causality inference
Authors:
Marion Revolle,
Cayre François,
Nicolas Le Bihan
Abstract:
A complete set of practical estimators for the conditional, simple and joint algorithmic complexities is presented, from which a semi-metric is derived. Also, new directed information estimators are proposed and applied to causality inference on Directed Acyclic Graphs. The performance of these estimators is investigated and shown to compare well with the state-of-the-art Normalized Compression Distance (NCD).
Submitted 18 July, 2016;
originally announced July 2016.
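For context, the Normalized Compression Distance that the abstract above uses as its point of comparison can be sketched with a general-purpose compressor. This is the NCD baseline only, not the SALZA estimators, and the use of `zlib` as the compressor is an assumption made for illustration.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (maximum compression level)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized Compression Distance:
    (C(xy) - min(C(x), C(y))) / max(C(x), C(y)),
    where C is the compressed size and xy is the concatenation of x and y."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Hypothetical usage: a string compared with itself and with unrelated text.
a = b"the quick brown fox jumps over the lazy dog " * 50
b = b"lorem ipsum dolor sit amet consectetur adipiscing elit " * 50
d_same = ncd(a, a)
d_diff = ncd(a, b)
```

Values near 0 indicate that one input is almost fully predictable from the other. Values near 1 indicate that the compressor finds little shared structure.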
-
Filtering from Observations on Stiefel Manifolds
Authors:
Jeremie Boulanger,
Salem Said,
Nicolas Le Bihan,
Jonathan Manton
Abstract:
This paper considers the problem of optimal filtering for partially observed signals taking values on the rotation group. More precisely, one or more components are considered not to be available in the measurement of the attitude of a 3D rigid body. In such cases, the observed signal takes its values on a Stiefel manifold. It is demonstrated how to filter the observed signal through the anti-development built from observations. A particle filter implementation is proposed to estimate the partially observed, noise-corrupted signal. The sampling issue is also addressed and interpolation methods are introduced. An illustration of the proposed technique on synthetic data demonstrates the ability of the approach to estimate the angular velocity of a partially observed 3D system.
Submitted 25 September, 2014;
originally announced September 2014.
-
Isotropic Multiple Scattering Processes on Hyperspheres
Authors:
Nicolas Le Bihan,
Florent Chatelain,
Jonathan H. Manton
Abstract:
This paper presents several results about isotropic random walks and multiple scattering processes on hyperspheres ${\mathbb S}^{p-1}$. These results allow one to derive the Fourier expansions on ${\mathbb S}^{p-1}$ of these processes. A result of unimodality for the multiconvolution of symmetrical probability density functions (pdf) on ${\mathbb S}^{p-1}$ is also introduced. Such processes are then studied in the case where the scattering distribution is von Mises-Fisher (vMF). Asymptotic distributions for the multiconvolution of vMFs on ${\mathbb S}^{p-1}$ are obtained. Both the Fourier expansion and the asymptotic approximation allow us to compute estimation bounds for the parameters of Compound Cox Processes (CCP) on ${\mathbb S}^{p-1}$.
Submitted 13 December, 2015; v1 submitted 12 August, 2014;
originally announced August 2014.
-
Decompounding on compact Lie groups
Authors:
Salem Said,
Christian Lageman,
Nicolas Le Bihan,
Jonathan H. Manton
Abstract:
Noncommutative harmonic analysis is used to solve a nonparametric estimation problem stated in terms of compound Poisson processes on compact Lie groups. This problem of decompounding is a generalization of a similar classical problem. The proposed solution is based on a characteristic function method. The treated problem is important to recent models of the physical inverse problem of multiple scattering.
Submitted 15 July, 2009;
originally announced July 2009.