Search | arXiv e-print repository

doi 10.1109/TIT.2022.3199479

Riemannian statistics meets random matrix theory: towards learning from high-dimensional covariance matrices

Authors: Salem Said, Simon Heuveline, Cyrus Mostajeran

Abstract: Riemannian Gaussian distributions were initially introduced as basic building blocks for learning models which aim to capture the intrinsic structure of statistical populations of positive-definite matrices (here called covariance matrices). While the potential applications of such models have attracted significant attention, a major obstacle still stands in the way of these applications: there se… ▽ More Riemannian Gaussian distributions were initially introduced as basic building blocks for learning models which aim to capture the intrinsic structure of statistical populations of positive-definite matrices (here called covariance matrices). While the potential applications of such models have attracted significant attention, a major obstacle still stands in the way of these applications: there seems to exist no practical method of computing the normalising factors associated with Riemannian Gaussian distributions on spaces of high-dimensional covariance matrices. The present paper shows that this missing method comes from an unexpected new connection with random matrix theory. Its main contribution is to prove that Riemannian Gaussian distributions of real, complex, or quaternion covariance matrices are equivalent to orthogonal, unitary, or symplectic log-normal matrix ensembles. This equivalence yields a highly efficient approximation of the normalising factors, in terms of a rather simple analytic expression. The error due to this approximation decreases like the inverse square of dimension. Numerical experiments are conducted which demonstrate how this new approximation can unlock the difficulties which have impeded applications to real-world datasets of high-dimensional covariance matrices. The paper then turns to Riemannian Gaussian distributions of block-Toeplitz covariance matrices. These are equivalent to yet another kind of random matrix ensembles, here called "acosh-normal" ensembles. Orthogonal and unitary "acosh-normal" ensembles correspond to the cases of block-Toeplitz with Toeplitz blocks, and block-Toeplitz (with general blocks) covariance matrices, respectively. △ Less

Submitted 26 December, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

arXiv:2106.08953 [pdf, other]

doi 10.1109/TIT.2022.3199479

Gaussian distributions on Riemannian symmetric spaces, random matrices, and planar Feynman diagrams

Authors: Simon Heuveline, Salem Said, Cyrus Mostajeran

Abstract: Gaussian distributions can be generalized from Euclidean space to a wide class of Riemannian manifolds. Gaussian distributions on manifolds are harder to make use of in applications since the normalisation factors, which we will refer to as partition functions, are complicated, intractable integrals in general that depend in a highly non-linear way on the mean of the given distribution. Nonetheles… ▽ More Gaussian distributions can be generalized from Euclidean space to a wide class of Riemannian manifolds. Gaussian distributions on manifolds are harder to make use of in applications since the normalisation factors, which we will refer to as partition functions, are complicated, intractable integrals in general that depend in a highly non-linear way on the mean of the given distribution. Nonetheless, on Riemannian symmetric spaces, the partition functions are independent of the mean and reduce to integrals over finite dimensional vector spaces. These are generally still hard to compute numerically when the dimension (more precisely the rank $N$) of the underlying symmetric space gets large. On the space of positive definite Hermitian matrices, it is possible to compute these integrals exactly using methods from random matrix theory and the so-called Stieltjes-Wigert polynomials. In other cases of interest to applications, such as the space of symmetric positive definite (SPD) matrices or the Siegel domain (related to block-Toeplitz covariance matrices), these methods seem not to work quite as well. Nonetheless, it remains possible to compute leading order terms in a large $N$ limit, which provide increasingly accurate approximations as $N$ grows. This limit is inspired by realizing a given partition function as the partition function of a zero-dimensional quantum field theory or even Chern-Simons theory. From this point of view the large $N$ limit arises naturally and saddle-point methods, Feynman diagrams, and certain universalities that relate different spaces emerge. △ Less

Submitted 2 June, 2021; originally announced June 2021.

arXiv:2102.07556 [pdf, other]

Gaussian distributions on Riemannian symmetric spaces in the large N limit

Authors: Simon Heuveline, Salem Said, Cyrus Mostajeran

Abstract: We consider Gaussian distributions on certain Riemannian symmetric spaces. In contrast to the Euclidean case, it is challenging to compute the normalization factors of such distributions, which we refer to as partition functions. In some cases, such as the space of Hermitian positive definite matrices or hyperbolic space, it is possible to compute them exactly using techniques from random matrix t… ▽ More We consider Gaussian distributions on certain Riemannian symmetric spaces. In contrast to the Euclidean case, it is challenging to compute the normalization factors of such distributions, which we refer to as partition functions. In some cases, such as the space of Hermitian positive definite matrices or hyperbolic space, it is possible to compute them exactly using techniques from random matrix theory. However, in most cases which are important to applications, such as the space of symmetric positive definite (SPD) matrices or the Siegel domain, this is only possible numerically. Moreover, when we consider, for instance, high-dimensional SPD matrices, the known algorithms for computing partition functions can become exceedingly slow. Motivated by notions from theoretical physics, we will discuss how to approximate the partition functions in the large $N$ limit: an approximation that gets increasingly better as the dimension of the underlying symmetric space (more precisely, its rank) gets larger. We will give formulas for leading order terms in the case of SPD matrices and related spaces. Furthermore, we will characterize the large $N$ limit of the Siegel domain through a singular integral equation arising as a saddle-point equation. △ Less

Submitted 15 May, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

Showing 1–3 of 3 results for author: Heuveline, S