-
Graph neural networks and non-commuting operators
Authors:
Mauricio Velasco,
Kaiying O'Hare,
Bernardo Rychtenberg,
Soledad Villar
Abstract:
Graph neural networks (GNNs) provide state-of-the-art results in a wide variety of tasks which typically involve predicting features at the vertices of a graph. They are built from layers of graph convolutions which serve as a powerful inductive bias for describing the flow of information among the vertices. Often, more than one data modality is available. This work considers a setting in which several graphs have the same vertex set and a common vertex-level learning task. This generalizes standard GNN models to GNNs with several graph operators that do not commute. We may call this model graph-tuple neural networks (GtNN).
In this work, we develop the mathematical theory to address the stability and transferability of GtNNs using properties of non-commuting non-expansive operators. We develop a limit theory of graphon-tuple neural networks and use it to prove a universal transferability theorem that guarantees that all graph-tuple neural networks are transferable on convergent graph-tuple sequences. In particular, there is no non-transferable energy under the convergence we consider here. Our theoretical results extend well-known transferability theorems for GNNs to the case of several simultaneous graphs (GtNNs) and provide a strict improvement on what is currently known even in the GNN case.
We illustrate our theoretical results with simple experiments on synthetic and real-world data. To this end, we derive a training procedure that provably enforces the stability of the resulting model.
Submitted 6 November, 2024;
originally announced November 2024.
-
Estimation of large covariance matrices via free deconvolution: computational and statistical aspects
Authors:
Reda Chhaibi,
Fabrice Gamboa,
Slim Kammoun,
Mauricio Velasco
Abstract:
The estimation of large covariance matrices has a high-dimensional bias. Correcting for this bias can be reformulated via the tool of Free Probability Theory as a free deconvolution.
The goal of this work is a computational and statistical resolution of this problem. Our approach is based on complex-analytic methods to invert $S$-transforms. In particular, one needs a theoretical understanding of the Riemann surfaces where multivalued $S$-transforms live and an efficient computational scheme.
Submitted 9 May, 2023;
originally announced May 2023.
-
Fundamental Diagram of Traffic Flow from Prigogine-Herman-Enskog Equation
Authors:
W. Marques Jr.,
A. R. Mendez,
R. M. Velasco
Abstract:
Recent applications of a new methodology to measure fundamental traffic relations on freeways show that many of the critical parameters of the flow-density and speed-spacing diagrams depend on vehicle length. In response to this fact, we present in this work a generalization of the Prigogine-Herman traffic equation for aggressive drivers which takes into account the fact that vehicles are not point-like objects but have an effective length. Our approach is similar to that introduced by Enskog for dense gases and provides the construction of fundamental diagrams which are in excellent agreement with empirical traffic data.
Submitted 14 February, 2019;
originally announced February 2019.
-
Local angles and dimension estimation from data on manifolds
Authors:
Mateo Díaz,
Adolfo J. Quiroz,
Mauricio Velasco
Abstract:
For data living in a manifold $M\subseteq \mathbb{R}^m$ and a point $p\in M$ we consider a statistic $U_{k,n}$ which estimates the variance of the angle between pairs of vectors $X_i-p$ and $X_j-p$, for data points $X_i$, $X_j$, near $p$, and evaluate this statistic as a tool for estimation of the intrinsic dimension of $M$ at $p$. Consistency of the local dimension estimator is established and the asymptotic distribution of $U_{k,n}$ is found under minimal regularity assumptions. Performance of the proposed methodology is compared against state-of-the-art methods on simulated data.
Submitted 3 May, 2018;
originally announced May 2018.