-
Nonparametric regression on random geometric graphs sampled from submanifolds
Authors:
Paul Rosa,
Judith Rousseau
Abstract:
We consider the nonparametric regression problem when the covariates are located on an unknown smooth compact submanifold of a Euclidean space. By defining a random geometric graph structure over the covariates, we analyze the asymptotic frequentist behaviour of the posterior distribution arising from Bayesian priors designed through random basis expansion in the graph Laplacian eigenbasis. Under Hölder smoothness assumptions on the regression function and the density of the covariates over the submanifold, we prove that the posterior contraction rates of such methods are minimax optimal (up to logarithmic factors) for any positive smoothness index.
Submitted 4 November, 2024; v1 submitted 31 May, 2024;
originally announced May 2024.
-
Posterior Contraction Rates for Matérn Gaussian Processes on Riemannian Manifolds
Authors:
Paul Rosa,
Viacheslav Borovitskiy,
Alexander Terenin,
Judith Rousseau
Abstract:
Gaussian processes are used in many machine learning applications that rely on uncertainty quantification. Recently, computational tools for working with these models in geometric settings, such as when inputs lie on a Riemannian manifold, have been developed. This raises the question: can these intrinsic models be shown theoretically to lead to better performance, compared to simply embedding all relevant quantities into $\mathbb{R}^d$ and using the restriction of an ordinary Euclidean Gaussian process? To study this, we prove optimal contraction rates for intrinsic Matérn Gaussian processes defined on compact Riemannian manifolds. We also prove analogous rates for extrinsic processes using trace and extension theorems between manifold and ambient Sobolev spaces: somewhat surprisingly, the rates obtained turn out to coincide with those of the intrinsic processes, provided that their smoothness parameters are matched appropriately. We illustrate these rates empirically on a number of examples, which, mirroring prior work, show that intrinsic processes can achieve better performance in practice. Therefore, our work shows that finer-grained analyses are needed to distinguish between different levels of data-efficiency of geometric Gaussian processes, particularly in settings which involve small data set sizes and non-asymptotic behavior.
Submitted 29 October, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
The $β$-model for Random Graphs --- Regression, Cramér-Rao Bounds, and Hypothesis Testing
Authors:
Johan Wahlström,
Isaac Skog,
Patricio S. La Rosa,
Peter Händel,
Arye Nehorai
Abstract:
We develop a maximum-likelihood-based method for regression in a setting where the dependent variable is a random graph and covariates are available at the graph level. The model generalizes the well-known $β$-model for random graphs by replacing the constant model parameters with regression functions. Cramér-Rao bounds are derived for the undirected $β$-model, the directed $β$-model, and the generalized $β$-model. The corresponding maximum likelihood estimators are compared to the bounds by means of simulations. Moreover, examples are given of how to use the presented maximum likelihood estimators to test for directionality and significance. Finally, the applicability of the model is demonstrated using dynamic social network data describing communication among healthcare workers.
Submitted 14 November, 2016;
originally announced November 2016.