-
Consistent Infill Estimability of the Regression Slope Between Gaussian Random Fields Under Spatial Confounding
Authors:
Abhirup Datta,
Michael L. Stein
Abstract:
The problem of estimating the slope parameter in regression between two spatial processes under confounding by an unmeasured spatial process has received widespread attention in the recent statistical literature. Yet, a fundamental question remains unsolved: when is this slope consistently estimable under spatial confounding, with existing insights being largely empirical or estimator-specific. In…
▽ More
The problem of estimating the slope parameter in regression between two spatial processes under confounding by an unmeasured spatial process has received widespread attention in the recent statistical literature. Yet, a fundamental question remains unsolved: when is this slope consistently estimable under spatial confounding, with existing insights being largely empirical or estimator-specific. In this manuscript, we characterize conditions for consistent estimability of the regression slope between Gaussian random fields (GRFs). Under fixed-domain (infill) asymptotics, we give sufficient conditions for consistent estimability using a novel characterization of the regression slope as the ratio of principal irregular terms of covariances, dictating the relative local behavior of the exposure and confounder processes. When estimability holds, we provide consistent estimators of the slope using local differencing (taking discrete differences or Laplacians of the processes of suitable order). Using functional analysis results on Paley-Wiener spaces, we then provide an easy-to-verify necessary condition for consistent estimability of the slope in terms of the relative spectral tail decays of the confounder and exposure. As a by-product, we establish a novel and general spectral condition on the equivalence of measures on the paths of multivariate GRFs with component fields of varying smoothnesses, a result of independent importance. We show that for the Matérn, power-exponential, generalized Cauchy, and coregionalization families, the necessary and sufficient conditions become identical, thereby providing a complete characterization of consistent estimability of the slope under spatial confounding. The results are extended to accommodate measurement error using local-averaging-and-differencing based estimators. Finite sample behavior is explored via numerical experiments.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
Scalable Gaussian Process Computations Using Hierarchical Matrices
Authors:
Christopher J. Geoga,
Mihai Anitescu,
Michael L. Stein
Abstract:
We present a kernel-independent method that applies hierarchical matrices to the problem of maximum likelihood estimation for Gaussian processes. The proposed approximation provides natural and scalable stochastic estimators for its gradient and Hessian, as well as the expected Fisher information matrix, that are computable in quasilinear $O(n \log^2 n)$ complexity for a large range of models. To…
▽ More
We present a kernel-independent method that applies hierarchical matrices to the problem of maximum likelihood estimation for Gaussian processes. The proposed approximation provides natural and scalable stochastic estimators for its gradient and Hessian, as well as the expected Fisher information matrix, that are computable in quasilinear $O(n \log^2 n)$ complexity for a large range of models. To accomplish this, we (i) choose a specific hierarchical approximation for covariance matrices that enables the computation of their exact derivatives and (ii) use a stabilized form of the Hutchinson stochastic trace estimator. Since both the observed and expected information matrices can be computed in quasilinear complexity, covariance matrices for MLEs can also be estimated efficiently. After discussing the associated mathematics, we demonstrate the scalability of the method, discuss details of its implementation, and validate that the resulting MLEs and confidence intervals based on the inverse Fisher information matrix faithfully approach those obtained by the exact likelihood.
△ Less
Submitted 22 March, 2019; v1 submitted 9 August, 2018;
originally announced August 2018.
-
Linear-Cost Covariance Functions for Gaussian Random Fields
Authors:
Jie Chen,
Michael L. Stein
Abstract:
Gaussian random fields (GRF) are a fundamental stochastic model for spatiotemporal data analysis. An essential ingredient of GRF is the covariance function that characterizes the joint Gaussian distribution of the field. Commonly used covariance functions give rise to fully dense and unstructured covariance matrices, for which required calculations are notoriously expensive to carry out for large…
▽ More
Gaussian random fields (GRF) are a fundamental stochastic model for spatiotemporal data analysis. An essential ingredient of GRF is the covariance function that characterizes the joint Gaussian distribution of the field. Commonly used covariance functions give rise to fully dense and unstructured covariance matrices, for which required calculations are notoriously expensive to carry out for large data. In this work, we propose a construction of covariance functions that result in matrices with a hierarchical structure. Empowered by matrix algorithms that scale linearly with the matrix dimension, the hierarchical structure is proved to be efficient for a variety of random field computations, including sampling, kriging, and likelihood evaluation. Specifically, with $n$ scattered sites, sampling and likelihood evaluation has an $O(n)$ cost and kriging has an $O(\log n)$ cost after preprocessing, particularly favorable for the kriging of an extremely large number of sites (e.g., predicting on more sites than observed). We demonstrate comprehensive numerical experiments to show the use of the constructed covariance functions and their appealing computation time. Numerical examples on a laptop include simulated data of size up to one million, as well as a climate data product with over two million observations.
△ Less
Submitted 7 November, 2020; v1 submitted 15 November, 2017;
originally announced November 2017.
-
On a class of space-time intrinsic random functions
Authors:
Michael L. Stein
Abstract:
Power law generalized covariance functions provide a simple model for describing the local behavior of an isotropic random field. This work seeks to extend this class of covariance functions to spatial-temporal processes for which the degree of smoothness in space and in time may differ while maintaining other desirable properties for the covariance functions, including the availability of explici…
▽ More
Power law generalized covariance functions provide a simple model for describing the local behavior of an isotropic random field. This work seeks to extend this class of covariance functions to spatial-temporal processes for which the degree of smoothness in space and in time may differ while maintaining other desirable properties for the covariance functions, including the availability of explicit convergent and asymptotic series expansions.
△ Less
Submitted 19 March, 2013;
originally announced March 2013.
-
When does the screening effect hold?
Authors:
Michael L. Stein
Abstract:
When using optimal linear prediction to interpolate point observations of a mean square continuous stationary spatial process, one often finds that the interpolant mostly depends on those observations located nearest to the predictand. This phenomenon is called the screening effect. However, there are situations in which a screening effect does not hold in a reasonable asymptotic sense, and theore…
▽ More
When using optimal linear prediction to interpolate point observations of a mean square continuous stationary spatial process, one often finds that the interpolant mostly depends on those observations located nearest to the predictand. This phenomenon is called the screening effect. However, there are situations in which a screening effect does not hold in a reasonable asymptotic sense, and theoretical support for the screening effect is limited to some rather specialized settings for the observation locations. This paper explores conditions on the observation locations and the process model under which an asymptotic screening effect holds. A series of examples shows the difficulty in formulating a general result, especially for processes with different degrees of smoothness in different directions, which can naturally occur for spatial-temporal processes. These examples lead to a general conjecture and two special cases of this conjecture are proven. The key condition on the process is that its spectral density should change slowly at high frequencies. Models not satisfying this condition of slow high-frequency change should be used with caution.
△ Less
Submitted 8 March, 2012;
originally announced March 2012.
-
Estimating deformations of isotropic Gaussian random fields on the plane
Authors:
Ethan B. Anderes,
Michael L. Stein
Abstract:
This paper presents a new approach to the estimation of the deformation of an isotropic Gaussian random field on $\mathbb{R}^2$ based on dense observations of a single realization of the deformed random field. Under this framework we investigate the identification and estimation of deformations. We then present a complete methodological package--from model assumptions to algorithmic recovery of…
▽ More
This paper presents a new approach to the estimation of the deformation of an isotropic Gaussian random field on $\mathbb{R}^2$ based on dense observations of a single realization of the deformed random field. Under this framework we investigate the identification and estimation of deformations. We then present a complete methodological package--from model assumptions to algorithmic recovery of the deformation--for the class of nonstationary processes obtained by deforming isotropic Gaussian random fields.
△ Less
Submitted 4 April, 2008;
originally announced April 2008.