Search | arXiv e-print repository

On the impact of observation error correlations in data assimilation, with application to along-track altimeter data

Authors: Olivier Goux, Anthony Weaver, Selime Gürol, Oliver Guillet, Andrea Piacentini

Abstract: Data assimilation involves estimating the state of a system by combining observations from various sources with a background estimate of the state. The weights given to the observations and background state depend on their specified error covariance matrices. Observation errors are often assumed to be uncorrelated even though this assumption is inaccurate for many modern data-sets such as those fr… ▽ More Data assimilation involves estimating the state of a system by combining observations from various sources with a background estimate of the state. The weights given to the observations and background state depend on their specified error covariance matrices. Observation errors are often assumed to be uncorrelated even though this assumption is inaccurate for many modern data-sets such as those from satellite observing systems. As methods allowing for a more realistic representation of observation-error correlations are emerging, our aim in this article is to provide insight on their expected impact in data assimilation. First, we use a simple idealised system to analyse the effect of observation-error correlations on the spectral characteristics of the solution. Next, we assess the relevance of these results in a more realistic setting in which simulated alongtrack (nadir) altimeter observations with correlated errors are assimilated in a global ocean model using a three-dimensional variational assimilation (3D-Var) method. Correlated observation errors are modelled in the 3D-Var system using a diffusion operator. When the correlation length scale of observation error is small compared to that of background error, inflating the observation-error variances can mitigate most of the negative effects from neglecting the observation-error correlations. Accounting for observation-error correlations in this situation still outperforms variance inflation since it allows small-scale information in the observations to be more effectively extracted and does not affect the convergence of the minimization. Conversely, when the correlation length scale of observation error is large compared to that of background error, the effect of observation-error correlations cannot be properly approximated with variance inflation. However, the correlation model needs to be constructed carefully to ensure the minimization problem is adequately conditioned so that a robust solution can be obtained. Practical ways to achieve this are discussed. △ Less

Submitted 12 March, 2025; originally announced March 2025.

arXiv:2312.05068 [pdf, other]

Application of deep learning to the estimation of normalization coefficients in diffusion-based covariance models

Authors: Folke K Skrunes, Mayeul Destouches, Anthony Weaver, Guillaume Coulaud, Olivier Goux, Corentin Lapeyre

Abstract: Variational data assimilation in ocean models depends on the ability to model general correlation operators in the presence of coastlines. Grid-point filters based on diffusion operators are widely used for this purpose, but come with a computational bottleneck - the costly estimation of normalization factors for every model grid point. In this paper, we show that a simple convolutional neural net… ▽ More Variational data assimilation in ocean models depends on the ability to model general correlation operators in the presence of coastlines. Grid-point filters based on diffusion operators are widely used for this purpose, but come with a computational bottleneck - the costly estimation of normalization factors for every model grid point. In this paper, we show that a simple convolutional neural network can effectively learn these normalization factors with better accuracy than the current operational methods. Our network is tested with a two-dimensional diffusion operator from the NEMOVAR ocean data assimilation system, applied to a global ocean grid with approximately one degree horizontal resolution. The network is trained on exact normalization factors estimated by a brute-force method. Knowing that convolutional networks can only model translation-equivariant functions, we ensure that the normalization estimation problem is indeed translation-equivariant. Specifically, we show how the number of inputs of this problem can be reduced while preserving translation equivariance. Adding the distance to the coastline as an input channel is found to improve the performance of the network around coastlines. Extensions to three-dimensional diffusion and to higher horizontal resolutions are discussed. Removing the computational bottleneck associated with normalization opens the way to using adaptive correlation models for operational ocean data assimilation. The code for this work is publicly available at https://github.com/FolkeKS/DL-normalization/tree/core-features △ Less

Submitted 8 December, 2023; originally announced December 2023.

arXiv:2311.06069 [pdf, ps, other]

A filtered multilevel Monte Carlo method for estimating the expectation of cell-centered discretized random fields

Authors: Jérémy Briant, Paul Mycek, Mayeul Destouches, Olivier Goux, Serge Gratton, Selime Gürol, Ehouarn Simon, Anthony T. Weaver

Abstract: In this paper, we investigate the use of multilevel Monte Carlo (MLMC) methods for estimating the expectation of discretized random fields. Specifically, we consider a setting in which the input and output vectors of numerical simulators have inconsistent dimensions across the multilevel hierarchy. This motivates the introduction of grid transfer operators borrowed from multigrid methods. By adapt… ▽ More In this paper, we investigate the use of multilevel Monte Carlo (MLMC) methods for estimating the expectation of discretized random fields. Specifically, we consider a setting in which the input and output vectors of numerical simulators have inconsistent dimensions across the multilevel hierarchy. This motivates the introduction of grid transfer operators borrowed from multigrid methods. By adapting mathematical tools from multigrid methods, we perform a theoretical spectral analysis of the MLMC estimator of the expectation of discretized random fields, in the specific case of linear, symmetric and circulant simulators. We then propose filtered MLMC (F-MLMC) estimators based on a filtering mechanism similar to the smoothing process of multigrid methods, and we show that the filtering operators improve the estimation of both the small- and large-scale components of the variance, resulting in a reduction of the total variance of the estimator. Next, the conclusions of the spectral analysis are experimentally verified with a one-dimensional illustration. Finally, the proposed F-MLMC estimator is applied to the problem of estimating the discretized variance field of a diffusion-based covariance operator, which amounts to estimating the expectation of a discretized random field. The numerical experiments support the conclusions of the theoretical analysis even with non-linear simulators, and demonstrate the improvements brought by the F-MLMC estimator compared to both a crude MC and an unfiltered MLMC estimator. △ Less

Submitted 4 June, 2025; v1 submitted 10 November, 2023; originally announced November 2023.

MSC Class: 65C05; 62P12

arXiv:2212.02305 [pdf, other]

Impact of correlated observation errors on the convergence of the conjugate gradient algorithm in variational data assimilation

Authors: Olivier Goux, Selime Gürol, Anthony T. Weaver, Oliver Guillet, Youssef Diouane

Abstract: An important class of nonlinear weighted least-squares problems arises from the assimilation of observations in atmospheric and ocean models. In variational data assimilation, inverse error covariance matrices define the weighting matrices of the least-squares problem. For observation errors, a diagonal matrix (i.e., uncorrelated errors) is often assumed for simplicity even when observation errors… ▽ More An important class of nonlinear weighted least-squares problems arises from the assimilation of observations in atmospheric and ocean models. In variational data assimilation, inverse error covariance matrices define the weighting matrices of the least-squares problem. For observation errors, a diagonal matrix (i.e., uncorrelated errors) is often assumed for simplicity even when observation errors are suspected to be correlated. While accounting for observationerror correlations should improve the quality of the solution, it also affects the convergence rate of the minimization algorithms used to iterate to the solution. If the minimization process is stopped before reaching full convergence, which is usually the case in operational applications, the solution may be degraded even if the observation-error correlations are correctly accounted for. In this article, we explore the influence of the observation-error correlation matrix (R) on the convergence rate of a preconditioned conjugate gradient (PCG) algorithm applied to a one-dimensional variational data assimilation (1D-Var) problem. We design the idealised 1D-Var system to include two key features used in more complex systems: we use the background error covariance matrix (B) as a preconditioner (B-PCG); and we use a diffusion operator to model spatial correlations in B and R. Analytical and numerical results with the 1D-Var system show a strong sensitivity of the convergence rate of B-PCG to the parameters of the diffusion-based correlation models. Depending on the parameter choices, correlated observation errors can either speed up or slow down the convergence. In practice, a compromise may be required in the parameter specifications of B and R between staying close to the best available estimates on the one hand and ensuring an adequate convergence rate of the minimization algorithm on the other. △ Less

Submitted 5 December, 2022; originally announced December 2022.

Showing 1–4 of 4 results for author: Goux, O