-
Optimal low-rank approximations for linear Gaussian inverse problems on Hilbert spaces, Part II: posterior mean approximation
Authors:
Giuseppe Carere,
Han Cheng Lie
Abstract:
In this work, we construct optimal low-rank approximations for the Gaussian posterior distribution in linear Gaussian inverse problems. The parameter space is a separable Hilbert space of possibly infinite dimension, and the data space is assumed to be finite-dimensional. We consider various types of approximation families for the posterior. We first consider approximate posteriors in which the means vary among a class of either structure-preserving or structure-ignoring low-rank transformations of the data, and in which the posterior covariance is kept fixed. We give necessary and sufficient conditions for these approximating posteriors to be equivalent to the exact posterior, for all possible realisations of the data simultaneously. For such approximations, we measure approximation error with the Kullback-Leibler, Rényi and Amari $α$-divergences for $α\in(0,1)$, and with the Hellinger distance, all averaged over the data distribution. With these losses, we find the optimal approximations and formulate an equivalent condition for their uniqueness, extending the work in finite dimensions of Spantini et al. (SIAM J. Sci. Comput. 2015). We then consider joint approximation of the mean and covariance, by also varying the posterior covariance over the low-rank updates considered in Part I of this work. For the reverse Kullback-Leibler divergence, we show that the separate optimal approximations of the mean and of the covariance can be combined to yield an optimal joint approximation of the mean and covariance. In addition, we interpret the joint approximation with the optimal structure-ignoring approximate mean in terms of an optimal projector in parameter space.
Submitted 31 March, 2025;
originally announced March 2025.
-
Optimal low-rank approximations for linear Gaussian inverse problems on Hilbert spaces, Part I: posterior covariance approximation
Authors:
Giuseppe Carere,
Han Cheng Lie
Abstract:
For linear inverse problems with Gaussian priors and Gaussian observation noise, the posterior is Gaussian, with mean and covariance determined by the conditioning formula. Using the Feldman-Hajek theorem, we analyse the prior-to-posterior update and its low-rank approximation for infinite-dimensional Hilbert parameter spaces and finite-dimensional observations. We show that the posterior distribution differs from the prior on a finite-dimensional subspace, and construct low-rank approximations to the posterior covariance, while keeping the mean fixed. Since, in infinite dimensions, not all low-rank covariance approximations yield approximate posterior distributions which are equivalent to the posterior and prior distributions, we characterise the low-rank covariance approximations which do yield this equivalence, and their respective inverses, or `precisions'. For such approximations, a family of measure approximation problems is solved by identifying the low-rank approximations which are optimal for various losses simultaneously. These loss functions include the family of Rényi divergences, the Amari $α$-divergences for $α\in(0,1)$, the Hellinger metric and the Kullback-Leibler divergence. Our results extend those of Spantini et al. (SIAM J. Sci. Comput. 2015) to Hilbertian parameter spaces, and provide theoretical underpinning for the construction of low-rank approximations of discretised versions of the infinite-dimensional inverse problem, by formulating discretisation-independent results.
Submitted 4 April, 2025; v1 submitted 1 November, 2024;
originally announced November 2024.
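For orientation, the abstract above can be illustrated in a discretised, finite-dimensional setting. The sketch below is not the authors' Hilbert-space construction; it assumes the matrix form of the low-rank negative update of the prior covariance used in the finite-dimensional work of Spantini et al. that the abstract cites. The function name `lowrank_posterior_cov` and its arguments (forward matrix `G`, covariances `cov_prior` and `cov_noise`, rank `r`) are hypothetical names chosen for illustration.

```python
import numpy as np

def lowrank_posterior_cov(G, cov_prior, cov_noise, r):
    """Rank-r negative update of the prior covariance (finite-dimensional sketch).

    Assumes cov_prior and cov_noise are symmetric positive definite matrices
    and G is the (data_dim x param_dim) forward matrix.
    """
    # Factor the prior covariance as cov_prior = S S^T.
    S = np.linalg.cholesky(cov_prior)
    # Hessian of the negative log-likelihood: H = G^T cov_noise^{-1} G.
    H = G.T @ np.linalg.solve(cov_noise, G)
    # Eigenpairs of the prior-preconditioned Hessian S^T H S.
    lam, V = np.linalg.eigh(S.T @ H @ S)
    # Keep the r largest eigenvalues; clip tiny negative round-off values.
    idx = np.argsort(lam)[::-1][:r]
    lam_r = np.clip(lam[idx], 0.0, None)
    # Directions w_i = S v_i in parameter space.
    W = S @ V[:, idx]
    # cov_prior - sum_i lam_i / (1 + lam_i) w_i w_i^T
    return cov_prior - (W * (lam_r / (1.0 + lam_r))) @ W.T
```

Because the preconditioned Hessian has rank at most the data dimension, choosing `r` equal to the data dimension reproduces the exact posterior covariance in this sketch, which mirrors the statement that the posterior differs from the prior only on a finite-dimensional subspace.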
-
Generalised rank-constrained approximations of Hilbert-Schmidt operators on separable Hilbert spaces and applications
Authors:
Giuseppe Carere,
Han Cheng Lie
Abstract:
In this work we solve, for given bounded operators $B,C$ and Hilbert-Schmidt operator $M$ acting on potentially infinite-dimensional separable Hilbert spaces, the reduced rank approximation problem, $\min\{\lVert M-BXC\rVert_{L_2}:\ \text{dim ran}\ X\leq r\}.$ This extends the results of Sondermann (Statistische Hefte, 1986) and Friedland and Torokhti (SIAM J. Matrix Analysis and Applications, 2007), who study this problem in the case of matrices $M$, $B$, $C$, $X$, with an analysis that involves the Moore-Penrose inverse. In classical approximation problems that can be solved by the singular value decomposition or Moore-Penrose inverse, the solution satisfies a minimal norm property. Friedland and Torokhti state such a minimal norm property of the solution. We show that this minimal norm property does not hold in general and give a modified minimality property that does hold. We show that the solution may be discontinuous in infinite-dimensional settings. We give conditions for continuity of the solutions and construct continuous approximations when such conditions are not met. Finally, we study problems from signal processing, reduced rank regression and linear operator learning under a rank constraint. Our theoretical results enable us to explicitly find solutions to these problems and to characterise their existence, uniqueness and minimality property.
Submitted 3 April, 2025; v1 submitted 9 August, 2024;
originally announced August 2024.
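As a rough illustration of the matrix case mentioned in the abstract above, the sketch below computes one minimiser of $\lVert M-BXC\rVert_F$ over matrices $X$ of rank at most $r$. It assumes the standard finite-dimensional construction via the Moore-Penrose inverse and a truncated singular value decomposition; it says nothing about the infinite-dimensional results or the modified minimality property of the paper. The helper names `truncated_svd` and `reduced_rank_solution` are hypothetical.

```python
import numpy as np

def truncated_svd(A, r):
    """Best rank-r approximation of A in the Frobenius norm."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U[:, :r] * s[:r]) @ Vt[:r, :]

def reduced_rank_solution(M, B, C, r):
    """One minimiser of ||M - B X C||_F over matrices X with rank(X) <= r.

    Shapes: M is (m x n), B is (m x p), C is (q x n); the result is (p x q).
    """
    B_pinv = np.linalg.pinv(B)
    C_pinv = np.linalg.pinv(C)
    # Orthogonal projectors onto the range of B and onto the range of C^T.
    P_B = B @ B_pinv
    P_C = C_pinv @ C
    # Truncate the part of M that B X C can reach, then pull it back.
    core = truncated_svd(P_B @ M @ P_C, r)
    return B_pinv @ core @ C_pinv
```

With this choice, `B @ X @ C` equals the rank-`r` truncation of the projected matrix `P_B @ M @ P_C`, so the residual splits into the unreachable part of `M` plus the truncation error.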
-
A weighted POD-reduction approach for parametrized PDE-constrained Optimal Control Problems with random inputs and applications to environmental sciences
Authors:
Giuseppe Carere,
Maria Strazzullo,
Francesco Ballarin,
Gianluigi Rozza,
Rob Stevenson
Abstract:
Reduced basis approximations of Optimal Control Problems (OCPs) governed by steady partial differential equations (PDEs) with random parametric inputs are analyzed and constructed. Such approximations are based on a Reduced Order Model, which in this work is constructed using the method of weighted Proper Orthogonal Decomposition. This Reduced Order Model is then used to efficiently compute the reduced basis approximation for any outcome of the random parameter. We demonstrate that such OCPs are well-posed by applying the adjoint approach, which also works in the presence of admissibility constraints and in the case of non linear-quadratic OCPs, and thus is more general than the conventional Lagrangian approach. We also show that a step in the construction of these Reduced Order Models, known as the aggregation step, is not fundamental and can in principle be skipped for noncoercive problems, leading to a cheaper online phase. Numerical applications in three scenarios from environmental science are considered, in which the governing PDE is steady and the control is distributed. Various parameter distributions are taken, and several implementations of the weighted Proper Orthogonal Decomposition are compared by choosing different quadrature rules.
Submitted 19 October, 2021; v1 submitted 28 February, 2021;
originally announced March 2021.