Search | arXiv e-print repository

Partially stochastic deep learning with uncertainty quantification for model predictive heating control

Authors: Emma Hannula, Arttu Häkkinen, Antti Solonen, Felipe Uribe, Jana de Wiljes, Lassi Roininen

Abstract: Improving the energy efficiency of building heating systems is crucial for reducing global energy consumption and greenhouse gas emissions. Traditional control methods rely on static heating curves that are based solely on outdoor temperature, neglecting system state measurements, such as indoor temperature, and free heat sources, such as solar gain. A more effective strategy is model predictive c… ▽ More Improving the energy efficiency of building heating systems is crucial for reducing global energy consumption and greenhouse gas emissions. Traditional control methods rely on static heating curves that are based solely on outdoor temperature, neglecting system state measurements, such as indoor temperature, and free heat sources, such as solar gain. A more effective strategy is model predictive control (MPC), which optimizes heating control by incorporating system state predictions based on weather forecasts, among other factors. However, current industrial MPC solutions often employ simplified physics-inspired indoor temperature models, sacrificing accuracy for robustness and interpretability. To bridge this gap, we propose a partially stochastic deep learning (DL) architecture for building-specific indoor temperature modeling. Unlike most studies that evaluate model performance through simulations or limited test buildings, our experiments across a large dataset of 100 real-world buildings, covering various heating season conditions, demonstrate that the proposed model outperforms a widely used industrial physics-based model in predictive accuracy. The proposed DL architecture shows significant potential to improve thermal comfort and energy efficiency in heating MPC solutions. Although its computational cost is higher than that of the reference model, we discuss why this trade-off is manageable, even in large-scale applications. Unlike deterministic black-box approaches, the partially stochastic DL model offers a critical advantage by enabling pre-assessment of model feasibility through predictive uncertainty quantification. This work advances heating MPC, particularly for buildings with comprehensive datasets on their thermal behavior under various weather conditions. △ Less

Submitted 18 August, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

arXiv:2408.16594 [pdf, other]

Continuous Gaussian mixture solution for linear Bayesian inversion with application to Laplace priors

Authors: Rafael Flock, Yiqiu Dong, Felipe Uribe, Olivier Zahm

Abstract: We focus on Bayesian inverse problems with Gaussian likelihood, linear forward model, and priors that can be formulated as a Gaussian mixture. Such a mixture is expressed as an integral of Gaussian density functions weighted by a mixing density over the mixing variables. Within this framework, the corresponding posterior distribution also takes the form of a Gaussian mixture, and we derive the clo… ▽ More We focus on Bayesian inverse problems with Gaussian likelihood, linear forward model, and priors that can be formulated as a Gaussian mixture. Such a mixture is expressed as an integral of Gaussian density functions weighted by a mixing density over the mixing variables. Within this framework, the corresponding posterior distribution also takes the form of a Gaussian mixture, and we derive the closed-form expression for its posterior mixing density. To sample from the posterior Gaussian mixture, we propose a two-step sampling method. First, we sample the mixture variables from the posterior mixing density, and then we sample the variables of interest from Gaussian densities conditioned on the sampled mixing variables. However, the posterior mixing density is relatively difficult to sample from, especially in high dimensions. Therefore, we propose to replace the posterior mixing density by a dimension-reduced approximation, and we provide a bound in the Hellinger distance for the resulting approximate posterior. We apply the proposed approach to a posterior with Laplace prior, where we introduce two dimension-reduced approximations for the posterior mixing density. Our numerical experiments indicate that samples generated via the proposed approximations have very low correlation and are close to the exact posterior. △ Less

Submitted 29 August, 2024; originally announced August 2024.

arXiv:2403.13665 [pdf, other]

Bayesian inversion with Student's t priors based on Gaussian scale mixtures

Authors: Angelina Senchukova, Felipe Uribe, Lassi Roininen

Abstract: Many inverse problems focus on recovering a quantity of interest that is a priori known to exhibit either discontinuous or smooth behavior. Within the Bayesian approach to inverse problems, such structural information can be encoded using Markov random field priors. We propose a class of priors that combine Markov random field structure with Student's t distribution. This approach offers flexibili… ▽ More Many inverse problems focus on recovering a quantity of interest that is a priori known to exhibit either discontinuous or smooth behavior. Within the Bayesian approach to inverse problems, such structural information can be encoded using Markov random field priors. We propose a class of priors that combine Markov random field structure with Student's t distribution. This approach offers flexibility in modeling diverse structural behaviors depending on available data. Flexibility is achieved by including the degrees of freedom parameter of Student's t distribution in the formulation of the Bayesian inverse problem. To facilitate posterior computations, we employ Gaussian scale mixture representation for the Student's t Markov random field prior, which allows expressing the prior as a conditionally Gaussian distribution depending on auxiliary hyperparameters. Adopting this representation, we can derive most of the posterior conditional distributions in a closed form and utilize the Gibbs sampler to explore the posterior. We illustrate the method with two numerical examples: signal deconvolution and image deblurring. △ Less

Submitted 15 July, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

arXiv:2207.09147 [pdf, other]

Horseshoe priors for edge-preserving linear Bayesian inversion

Authors: Felipe Uribe, Yiqiu Dong, Per Christian Hansen

Abstract: In many large-scale inverse problems, such as computed tomography and image deblurring, characterization of sharp edges in the solution is desired. Within the Bayesian approach to inverse problems, edge-preservation is often achieved using Markov random field priors based on heavy-tailed distributions. Another strategy, popular in statistics, is the application of hierarchical shrinkage priors. An… ▽ More In many large-scale inverse problems, such as computed tomography and image deblurring, characterization of sharp edges in the solution is desired. Within the Bayesian approach to inverse problems, edge-preservation is often achieved using Markov random field priors based on heavy-tailed distributions. Another strategy, popular in statistics, is the application of hierarchical shrinkage priors. An advantage of this formulation lies in expressing the prior as a conditionally Gaussian distribution depending of global and local hyperparameters which are endowed with heavy-tailed hyperpriors. In this work, we revisit the shrinkage horseshoe prior and introduce its formulation for edge-preserving settings. We discuss a sampling framework based on the Gibbs sampler to solve the resulting hierarchical formulation of the Bayesian inverse problem. In particular, one of the conditional distributions is high-dimensional Gaussian, and the rest are derived in closed form by using a scale mixture representation of the heavy-tailed hyperpriors. Applications from imaging science show that our computational procedure is able to compute sharp edge-preserving posterior point estimates with reduced uncertainty. △ Less

Submitted 19 July, 2022; originally announced July 2022.

arXiv:2203.01030 [pdf, other]

Structural Gaussian Priors for Bayesian CT reconstruction of Subsea Pipes

Authors: Silja L. Christensen, Nicolai A. B. Riis, Felipe Uribe, Jakob S. Jørgensen

Abstract: A non-destructive testing (NDT) application of X-ray computed tomography (CT) is inspection of subsea pipes in operation via 2D cross-sectional scans. Data acquisition is time-consuming and costly due to the challenging subsea environment. Reducing the number of projections in a scan can yield time and cost savings, but compromises the reconstruction quality, if conventional reconstruction methods… ▽ More A non-destructive testing (NDT) application of X-ray computed tomography (CT) is inspection of subsea pipes in operation via 2D cross-sectional scans. Data acquisition is time-consuming and costly due to the challenging subsea environment. Reducing the number of projections in a scan can yield time and cost savings, but compromises the reconstruction quality, if conventional reconstruction methods are used. In this work we take a Bayesian approach to CT reconstruction and focus on designing an effective prior to make use of available structural information about the pipe geometry. We propose a new class of structural Gaussian priors to enforce expected material properties in different regions of the reconstructed image based on independent Gaussian priors in combination with global regularity through a Gaussian Markov Random Field (GMRF) prior. Numerical experiments with synthetic and real data show that the proposed structural Gaussian prior can reduce artifacts and enhance contrast in the reconstruction compared to using only a global GMRF prior or no prior at all. We show how the resulting posterior distribution can be efficiently sampled even for large-scale images, which is essential for practical NDT applications. △ Less

Submitted 21 September, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

Comments: 21 pages, 10 figures, presented at "10th International Conference on Inverse Problems in Engineering" in Italy, May 2022

MSC Class: 65R32; 65C20; 94A08; 65K10 ACM Class: G.3; G.1.6

arXiv:2104.06919 [pdf, other]

A hybrid Gibbs sampler for edge-preserving tomographic reconstruction with uncertain view angles

Authors: Felipe Uribe, Johnathan M. Bardsley, Yiqiu Dong, Per Christian Hansen, Nicolai A. B. Riis

Abstract: In computed tomography, data consist of measurements of the attenuation of X-rays passing through an object. The goal is to reconstruct the linear attenuation coefficient of the object's interior. For each position of the X-ray source, characterized by its angle with respect to a fixed coordinate system, one measures a set of data referred to as a view. A common assumption is that these view angle… ▽ More In computed tomography, data consist of measurements of the attenuation of X-rays passing through an object. The goal is to reconstruct the linear attenuation coefficient of the object's interior. For each position of the X-ray source, characterized by its angle with respect to a fixed coordinate system, one measures a set of data referred to as a view. A common assumption is that these view angles are known, but in some applications they are known with imprecision. We propose a framework to solve a Bayesian inverse problem that jointly estimates the view angles and an image of the object's attenuation coefficient. We also include a few hyperparameters that characterize the likelihood and the priors. Our approach is based on a Gibbs sampler where the associated conditional densities are simulated using different sampling schemes - hence the term hybrid. In particular, the conditional distribution associated with the reconstruction is nonlinear in the image pixels, non-Gaussian and high-dimensional. We approach this distribution by constructing a Laplace approximation that represents the target conditional locally at each Gibbs iteration. This enables sampling of the attenuation coefficients in an efficient manner using iterative reconstruction algorithms. The numerical results show that our algorithm is able to jointly identify the image and the view angles, while also providing uncertainty estimates of both. We demonstrate our method with 2D X-ray computed tomography problems using fan beam configurations. △ Less

Submitted 14 April, 2021; originally announced April 2021.

arXiv:2006.05496 [pdf, other]

Cross-entropy-based importance sampling with failure-informed dimension reduction for rare event simulation

Authors: Felipe Uribe, Iason Papaioannou, Youssef M. Marzouk, Daniel Straub

Abstract: The estimation of rare event or failure probabilities in high dimensions is of interest in many areas of science and technology. We consider problems where the rare event is expressed in terms of a computationally costly numerical model. Importance sampling with the cross-entropy method offers an efficient way to address such problems provided that a suitable parametric family of biasing densities… ▽ More The estimation of rare event or failure probabilities in high dimensions is of interest in many areas of science and technology. We consider problems where the rare event is expressed in terms of a computationally costly numerical model. Importance sampling with the cross-entropy method offers an efficient way to address such problems provided that a suitable parametric family of biasing densities is employed. Although some existing parametric distribution families are designed to perform efficiently in high dimensions, their applicability within the cross-entropy method is limited to problems with dimension of O(1e2). In this work, rather than directly building sampling densities in high dimensions, we focus on identifying the intrinsic low-dimensional structure of the rare event simulation problem. To this end, we exploit a connection between rare event simulation and Bayesian inverse problems. This allows us to adapt dimension reduction techniques from Bayesian inference to construct new, effectively low-dimensional, biasing distributions within the cross-entropy method. In particular, we employ the approach in [47], as it enables control of the error in the approximation of the optimal biasing distribution. We illustrate our method using two standard high-dimensional reliability benchmark problems and one structural mechanics application involving random fields. △ Less

Submitted 9 June, 2020; originally announced June 2020.

Showing 1–7 of 7 results for author: Uribe, F