-
Greedy Stein Variational Gradient Descent: An algorithmic approach for wave prospection problems
Authors:
Jose L. Varona-Santana,
Marcos A. Capistrán
Abstract:
In this project, we propose a Variational Inference algorithm to approximate posterior distributions. Building on prior methods, we develop the Gradient-Steered Stein Variational Gradient Descent (G-SVGD) approach. This method introduces a novel loss function that combines a weighted gradient and the Evidence Lower Bound (ELBO) to enhance convergence speed and accuracy. The learning rate is determined through a suboptimal minimization of this loss function within a gradient descent framework.
The G-SVGD method is compared against the standard Stein Variational Gradient Descent (SVGD) approach, employing the ADAM optimizer for learning rate adaptation, as well as the Markov Chain Monte Carlo (MCMC) method. We assess performance in two wave prospection models representing low-contrast and high-contrast subsurface scenarios. To achieve robust numerical approximations in the forward model solver, a five-point operator is employed, while the adjoint method improves accuracy in computing gradients of the log posterior.
Our findings demonstrate that G-SVGD accelerates convergence and offers improved performance in scenarios where gradient evaluation is computationally expensive. We highlight the algorithm's applicability to wave prospection models and its potential for broader applications in Bayesian inference. Finally, we discuss the benefits and limitations of G-SVGD, emphasizing its contribution to advancing computational efficiency in uncertainty quantification.
Submitted 31 January, 2025;
originally announced January 2025.
-
Bayesian sequential data assimilation for COVID-19 forecasting
Authors:
Maria L. Daza-Torres,
Marcos A. Capistrán,
Antonio Capella,
J. Andrés Christen
Abstract:
We introduce a Bayesian sequential data assimilation method for COVID-19 forecasting. It is assumed that suitable transmission, epidemic and observation models are available and previously validated and the transmission and epidemic models are coded into a dynamical system. The observation model depends on the dynamical system state variables and parameters, and is cast as a likelihood function. We elicit prior distributions of the effective population size, the dynamical system initial conditions and infectious contact rate, and use Markov Chain Monte Carlo sampling to make inference and prediction of quantities of interest (QoI) at the onset of the epidemic outbreak. The forecast is sequentially updated over a sliding window of epidemic records as new data becomes available. Prior distributions for the state variables at the new forecasting time are assembled using the dynamical system, calibrated for the previous forecast. Moreover, changes in the contact rate and effective population size are naturally introduced through auto-regressive models on the corresponding parameters. We show our forecasting method's performance using a SEIR type model and COVID-19 data from several Mexican localities.
Submitted 10 March, 2021;
originally announced March 2021.
-
Bayesian Analysis of Glucose Dynamics during the Oral Glucose Tolerance Test (OGTT)
Authors:
Hugo Flores-Arguedas,
Marcos A. Capistrán
Abstract:
In this paper, we propose a model of the dynamics of the blood glucose level during an Oral Glucose Tolerance Test (OGTT). This dynamic includes the action of insulin and glucagon in the glucose homeostasis process as the reaction to an oral stimulus. We propose a Bayesian approach to the inference of five parameters related to insulin secretion, glucagon secretion, gastrointestinal emptying, and basal glucose level. Two insulin indicators related to the glucose level in blood and in the gastrointestinal tract allow us to suggest a classification for patients with impaired insulin sensitivity.
Submitted 11 May, 2021; v1 submitted 5 December, 2020;
originally announced December 2020.
-
Filtering and improved Uncertainty Quantification in the dynamic estimation of effective reproduction numbers
Authors:
Marcos A. Capistrán,
Antonio Capella,
J. Andrés Christen
Abstract:
The effective reproduction number $R_t$ measures an infectious disease's transmissibility as the number of secondary infections in one reproduction time in a population having both susceptible and non-susceptible hosts. Current approaches do not quantify the uncertainty correctly in estimating $R_t$, as expected by the observed variability in contagion patterns. We elaborate on the Bayesian estimation of $R_t$ by improving on the Poisson sampling model of Cori et al. (2013). By adding an autoregressive latent process, we build a Dynamic Linear Model on the log of the observed $R_t$ values, resulting in a filtering type Bayesian inference. We use a conjugate analysis, and all calculations are explicit. Results show an improved uncertainty quantification on the estimation of $R_t$, with a reliable method that could safely be used by non-experts and within other forecasting systems. We illustrate our approach with recent data from the current COVID-19 epidemic in Mexico.
Submitted 3 December, 2020;
originally announced December 2020.
-
Error control in the numerical posterior distribution in the Bayesian UQ analysis of a semilinear evolution PDE
Authors:
Maria L. Daza-Torres,
J. Cricelio Montesinos-López,
Marcos A. Capistrán,
J. Andrés Christen,
Heikki Haario
Abstract:
We elaborate on results obtained in \cite{christen2018} for controlling the numerical posterior error for Bayesian UQ problems, now considering forward maps arising from the solution of a semilinear evolution partial differential equation. Results in \cite{christen2018} demand an estimate for the absolute global error (AGE) of the numeric forward map. Our contribution is a numerical method for computing the AGE for semilinear evolution PDEs, which shows the potential applicability of \cite{christen2018} to this important and wide-ranging family of PDEs. Numerical examples are given to illustrate the efficiency of the proposed method, obtaining numerical posterior distributions for unknown parameters that are nearly identical to the corresponding theoretical posterior, by keeping their Bayes factor close to 1.
Submitted 5 November, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Estimating a pressure dependent thermal conductivity coefficient with applications in food technology
Authors:
Marcos A Capistran,
Juan Antonio Infante del Rio
Abstract:
In this paper we introduce a method to estimate a pressure dependent thermal conductivity coefficient arising in a heat diffusion model with applications in food technology. To address the known smoothing effect of the direct problem, we model the uncertainty of the conductivity coefficient as a hierarchical Gaussian Markov random field (GMRF) restricted to uniqueness conditions for the solution of the inverse problem established in Fraguela et al. Furthermore, we propose a Single Variable Exchange Metropolis-Hastings algorithm to sample the corresponding conditional probability distribution of the conductivity coefficient given observations of the temperature. Sensitivity analysis of the direct problem suggests that large integration times are necessary to identify the thermal conductivity coefficient. Numerical evidence indicates that a signal to noise ratio of roughly 1000 suffices to reliably retrieve the thermal conductivity coefficient.
Submitted 7 March, 2019;
originally announced March 2019.
-
A computational geometry method for the inverse scattering problem
Authors:
Maria L. Daza-Torres,
Juan Antonio Infante del Río,
Marcos A. Capistrán,
J. Andrés Christen
Abstract:
In this paper we demonstrate a computational method to solve the inverse scattering problem for a star-shaped, smooth, penetrable obstacle in 2D. Our method is based on classical ideas from computational geometry. First, we approximate the support of a scatterer by a point cloud. Second, we use the Bayesian paradigm to model the joint conditional probability distribution of the non-convex hull of the point cloud and the constant refractive index of the scatterer given near field data. Of note, we use the non-convex hull of the point cloud as spline control points to evaluate, on a finer mesh, the volume potential arising in the integral equation formulation of the direct problem. Finally, in order to sample the arising posterior distribution, we propose a probability transition kernel that commutes with affine transformations of space. Our findings indicate that our method reliably retrieves the support and constant refractive index of the scatterer simultaneously. Indeed, our sampling method robustly estimates a quantity of interest such as the area of the scatterer. We conclude by pointing out a series of generalizations of our method.
Submitted 23 July, 2018;
originally announced July 2018.
-
Inference for stochastic kinetic models from multiple data sources for joint estimation of infection dynamics from aggregate reports and virological data
Authors:
Oksana A. Chkrebtii,
Yury E. García,
Marcos A. Capistrán,
Daniel E. Noyola
Abstract:
Before the current pandemic, influenza and respiratory syncytial virus (RSV) were the leading etiological agents of seasonal acute respiratory infections (ARI) around the world. In this setting, medical doctors typically based the diagnosis of ARI on patients' symptoms alone and did not routinely conduct virological tests necessary to identify individual viruses, limiting the ability to study the interaction between multiple pathogens and to make public health recommendations. We consider a stochastic kinetic model (SKM) for two interacting ARI pathogens circulating in a large population and an empirically-motivated background process for infections with other pathogens causing similar symptoms. An extended marginal sampling approach, based on the linear noise approximation to the SKM, integrates multiple data sources and additional model components. We infer the parameters defining the pathogens' dynamics and interaction within a Bayesian model and explore the posterior trajectories of infections for each illness based on aggregate infection reports from six epidemic seasons collected by the state health department and a subset of virological tests from a sentinel program at a general hospital in San Luis Potosí, México. We interpret the results and make recommendations for future data collection strategies.
Submitted 17 February, 2022; v1 submitted 27 October, 2017;
originally announced October 2017.
-
Numerical posterior distribution error control and expected Bayes Factors in the bayesian Uncertainty Quantification of Inverse Problems
Authors:
J. Andrés Christen,
Marcos A. Capistrán,
Miguel Ángel Moreles
Abstract:
In the most relevant cases of the Bayesian analysis of Inverse Problems, the forward maps (FM, or regressor function) are defined in terms of a system of (O, P)DEs with intractable solutions. These necessarily involve a numerical method to find approximate versions of such solutions and lead to a numerical/approximate posterior distribution. Recently, several results have been published on the regularity conditions required on such numerical methods to ensure convergence of the numerical to the theoretical posterior. However, more practical guidelines are needed to ensure a suitable working numerical posterior. [Capistran2016] prove for ODEs that the Bayes Factor of the approximate vs the theoretical model tends to 1 in the same order as the numerical method order. In this work we generalize the latter paper in that we 1) also consider PDEs, 2) consider correlated observations, 3) give practical guidelines in a multidimensional setting and 4) explore the use of expected Bayes Factors. This permits us to obtain bounds on the absolute global errors to be tolerated by the FM numerical solver, which we illustrate with some examples. Since the Bayes Factor is kept above 0.95 we expect that the resulting numerical posterior is basically indistinguishable from the theoretical posterior, even though we are using an approximate numerical FM. The method is illustrated with some examples using synthetic data.
Submitted 29 August, 2017; v1 submitted 7 July, 2016;
originally announced July 2016.
-
An analysis of the interaction between influenza and respiratory syncytial virus based on acute respiratory infection records
Authors:
Yendry N. Arguedas-Flatts,
Marcos A. Capistrán,
J. Andrés Christen,
Daniel E. Noyola
Abstract:
Under the hypothesis that influenza and respiratory syncytial virus (RSV) are the two leading causes of acute respiratory infections (ARI), in this paper we use a standard two-pathogen epidemic model as a regressor to explain, on a yearly basis, high season ARI data in terms of the contact rates and initial conditions of the mathematical model. The rationale is that ARI high season is a transient regime of a noisy system, e.g., the system is driven away from equilibrium every year by fluctuations in variables such as humidity, temperature, viral mutations and human behavior. Using the value of the replacement number as a phenotypic trait associated with fitness, we provide evidence that influenza and RSV coexist throughout the ARI high season through superinfection.
Submitted 29 November, 2013;
originally announced December 2013.
-
Towards Uncertainty Quantification and Inference in the stochastic SIR Epidemic Model
Authors:
Marcos A. Capistrán,
J. Andrés Christen,
Jorge X. Velasco-Hernández
Abstract:
In this paper we introduce a novel method to conduct inference with models defined through a continuous-time Markov process, and we apply these results to a classical stochastic SIR model as a case study. Using the inverse-size expansion of van Kampen we obtain approximations for first and second moments for the state variables. These approximate moments are in turn matched to the moments of an inputed generic discrete distribution aimed at generating an approximate likelihood that is valid for both low count and high count data. We conduct a full Bayesian inference to estimate epidemic parameters using informative priors. Excellent estimations and predictions are obtained both in a synthetic data scenario and in two Dengue fever case studies.
Submitted 9 November, 2011;
originally announced November 2011.