-
Precision Annealing Monte Carlo Methods for Statistical Data Assimilation: Metropolis-Hastings Procedures
Authors:
Adrian S. Wong,
Kangbo Hao,
Zheng Fang,
Henry D. I. Abarbanel
Abstract:
Statistical Data Assimilation (SDA) is the transfer of information from field or laboratory observations to a user selected model of the dynamical system producing those observations. The data is noisy and the model has errors; the information transfer addresses properties of the conditional probability distribution of the states of the model conditioned on the observations. The quantities of inte…
▽ More
Statistical Data Assimilation (SDA) is the transfer of information from field or laboratory observations to a user selected model of the dynamical system producing those observations. The data is noisy and the model has errors; the information transfer addresses properties of the conditional probability distribution of the states of the model conditioned on the observations. The quantities of interest in SDA are the conditional expected values of functions of the model state, and these require the approximate evaluation of high dimensional integrals. We introduce a conditional probability distribution and use the Laplace method with annealing to identify the maxima of the conditional probability distribution. The annealing method slowly increases the precision term of the model as it enters the Laplace method. In this paper, we extend the idea of precision annealing (PA) to Monte Carlo calculations of conditional expected values using Metropolis-Hastings methods.
△ Less
Submitted 14 January, 2019;
originally announced January 2019.
-
Strategic Monte Carlo Methods for State and Parameter Estimation in High Dimensional Nonlinear Problems
Authors:
Sasha Shirman,
Henry D. I. Abarbanel
Abstract:
In statistical data assimilation one seeks the largest maximum of the conditional probability distribution $P(\mathbf{X},\mathbf{p}|\mathbf{Y})$ of model states, $\mathbf{X}$, and parameters,$\mathbf{p}$, conditioned on observations $\mathbf{Y}$ through minimizing the `action', $A(\mathbf{X}) = -\log P(\mathbf{X},\mathbf{p}|\mathbf{Y})$. This determines the dominant contribution to the expected va…
▽ More
In statistical data assimilation one seeks the largest maximum of the conditional probability distribution $P(\mathbf{X},\mathbf{p}|\mathbf{Y})$ of model states, $\mathbf{X}$, and parameters,$\mathbf{p}$, conditioned on observations $\mathbf{Y}$ through minimizing the `action', $A(\mathbf{X}) = -\log P(\mathbf{X},\mathbf{p}|\mathbf{Y})$. This determines the dominant contribution to the expected values of functions of $\mathbf{X}$ but does not give information about the structure of $P(\mathbf{X},\mathbf{p}|\mathbf{Y})$ away from the maximum. We introduce a Monte Carlo sampling method, called Strategic Monte Carlo (SMC) sampling, for estimating $P(\mathbf{X}, \mathbf{p}|\mathbf{Y})$ in the neighborhood of its largest maximum to remedy this limitation. SMC begins with a systematic variational annealing (VA) procedure for finding the smallest minimum of $A(\mathbf{X})$. SMC generates accurate estimates for the mean, standard deviation and other higher moments of $P(\mathbf{X},\mathbf{p}|\mathbf{Y})$. Additionally, the random search allows for an understanding of any multimodal structure that may underly the dynamics of the problem. SMC generates a gaussian probability control term based on the paths determined by VA to minimize a cost function $A(\mathbf{X},\mathbf{p})$. This probability is sampled during the Monte Carlo search of the cost function to constrain the search to high probability regions of the surface thus substantially reducing the time necessary to sufficiently explore the space.
△ Less
Submitted 24 May, 2018;
originally announced May 2018.
-
Machine Learning as Statistical Data Assimilation
Authors:
H. D. I. Abarbanel,
P. J. Rozdeba,
S. Shirman
Abstract:
We identify a strong equivalence between neural network based machine learning (ML) methods and the formulation of statistical data assimilation (DA), known to be a problem in statistical physics. DA, as used widely in physical and biological sciences, systematically transfers information in observations to a model of the processes producing the observations. The correspondence is that layer label…
▽ More
We identify a strong equivalence between neural network based machine learning (ML) methods and the formulation of statistical data assimilation (DA), known to be a problem in statistical physics. DA, as used widely in physical and biological sciences, systematically transfers information in observations to a model of the processes producing the observations. The correspondence is that layer label in the ML setting is the analog of time in the data assimilation setting. Utilizing aspects of this equivalence we discuss how to establish the global minimum of the cost functions in the ML context, using a variational annealing method from DA. This provides a design method for optimal networks for ML applications and may serve as the basis for understanding the success of "deep learning". Results from an ML example are presented.
When the layer label is taken to be continuous, the Euler-Lagrange equation for the ML optimization problem is an ordinary differential equation, and we see that the problem being solved is a two point boundary value problem. The use of continuous layers is denoted "deepest learning". The Hamiltonian version provides a direct rationale for back propagation as a solution method for the canonical momentum; however, it suggests other solution methods are to be preferred.
△ Less
Submitted 19 October, 2017;
originally announced October 2017.