Search | arXiv e-print repository

Data Consistency Approach to Model Validation

Authors: Andreas Svensson, Dave Zachariah, Petre Stoica, Thomas B. Schön

Abstract: In scientific inference problems, the underlying statistical modeling assumptions have a crucial impact on the end results. There exist, however, only a few automatic means for validating these fundamental modelling assumptions. The contribution in this paper is a general criterion to evaluate the consistency of a set of statistical models with respect to observed data. This is achieved by automat… ▽ More In scientific inference problems, the underlying statistical modeling assumptions have a crucial impact on the end results. There exist, however, only a few automatic means for validating these fundamental modelling assumptions. The contribution in this paper is a general criterion to evaluate the consistency of a set of statistical models with respect to observed data. This is achieved by automatically gauging the models' ability to generate data that is similar to the observed data. Importantly, the criterion follows from the model class itself and is therefore directly applicable to a broad range of inference problems with varying data types, ranging from independent univariate data to high-dimensional time-series. The proposed data consistency criterion is illustrated, evaluated and compared to several well-established methods using three synthetic and two real data sets. △ Less

Submitted 20 May, 2019; v1 submitted 17 August, 2018; originally announced August 2018.

Journal ref: IEEE Access, 7(1):59788-59796, 2019

arXiv:1712.02675 [pdf, other]

How consistent is my model with the data? Information-Theoretic Model Check

Authors: Andreas Svensson, Dave Zachariah, Thomas B. Schön

Abstract: The choice of model class is fundamental in statistical learning and system identification, no matter whether the class is derived from physical principles or is a generic black-box. We develop a method to evaluate the specified model class by assessing its capability of reproducing data that is similar to the observed data record. This model check is based on the information-theoretic properties… ▽ More The choice of model class is fundamental in statistical learning and system identification, no matter whether the class is derived from physical principles or is a generic black-box. We develop a method to evaluate the specified model class by assessing its capability of reproducing data that is similar to the observed data record. This model check is based on the information-theoretic properties of models viewed as data generators and is applicable to e.g. sequential data and nonlinear dynamical models. The method can be understood as a specific two-sided posterior predictive test. We apply the information-theoretic model check to both synthetic and real data and compare it with a classical whiteness test. △ Less

Submitted 19 December, 2017; v1 submitted 7 December, 2017; originally announced December 2017.

Comments: The title has been updated, but no other significant changes have been made from the previous version

arXiv:1711.10765 [pdf, other]

Learning nonlinear state-space models using smooth particle-filter-based likelihood approximations

Authors: Andreas Svensson, Fredrik Lindsten, Thomas B. Schön

Abstract: When classical particle filtering algorithms are used for maximum likelihood parameter estimation in nonlinear state-space models, a key challenge is that estimates of the likelihood function and its derivatives are inherently noisy. The key idea in this paper is to run a particle filter based on a current parameter estimate, but then use the output from this particle filter to re-evaluate the lik… ▽ More When classical particle filtering algorithms are used for maximum likelihood parameter estimation in nonlinear state-space models, a key challenge is that estimates of the likelihood function and its derivatives are inherently noisy. The key idea in this paper is to run a particle filter based on a current parameter estimate, but then use the output from this particle filter to re-evaluate the likelihood function approximation also for other parameter values. This results in a (local) deterministic approximation of the likelihood and any standard optimization routine can be applied to find the maximum of this local approximation. By iterating this procedure we eventually arrive at a final parameter estimate. △ Less

Submitted 29 November, 2017; originally announced November 2017.

arXiv:1703.02419 [pdf, ps, other]

doi 10.1016/j.ymssp.2017.10.033

Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo

Authors: Thomas B. Schön, Andreas Svensson, Lawrence Murray, Fredrik Lindsten

Abstract: Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we a… ▽ More Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we are forced to use approximations. In this tutorial we will provide a self-contained introduction to one of the state-of-the-art methods---the particle Metropolis--Hastings algorithm---which has proven to offer a practical approximation. This is a Monte Carlo based method, where the particle filter is used to guide a Markov chain Monte Carlo method through the parameter space. One of the key merits of the particle Metropolis--Hastings algorithm is that it is guaranteed to converge to the "true solution" under mild assumptions, despite being based on a particle filter with only a finite number of particles. We will also provide a motivating numerical example illustrating the method using a modeling language tailored for sequential Monte Carlo methods. The intention of modeling languages of this kind is to open up the power of sophisticated Monte Carlo methods---including particle Metropolis--Hastings---to a large group of users without requiring them to know all the underlying mathematical details. △ Less

Submitted 15 December, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

Comments: Thomas B. Schön, Andreas Svensson, Lawrence Murray and Fredrik Lindsten, 2018. Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo. In Mechanical Systems and Signal Processing, Volume 104, pp. 866-883

arXiv:1702.01618 [pdf, other]

doi 10.1016/j.ymssp.2017.09.016

Learning of state-space models with highly informative observations: a tempered Sequential Monte Carlo solution

Authors: Andreas Svensson, Thomas B. Schön, Fredrik Lindsten

Abstract: Probabilistic (or Bayesian) modeling and learning offers interesting possibilities for systematic representation of uncertainty using probability theory. However, probabilistic learning often leads to computationally challenging problems. Some problems of this type that were previously intractable can now be solved on standard personal computers thanks to recent advances in Monte Carlo methods. In… ▽ More Probabilistic (or Bayesian) modeling and learning offers interesting possibilities for systematic representation of uncertainty using probability theory. However, probabilistic learning often leads to computationally challenging problems. Some problems of this type that were previously intractable can now be solved on standard personal computers thanks to recent advances in Monte Carlo methods. In particular, for learning of unknown parameters in nonlinear state-space models, methods based on the particle filter (a Monte Carlo method) have proven very useful. A notoriously challenging problem, however, still occurs when the observations in the state-space model are highly informative, i.e. when there is very little or no measurement noise present, relative to the amount of process noise. The particle filter will then struggle in estimating one of the basic components for probabilistic learning, namely the likelihood $p($data$|$parameters$)$. To this end we suggest an algorithm which initially assumes that there is substantial amount of artificial measurement noise present. The variance of this noise is sequentially decreased in an adaptive fashion such that we, in the end, recover the original problem or possibly a very close approximation of it. The main component in our algorithm is a sequential Monte Carlo (SMC) sampler, which gives our proposed method a clear resemblance to the SMC^2 method. Another natural link is also made to the ideas underlying the approximate Bayesian computation (ABC). We illustrate it with numerical examples, and in particular show promising results for a challenging Wiener-Hammerstein benchmark problem. △ Less

Submitted 13 December, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

Journal ref: Mechanical Systems and Signal Processing, Volume 104 (May 2018), Pages 915-928

arXiv:1603.05486 [pdf, other]

A flexible state space model for learning nonlinear dynamical systems

Authors: Andreas Svensson, Thomas B. Schön

Abstract: We consider a nonlinear state-space model with the state transition and observation functions expressed as basis function expansions. The coefficients in the basis function expansions are learned from data. Using a connection to Gaussian processes we also develop priors on the coefficients, for tuning the model flexibility and to prevent overfitting to data, akin to a Gaussian process state-space… ▽ More We consider a nonlinear state-space model with the state transition and observation functions expressed as basis function expansions. The coefficients in the basis function expansions are learned from data. Using a connection to Gaussian processes we also develop priors on the coefficients, for tuning the model flexibility and to prevent overfitting to data, akin to a Gaussian process state-space model. The priors can alternatively be seen as a regularization, and helps the model in generalizing the data without sacrificing the richness offered by the basis function expansion. To learn the coefficients and other unknown parameters efficiently, we tailor an algorithm using state-of-the-art sequential Monte Carlo methods, which comes with theoretical guarantees on the learning. Our approach indicates promising results when evaluated on a classical benchmark as well as real data. △ Less

Submitted 28 March, 2017; v1 submitted 17 March, 2016; originally announced March 2016.

Journal ref: Automatica 80(2017), page 189-199

arXiv:1510.00563 [pdf, other]

doi 10.1109/CAMSAP.2015.7383841

Nonlinear State Space Model Identification Using a Regularized Basis Function Expansion

Authors: Andreas Svensson, Thomas B. Schön, Arno Solin, Simo Särkkä

Abstract: This paper is concerned with black-box identification of nonlinear state space models. By using a basis function expansion within the state space model, we obtain a flexible structure. The model is identified using an expectation maximization approach, where the states and the parameters are updated iteratively in such a way that a maximum likelihood estimate is obtained. We use recent particle me… ▽ More This paper is concerned with black-box identification of nonlinear state space models. By using a basis function expansion within the state space model, we obtain a flexible structure. The model is identified using an expectation maximization approach, where the states and the parameters are updated iteratively in such a way that a maximum likelihood estimate is obtained. We use recent particle methods with sound theoretical properties to infer the states, whereas the model parameters can be updated using closed-form expressions by exploiting the fact that our model is linear in the parameters. Not to over-fit the flexible model to the data, we also propose a regularization scheme without increasing the computational burden. Importantly, this opens up for systematic use of regularization in nonlinear state space models. We conclude by evaluating our proposed approach on one simulation example and two real-data problems. △ Less

Submitted 2 October, 2015; originally announced October 2015.

Comments: Accepted to the 6th IEEE international workshop on computational advances in multi-sensor adaptive processing (CAMSAP), Cancun, Mexico, December 2015

arXiv:1506.02267 [pdf, other]

Computationally Efficient Bayesian Learning of Gaussian Process State Space Models

Authors: Andreas Svensson, Arno Solin, Simo Särkkä, Thomas B. Schön

Abstract: Gaussian processes allow for flexible specification of prior assumptions of unknown dynamics in state space models. We present a procedure for efficient Bayesian learning in Gaussian process state space models, where the representation is formed by projecting the problem onto a set of approximate eigenfunctions derived from the prior covariance structure. Learning under this family of models can b… ▽ More Gaussian processes allow for flexible specification of prior assumptions of unknown dynamics in state space models. We present a procedure for efficient Bayesian learning in Gaussian process state space models, where the representation is formed by projecting the problem onto a set of approximate eigenfunctions derived from the prior covariance structure. Learning under this family of models can be conducted using a carefully crafted particle MCMC algorithm. This scheme is computationally efficient and yet allows for a fully Bayesian treatment of the problem. Compared to conventional system identification tools or existing learning methods, we show competitive performance and reliable quantification of uncertainties in the model. △ Less

Submitted 15 April, 2016; v1 submitted 7 June, 2015; originally announced June 2015.

arXiv:1503.06058 [pdf, other]

doi 10.1016/j.ifacol.2015.12.224

Sequential Monte Carlo Methods for System Identification

Authors: Thomas B. Schön, Fredrik Lindsten, Johan Dahlin, Johan Wågberg, Christian A. Naesseth, Andreas Svensson, Liang Dai

Abstract: One of the key challenges in identifying nonlinear and possibly non-Gaussian state space models (SSMs) is the intractability of estimating the system state. Sequential Monte Carlo (SMC) methods, such as the particle filter (introduced more than two decades ago), provide numerical solutions to the nonlinear state estimation problems arising in SSMs. When combined with additional identification tech… ▽ More One of the key challenges in identifying nonlinear and possibly non-Gaussian state space models (SSMs) is the intractability of estimating the system state. Sequential Monte Carlo (SMC) methods, such as the particle filter (introduced more than two decades ago), provide numerical solutions to the nonlinear state estimation problems arising in SSMs. When combined with additional identification techniques, these algorithms provide solid solutions to the nonlinear system identification problem. We describe two general strategies for creating such combinations and discuss why SMC is a natural tool for implementing these strategies. △ Less

Submitted 10 March, 2016; v1 submitted 20 March, 2015; originally announced March 2015.

Comments: In proceedings of the 17th IFAC Symposium on System Identification (SYSID). Added cover page

arXiv:1502.03697 [pdf, other]

Nonlinear state space smoothing using the conditional particle filter

Authors: Andreas Svensson, Thomas B. Schön, Manon Kok

Abstract: To estimate the smoothing distribution in a nonlinear state space model, we apply the conditional particle filter with ancestor sampling. This gives an iterative algorithm in a Markov chain Monte Carlo fashion, with asymptotic convergence results. The computational complexity is analyzed, and our proposed algorithm is successfully applied to the challenging problem of sensor fusion between ultra-w… ▽ More To estimate the smoothing distribution in a nonlinear state space model, we apply the conditional particle filter with ancestor sampling. This gives an iterative algorithm in a Markov chain Monte Carlo fashion, with asymptotic convergence results. The computational complexity is analyzed, and our proposed algorithm is successfully applied to the challenging problem of sensor fusion between ultra-wideband and accelerometer/gyroscope measurements for indoor positioning. It appears to be a competitive alternative to existing nonlinear smoothing algorithms, in particular the forward filtering-backward simulation smoother. △ Less

Submitted 16 September, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

Comments: Accepted for the 17th IFAC Symposium on System Identification (SYSID), Beijing, China, October 2015

arXiv:1502.01908 [pdf, other]

doi 10.1109/CAMSAP.2015.7383840

Marginalizing Gaussian Process Hyperparameters using Sequential Monte Carlo

Authors: Andreas Svensson, Johan Dahlin, Thomas B. Schön

Abstract: Gaussian process regression is a popular method for non-parametric probabilistic modeling of functions. The Gaussian process prior is characterized by so-called hyperparameters, which often have a large influence on the posterior model and can be difficult to tune. This work provides a method for numerical marginalization of the hyperparameters, relying on the rigorous framework of sequential Mont… ▽ More Gaussian process regression is a popular method for non-parametric probabilistic modeling of functions. The Gaussian process prior is characterized by so-called hyperparameters, which often have a large influence on the posterior model and can be difficult to tune. This work provides a method for numerical marginalization of the hyperparameters, relying on the rigorous framework of sequential Monte Carlo. Our method is well suited for online problems, and we demonstrate its ability to handle real-world problems with several dimensions and compare it to other marginalization methods. We also conclude that our proposed method is a competitive alternative to the commonly used point estimates maximizing the likelihood, both in terms of computational load and its ability to handle multimodal posteriors. △ Less

Submitted 2 October, 2015; v1 submitted 6 February, 2015; originally announced February 2015.

Comments: Accepted to the 6th IEEE international workshop on computational advances in multi-sensor adaptive processing (CAMSAP), Cancun, Mexico, December 2015

arXiv:1409.7287 [pdf, other]

doi 10.1109/CDC.2014.7040409

Identification of jump Markov linear models using particle filters

Authors: Andreas Svensson, Thomas B. Schön, Fredrik Lindsten

Abstract: Jump Markov linear models consists of a finite number of linear state space models and a discrete variable encoding the jumps (or switches) between the different linear models. Identifying jump Markov linear models makes for a challenging problem lacking an analytical solution. We derive a new expectation maximization (EM) type algorithm that produce maximum likelihood estimates of the model param… ▽ More Jump Markov linear models consists of a finite number of linear state space models and a discrete variable encoding the jumps (or switches) between the different linear models. Identifying jump Markov linear models makes for a challenging problem lacking an analytical solution. We derive a new expectation maximization (EM) type algorithm that produce maximum likelihood estimates of the model parameters. Our development hinges upon recent progress in combining particle filters with Markov chain Monte Carlo methods in solving the nonlinear state smoothing problem inherent in the EM formulation. Key to our development is that we exploit a conditionally linear Gaussian substructure in the model, allowing for an efficient algorithm. △ Less

Submitted 25 September, 2014; originally announced September 2014.

Comments: Accepted to 53rd IEEE International Conference on Decision and Control (CDC), 2014 (Los Angeles, CA, USA)

Journal ref: Proc. of IEEE 53rd Conference on Decision and Control (CDC), pp.6504,6509, 15-17 Dec. 2014 (Los Angeles, CA, USA)

Showing 1–12 of 12 results for author: Svensson, A