Search | arXiv e-print repository

Bayesian Inference for Non-Synchronously Observed Diffusions

Authors: Ajay Jasra, Kengo Kamatani, Amin Wu

Abstract: We consider the problem of Bayesian inference for bi-variate data observed in time but with observation times which occur non-synchronously. In particular, this occurs in a wide variety of applications in finance, such as high-frequency trading or crude oil futures trading. We adopt a diffusion model for the data and formulate a Bayesian model with priors on unknown parameters along with a latent… ▽ More We consider the problem of Bayesian inference for bi-variate data observed in time but with observation times which occur non-synchronously. In particular, this occurs in a wide variety of applications in finance, such as high-frequency trading or crude oil futures trading. We adopt a diffusion model for the data and formulate a Bayesian model with priors on unknown parameters along with a latent representation for the the so-called missing data. We then consider computational methodology to fit the model using Markov chain Monte Carlo (MCMC). We have to resort to time-discretization methods as the complete data likelihood is intractable and this can cause considerable issues for MCMC when the data are observed in low frequencies. In a high frequency observation frequencies we present a simple particle MCMC method based on an Euler--Maruyama time discretization, which can be enhanced using multilevel Monte Carlo (MLMC). In the low frequency observation regime we introduce a novel bridging representation of the posterior in continuous time to deal with the issues of MCMC in this case. This representation is discretized and fitted using MCMC and MLMC. We apply our methodology to real and simulated data to establish the efficacy of our methodology. △ Less

Submitted 1 March, 2025; originally announced March 2025.

arXiv:2408.03682 [pdf, other]

Automated Techniques for Efficient Sampling of Piecewise-Deterministic Markov Processes

Authors: Charly Andral, Kengo Kamatani

Abstract: Piecewise deterministic Markov processes (PDMPs) are a class of continuous-time Markov processes that were recently used to develop a new class of Markov chain Monte Carlo algorithms. However, the implementation of the processes is challenging due to the continuous-time aspect and the necessity of integrating the rate function. Recently, Corbella, Spencer, and Roberts (2022) proposed a new algorit… ▽ More Piecewise deterministic Markov processes (PDMPs) are a class of continuous-time Markov processes that were recently used to develop a new class of Markov chain Monte Carlo algorithms. However, the implementation of the processes is challenging due to the continuous-time aspect and the necessity of integrating the rate function. Recently, Corbella, Spencer, and Roberts (2022) proposed a new algorithm to automate the implementation of the Zig-Zag sampler. However, the efficiency of the algorithm highly depends on a hyperparameter ($t_{\text{max}}$) that is fixed all along the run of the algorithm and needs preliminary runs to tune. In this work, we relax this assumption and propose a new variant of their algorithm that let this parameter change over time and automatically adapt to the target distribution. We also replace the Brent optimization algorithm by a grid-based method to compute the upper bound of the rate function. This method is more robust to the regularity of the function and gives a tighter upper bound while being quicker to compute. We also extend the algorithm to other PDMPs and provide a Python implementation of the algorithm based on JAX. △ Less

Submitted 7 August, 2024; originally announced August 2024.

arXiv:2310.06533 [pdf, other]

Multilevel Monte Carlo for a class of Partially Observed Processes in Neuroscience

Authors: Mohamed Maama, Ajay Jasra, Kengo Kamatani

Abstract: In this paper we consider Bayesian parameter inference associated to a class of partially observed stochastic differential equations (SDE) driven by jump processes. Such type of models can be routinely found in applications, of which we focus upon the case of neuroscience. The data are assumed to be observed regularly in time and driven by the SDE model with unknown parameters. In practice the SDE… ▽ More In this paper we consider Bayesian parameter inference associated to a class of partially observed stochastic differential equations (SDE) driven by jump processes. Such type of models can be routinely found in applications, of which we focus upon the case of neuroscience. The data are assumed to be observed regularly in time and driven by the SDE model with unknown parameters. In practice the SDE may not have an analytically tractable solution and this leads naturally to a time-discretization. We adapt the multilevel Markov chain Monte Carlo method of [11], which works with a hierarchy of time discretizations and show empirically and theoretically that this is preferable to using one single time discretization. The improvement is in terms of the computational cost needed to obtain a pre-specified numerical error. Our approach is illustrated on models that are found in neuroscience. △ Less

Submitted 30 November, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2309.02998 [pdf, other]

Multilevel Particle Filters for a Class of Partially Observed Piecewise Deterministic Markov Processes

Authors: Ajay Jasra, Kengo Kamatani, Mohamed Maama

Abstract: In this paper we consider the filtering of a class of partially observed piecewise deterministic Markov processes (PDMPs). In particular, we assume that an ordinary differential equation (ODE) drives the deterministic element and can only be solved numerically via a time discretization. We develop, based upon the approach in [20], a new particle and multilevel particle filter (MLPF) in order to ap… ▽ More In this paper we consider the filtering of a class of partially observed piecewise deterministic Markov processes (PDMPs). In particular, we assume that an ordinary differential equation (ODE) drives the deterministic element and can only be solved numerically via a time discretization. We develop, based upon the approach in [20], a new particle and multilevel particle filter (MLPF) in order to approximate the filter associated to the discretized ODE. We provide a bound on the mean square error associated to the MLPF which provides guidance on setting the simulation parameter of that algorithm and implies that significant computational gains can be obtained versus using a particle filter. Our theoretical claims are confirmed in several numerical examples. △ Less

Submitted 6 September, 2023; originally announced September 2023.

arXiv:2305.00694 [pdf, other]

Scaling of Piecewise Deterministic Monte Carlo for Anisotropic Targets

Authors: Joris Bierkens, Kengo Kamatani, Gareth O. Roberts

Abstract: Piecewise deterministic Markov processes (PDMPs) are a type of continuous-time Markov process that combine deterministic flows with jumps. Recently, PDMPs have garnered attention within the Monte Carlo community as a potential alternative to traditional Markov chain Monte Carlo (MCMC) methods. The Zig-Zag sampler and the Bouncy Particle Sampler are commonly used examples of the PDMP methodology wh… ▽ More Piecewise deterministic Markov processes (PDMPs) are a type of continuous-time Markov process that combine deterministic flows with jumps. Recently, PDMPs have garnered attention within the Monte Carlo community as a potential alternative to traditional Markov chain Monte Carlo (MCMC) methods. The Zig-Zag sampler and the Bouncy Particle Sampler are commonly used examples of the PDMP methodology which have also yielded impressive theoretical properties, but little is known about their robustness to extreme dependence or anisotropy of the target density. It turns out that PDMPs may suffer from poor mixing due to anisotropy and this paper investigates this effect in detail in the stylised but important Gaussian case. To this end, we employ a multi-scale analysis framework in this paper. Our results show that when the Gaussian target distribution has two scales, of order $1$ and $ε$, the computational cost of the Bouncy Particle Sampler is of order $ε^{-1}$, and the computational cost of the Zig-Zag sampler is $ε^{-2}$. In comparison, the cost of the traditional MCMC methods such as RWM is of order $ε^{-2}$, at least when the dimensionality of the small component is more than $1$. Therefore, there is a robustness advantage to using PDMPs in this context. △ Less

Submitted 22 October, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

Comments: 28 pages, 27 figures, supplementary materials included as ancillary file, to appear in Bernoulli Journal

arXiv:2111.06148 [pdf, other]

Haar-Weave-Metropolis kernel

Authors: Kengo Kamatani, Xiaolin Song

Abstract: Recently, many Markov chain Monte Carlo methods have been developed with deterministic reversible transform proposals inspired by the Hamiltonian Monte Carlo method. The deterministic transform is relatively easy to reconcile with the local information (gradient etc.) of the target distribution. However, as the ergodic theory suggests, these deterministic proposal methods seem to be incompatible w… ▽ More Recently, many Markov chain Monte Carlo methods have been developed with deterministic reversible transform proposals inspired by the Hamiltonian Monte Carlo method. The deterministic transform is relatively easy to reconcile with the local information (gradient etc.) of the target distribution. However, as the ergodic theory suggests, these deterministic proposal methods seem to be incompatible with robustness and lead to poor convergence, especially in the case of target distributions with heavy tails. On the other hand, the Markov kernel using the Haar measure is relatively robust since it learns global information about the target distribution introducing global parameters. However, it requires a density preserving condition, and many deterministic proposals break this condition. In this paper, we carefully select deterministic transforms that preserve the structure and create a Markov kernel, the Weave-Metropolis kernel, using the deterministic transforms. By combining with the Haar measure, we also introduce the Haar-Weave-Metropolis kernel. In this way, the Markov kernel can employ the local information of the target distribution using the deterministic proposal, and thanks to the Haar measure, it can employ the global information of the target distribution. Finally, we show through numerical experiments that the performance of the proposed method is superior to other methods in terms of effective sample size and mean square jump distance per second. △ Less

Submitted 11 November, 2021; originally announced November 2021.

Comments: 24 pages, 3 figures

MSC Class: 62F15; 65C05;

arXiv:2008.02906 [pdf, other]

MCMC Algorithms for Posteriors on Matrix Spaces

Authors: Alexandros Beskos, Kengo Kamatani

Abstract: We study Markov chain Monte Carlo (MCMC) algorithms for target distributions defined on matrix spaces. Such an important sampling problem has yet to be analytically explored. We carry out a major step in covering this gap by developing the proper theoretical framework that allows for the identification of ergodicity properties of typical MCMC algorithms, relevant in such a context. Beyond the stan… ▽ More We study Markov chain Monte Carlo (MCMC) algorithms for target distributions defined on matrix spaces. Such an important sampling problem has yet to be analytically explored. We carry out a major step in covering this gap by developing the proper theoretical framework that allows for the identification of ergodicity properties of typical MCMC algorithms, relevant in such a context. Beyond the standard Random-Walk Metropolis (RWM) and preconditioned Crank--Nicolson (pCN), a contribution of this paper in the development of a novel algorithm, termed the `Mixed' pCN (MpCN). RWM and pCN are shown not to be geometrically ergodic for an important class of matrix distributions with heavy tails. In contrast, MpCN is robust across targets with different tail behaviour and has very good empirical performance within the class of heavy-tailed distributions. Geometric ergodicity for MpCN is not fully proven in this work, as some remaining drift conditions are quite challenging to obtain owing to the complexity of the state space. We do, however, make a lot of progress towards a proof, and show in detail the last steps left for future work. We illustrate the computational performance of the various algorithms through numerical applications, including calibration on real data of a challenging model arising in financial statistics. △ Less

Submitted 6 November, 2021; v1 submitted 6 August, 2020; originally announced August 2020.

Comments: 45 pages, 18 figures

MSC Class: 65C05; 60H35; 60J76

arXiv:2006.13777 [pdf, other]

The Boomerang Sampler

Authors: Joris Bierkens, Sebastiano Grazzi, Kengo Kamatani, Gareth Roberts

Abstract: This paper introduces the Boomerang Sampler as a novel class of continuous-time non-reversible Markov chain Monte Carlo algorithms. The methodology begins by representing the target density as a density, $e^{-U}$, with respect to a prescribed (usually) Gaussian measure and constructs a continuous trajectory consisting of a piecewise elliptical path. The method moves from one elliptical orbit to an… ▽ More This paper introduces the Boomerang Sampler as a novel class of continuous-time non-reversible Markov chain Monte Carlo algorithms. The methodology begins by representing the target density as a density, $e^{-U}$, with respect to a prescribed (usually) Gaussian measure and constructs a continuous trajectory consisting of a piecewise elliptical path. The method moves from one elliptical orbit to another according to a rate function which can be written in terms of $U$. We demonstrate that the method is easy to implement and demonstrate empirically that it can out-perform existing benchmark piecewise deterministic Markov processes such as the bouncy particle sampler and the Zig-Zag. In the Bayesian statistics context, these competitor algorithms are of substantial interest in the large data context due to the fact that they can adopt data subsampling techniques which are exact (ie induce no error in the stationary distribution). We demonstrate theoretically and empirically that we can also construct a control-variate subsampling boomerang sampler which is also exact, and which possesses remarkable scaling properties in the large data limit. We furthermore illustrate a factorised version on the simulation of diffusion bridges. △ Less

Submitted 11 August, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

Comments: Accepted for publication in the proceedings of ICML 2020. Code available at https://github.com/jbierkens/ICML-boomerang

MSC Class: 68W20; 60J25

arXiv:2005.05584 [pdf, other]

Non-reversible guided Metropolis kernel

Authors: Kengo Kamatani, Xiaolin Song

Abstract: We construct a class of non-reversible Metropolis kernels as a multivariate extension of the guided-walk kernel proposed by Gustafson 1998. The main idea of our method is to introduce a projection that maps a state space to a totally ordered group. By using Haar measure, we construct a novel Markov kernel termed Haar-mixture kernel, which is of interest in its own right. This is achieved by induci… ▽ More We construct a class of non-reversible Metropolis kernels as a multivariate extension of the guided-walk kernel proposed by Gustafson 1998. The main idea of our method is to introduce a projection that maps a state space to a totally ordered group. By using Haar measure, we construct a novel Markov kernel termed Haar-mixture kernel, which is of interest in its own right. This is achieved by inducing a topological structure to the totally ordered group. Our proposed method, the Delta-guided Metropolis--Haar kernel, is constructed by using the Haar-mixture kernel as a proposal kernel. The proposed non-reversible kernel is at least 10 times better than the random-walk Metropolis kernel and Hamiltonian Monte Carlo kernel for the logistic regression and a discretely observed stochastic process in terms of effective sample size per second. △ Less

Submitted 12 March, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

Comments: 27 pages, 5 figures

arXiv:1704.00117 [pdf, other]

A Multi-Index Markov Chain Monte Carlo Method

Authors: Ajay Jasra, Kengo Kamatani, Kody Law, Yan Zhou

Abstract: In this article we consider computing expectations w.r.t.~probability laws associated to a certain class of stochastic systems. In order to achieve such a task, one must not only resort to numerical approximation of the expectation, but also to a biased discretization of the associated probability. We are concerned with the situation for which the discretization is required in multiple dimensions,… ▽ More In this article we consider computing expectations w.r.t.~probability laws associated to a certain class of stochastic systems. In order to achieve such a task, one must not only resort to numerical approximation of the expectation, but also to a biased discretization of the associated probability. We are concerned with the situation for which the discretization is required in multiple dimensions, for instance in space and time. In such contexts, it is known that the multi-index Monte Carlo (MIMC) method can improve upon i.i.d.~sampling from the most accurate approximation of the probability law. Indeed by a non-trivial modification of the multilevel Monte Carlo (MLMC) method and it can reduce the work to obtain a given level of error, relative to the afore mentioned i.i.d.~sampling and relative even to MLMC. In this article we consider the case when such probability laws are too complex to sampled independently. We develop a modification of the MIMC method which allows one to use standard Markov chain Monte Carlo (MCMC) algorithms to replace independent and coupled sampling, in certain contexts. We prove a variance theorem which shows that using our MIMCMC method is preferable, in the sense above, to i.i.d.~sampling from the most accurate approximation, under assumptions. The method is numerically illustrated on a problem associated to a stochastic partial differential equation (SPDE). △ Less

Submitted 26 October, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

arXiv:1701.05892 [pdf, other]

Bayesian Static Parameter Estimation for Partially Observed Diffusions via Multilevel Monte Carlo

Authors: Ajay Jasra, Kengo Kamatani, Kody J. H. Law, Yan Zhou

Abstract: In this article we consider static Bayesian parameter estimation for partially observed diffusions that are discretely observed. We work under the assumption that one must resort to discretizing the underlying diffusion process, for instance using the Euler-Maruyama method. Given this assumption, we show how one can use Markov chain Monte Carlo (MCMC) and particularly particle MCMC [Andrieu, C., D… ▽ More In this article we consider static Bayesian parameter estimation for partially observed diffusions that are discretely observed. We work under the assumption that one must resort to discretizing the underlying diffusion process, for instance using the Euler-Maruyama method. Given this assumption, we show how one can use Markov chain Monte Carlo (MCMC) and particularly particle MCMC [Andrieu, C., Doucet, A. and Holenstein, R. (2010). Particle Markov chain Monte Carlo methods (with discussion). J. R. Statist. Soc. Ser. B, 72, 269--342] to implement a new approximation of the multilevel (ML) Monte Carlo (MC) collapsing sum identity. Our approach comprises constructing an approximate coupling of the posterior density of the joint distribution over parameter and hidden variables at two different discretization levels and then correcting by an importance sampling method. The variance of the weights are independent of the length of the observed data set. The utility of such a method is that, for a prescribed level of mean square error, the cost of this MLMC method is provably less than i.i.d. sampling from the posterior associated to the most precise discretization. However the method here comprises using only known and efficient simulation methodologies. The theoretical results are illustrated by inference of the parameters of two prototypical processes given noisy partial observations of the process: the first is an Ornstein Uhlenbeck process and the second is a more general Langevin equation. △ Less

Submitted 20 January, 2017; originally announced January 2017.

arXiv:1605.04963 [pdf, other]

Multilevel Particle Filters: Normalizing Constant Estimation

Authors: Ajay Jasra, Kengo Kamatani, Prince Prepah Osei, Yan Zhou

Abstract: In this article we introduce two new estimates of the normalizing constant (or marginal likelihood) for partially observed diffusion (POD) processes, with discrete observations. One estimate is biased but non-negative and the other is unbiased but not almost surely non-negative. Our method uses the multilevel particle filter of Jasra et al (2015). We show that, under assumptions, for Euler discret… ▽ More In this article we introduce two new estimates of the normalizing constant (or marginal likelihood) for partially observed diffusion (POD) processes, with discrete observations. One estimate is biased but non-negative and the other is unbiased but not almost surely non-negative. Our method uses the multilevel particle filter of Jasra et al (2015). We show that, under assumptions, for Euler discretized PODs and a given $\varepsilon>0$. in order to obtain a mean square error (MSE) of $\mathcal{O}(\varepsilon^2)$ one requires a work of $\mathcal{O}(\varepsilon^{-2.5})$ for our new estimates versus a standard particle filter that requires a work of $\mathcal{O}(\varepsilon^{-3})$. Our theoretical results are supported by numerical simulations. △ Less

Submitted 16 May, 2016; originally announced May 2016.

Comments: arXiv admin note: substantial text overlap with arXiv:1510.04977

arXiv:1602.02889 [pdf, other]

Ergodicity of Markov chain Monte Carlo with reversible proposal

Authors: Kengo Kamatani

Abstract: We describe ergodic properties of some Metropolis-Hastings (MH) algorithms for heavy-tailed target distributions. The analysis usually falls into sub-geometric ergodicity framework but we prove that the mixed preconditioned Crank-Nicolson (MpCN) algorithm has geometric ergodicity even for heavy-tailed target distributions. This useful property comes from the fact that the MpCN algorithm becomes a… ▽ More We describe ergodic properties of some Metropolis-Hastings (MH) algorithms for heavy-tailed target distributions. The analysis usually falls into sub-geometric ergodicity framework but we prove that the mixed preconditioned Crank-Nicolson (MpCN) algorithm has geometric ergodicity even for heavy-tailed target distributions. This useful property comes from the fact that the MpCN algorithm becomes a random-walk Metropolis algorithm under suitable transformation. △ Less

Submitted 9 February, 2016; originally announced February 2016.

Comments: 14 pages

MSC Class: 65C05; 65C40; 60J05

arXiv:1510.04977 [pdf, other]

Multilevel particle filter

Authors: Ajay Jasra, Kengo Kamatani, Kody J. H. Law, Yan Zhou

Abstract: In this paper the filtering of partially observed diffusions, with discrete-time observations, is considered. It is assumed that only biased approximations of the diffusion can be obtained, for choice of an accuracy parameter indexed by $l$. A multilevel estimator is proposed, consisting of a telescopic sum of increment estimators associated to the successive levels. The work associated to… ▽ More In this paper the filtering of partially observed diffusions, with discrete-time observations, is considered. It is assumed that only biased approximations of the diffusion can be obtained, for choice of an accuracy parameter indexed by $l$. A multilevel estimator is proposed, consisting of a telescopic sum of increment estimators associated to the successive levels. The work associated to $\mathcal{O}(\varepsilon^2)$ mean-square error between the multilevel estimator and average with respect to the filtering distribution is shown to scale optimally, for example as $\mathcal{O}(\varepsilon^{-2})$ for optimal rates of convergence of the underlying diffusion approximation. The method is illustrated on some toy examples as well as estimation of interest rate based on real S&P 500 stock price data. △ Less

Submitted 16 October, 2015; originally announced October 2015.

arXiv:1412.6231 [pdf, other]

Efficient strategy for the Markov chain Monte Carlo in high-dimension with heavy-tailed target probability distribution

Authors: Kengo Kamatani

Abstract: The purpose of this paper is to introduce a new Markov chain Monte Carlo method and exhibit its efficiency by simulation and high-dimensional asymptotic theory. Key fact is that our algorithm has a reversible proposal transition kernel, which is designed to have a heavy-tailed invariant probability distribution. The high-dimensional asymptotic theory is studied for a class of heavy-tailed target p… ▽ More The purpose of this paper is to introduce a new Markov chain Monte Carlo method and exhibit its efficiency by simulation and high-dimensional asymptotic theory. Key fact is that our algorithm has a reversible proposal transition kernel, which is designed to have a heavy-tailed invariant probability distribution. The high-dimensional asymptotic theory is studied for a class of heavy-tailed target probability distribution. As the number of dimension of the state space goes to infinity, we will show that our algorithm has a much better convergence rate than that of the preconditioned Crank Nicolson (pCN) algorithm and the random-walk Metropolis (RWM) algorithm. We also show that our algorithm is at least as good as the pCN algorithm and better than the RWM algorithm for light-tailed target probability distribution. △ Less

Submitted 19 December, 2014; originally announced December 2014.

Comments: 30pages, 17 figures

arXiv:1412.3501 [pdf, other]

A Stable Particle Filter in High-Dimensions

Authors: Alex Beskos, Dan Crisan, Ajay Jasra, Kengo Kamatani, Yan Zhou

Abstract: We consider the numerical approximation of the filtering problem in high dimensions, that is, when the hidden state lies in $\mathbb{R}^d$ with $d$ large. For low dimensional problems, one of the most popular numerical procedures for consistent inference is the class of approximations termed particle filters or sequential Monte Carlo methods. However, in high dimensions, standard particle filters… ▽ More We consider the numerical approximation of the filtering problem in high dimensions, that is, when the hidden state lies in $\mathbb{R}^d$ with $d$ large. For low dimensional problems, one of the most popular numerical procedures for consistent inference is the class of approximations termed particle filters or sequential Monte Carlo methods. However, in high dimensions, standard particle filters (e.g. the bootstrap particle filter) can have a cost that is exponential in $d$ for the algorithm to be stable in an appropriate sense. We develop a new particle filter, called the \emph{space-time particle filter}, for a specific family of state-space models in discrete time. This new class of particle filters provide consistent Monte Carlo estimates for any fixed $d$, as do standard particle filters. Moreover, we expect that the state-space particle filter will scale much better with $d$ than the standard filter. We illustrate this analytically for a model of a simple i.i.d. structure and one of a Markovian structure in the $d$-dimensional space-direction, when we show that the algorithm exhibits certain stability properties as $d$ increases at a cost $\mathcal{O}(nNd^2)$, where $n$ is the time parameter and $N$ is the number of Monte Carlo samples, that are fixed and independent of $d$. Similar results are expected to hold, under a more general structure than the i.i.d.~one. independently of the dimension. Our theoretical results are also supported by numerical simulations on practical models of complex structures. The results suggest that it is indeed possible to tackle some high dimensional filtering problems using the space-time particle filter that standard particle filters cannot handle. △ Less

Submitted 10 December, 2014; originally announced December 2014.

arXiv:1406.5392 [pdf, ps, other]

Rate optimality of Random walk Metropolis algorithm in high-dimension with heavy-tailed target distribution

Authors: Kengo Kamatani

Abstract: The choice of the increment distribution is crucial for the random-walk Metropolis-Hastings (RWM) algorithm. In this paper we study the optimal choice in high-dimension setting among all possible increment distributions. The conclusion is rather counter intuitive, but the optimal rate of convergence is attained by the usual choice, the normal distribution as the increment distribution. In particul… ▽ More The choice of the increment distribution is crucial for the random-walk Metropolis-Hastings (RWM) algorithm. In this paper we study the optimal choice in high-dimension setting among all possible increment distributions. The conclusion is rather counter intuitive, but the optimal rate of convergence is attained by the usual choice, the normal distribution as the increment distribution. In particular, no heavy-tailed increment distribution can improve the rate. △ Less

Submitted 20 May, 2016; v1 submitted 20 June, 2014; originally announced June 2014.

Comments: 12pages

arXiv:1108.2477 [pdf, other]

doi 10.1051/ps/2014004

Local degeneracy of Markov chain Monte Carlo methods

Authors: Kengo Kamatani

Abstract: We study asymptotic behavior of Monte Carlo method. Local consistency is one of an ideal property of Monte Carlo method. However, it may fail to hold local consistency for several reason. In fact, in practice, it is more important to study such a non-ideal behavior. We call local degeneracy for one of a non-ideal behavior of Monte Carlo methods. We show some equivalent conditions for local degener… ▽ More We study asymptotic behavior of Monte Carlo method. Local consistency is one of an ideal property of Monte Carlo method. However, it may fail to hold local consistency for several reason. In fact, in practice, it is more important to study such a non-ideal behavior. We call local degeneracy for one of a non-ideal behavior of Monte Carlo methods. We show some equivalent conditions for local degeneracy. As an application we study a Gibbs sampler (data augmentation) for cumulative logit model with or without marginal augmentation. It is well known that natural Gibbs sampler does not work well for this model. In a sense of local consistency and degeneracy, marginal augmentation is shown to improve the asymptotic property. However, when the number of categories is large, both methods are not locally consistent. △ Less

Submitted 11 January, 2012; v1 submitted 6 August, 2011; originally announced August 2011.

Comments: 30 pages, 3 figures

arXiv:1103.5679 [pdf, other]

Weak consistency of Markov chain Monte Carlo methods

Authors: Kengo Kamatani

Abstract: Markov chain Monte Calro methods (MCMC) are commonly used in Bayesian statistics. In the last twenty years, many results have been established for the calculation of the exact convergence rate of MCMC methods. We introduce another rate of convergence for MCMC methods by approximation techniques. This rate can be obtained by the convergence of the Markov chain to a diffusion process. We apply it to… ▽ More Markov chain Monte Calro methods (MCMC) are commonly used in Bayesian statistics. In the last twenty years, many results have been established for the calculation of the exact convergence rate of MCMC methods. We introduce another rate of convergence for MCMC methods by approximation techniques. This rate can be obtained by the convergence of the Markov chain to a diffusion process. We apply it to a simple mixture model and obtain its convergence rate. Numerical simulations are performed to illustrate the effect of the rate. △ Less

Submitted 25 September, 2013; v1 submitted 29 March, 2011; originally announced March 2011.

Comments: 14 pages

Journal ref: Bulletin of Informatics and Cybernetics, 45 (2013) 103-123

arXiv:1012.0996 [pdf, other]

doi 10.1007/s10463-013-0403-3

Local Consistency of Markov Chain Monte Carlo Methods

Authors: Kengo Kamatani

Abstract: In this paper, we introduce the notion of efficiency (consistency) and examine some asymptotic properties of Markov chain Monte Carlo methods. We apply these results to the data augmentation (DA) procedure for independent and identically distributed observations. More precisely, we show that if both the sample size and the running time of the DA procedure tend to infinity the empirical distributio… ▽ More In this paper, we introduce the notion of efficiency (consistency) and examine some asymptotic properties of Markov chain Monte Carlo methods. We apply these results to the data augmentation (DA) procedure for independent and identically distributed observations. More precisely, we show that if both the sample size and the running time of the DA procedure tend to infinity the empirical distribution of the DA procedure tends to the posterior distribution. This is a local property of the DA procedure, which may be, in some cases, more helpful than the global properties to describe its behavior. The advantages of using the local properties are the simplicity and the generality of the results. The local properties provide useful insight into the problem of how to construct efficient algorithms. △ Less

Submitted 25 September, 2013; v1 submitted 5 December, 2010; originally announced December 2010.

Comments: 12 pages

Journal ref: Ann. Inst. Statist. Math. 66(1) (2014) 63-74

Showing 1–20 of 20 results for author: Kamatani, K