Showing 1–2 of 2 results for author: Barber, S

Search v0.5.6 released 2020-02-24

arXiv:2104.13140 [pdf, other]

math.ST

Mixture models for spherical data with applications to protein bioinformatics

Authors: Kanti V. Mardia, Stuart Barber, Philippa M. Burdett, John T. Kent, Thomas Hamelryck

Abstract: Finite mixture models are fitted to spherical data. Kent distributions are used for the components of the mixture because they allow considerable flexibility. Previous work on such mixtures has used an approximate maximum likelihood estimator for the parameters of a single component. However, the approximation causes problems when using the EM algorithm to estimate the parameters in a mixture mode… ▽ More Finite mixture models are fitted to spherical data. Kent distributions are used for the components of the mixture because they allow considerable flexibility. Previous work on such mixtures has used an approximate maximum likelihood estimator for the parameters of a single component. However, the approximation causes problems when using the EM algorithm to estimate the parameters in a mixture model. Hence the exact maximum likelihood estimator is used here for the individual components. This paper is motivated by a challenging prize problem in structural bioinformatics of how proteins fold. It is known that hydrogen bonds play a key role in the folding of a protein. We explore this hydrogen bond geometry using a data set describing bonds between two amino acids in proteins. An appropriate coordinate system to represent the hydrogen bond geometry is proposed, with each bond represented as a point on a sphere. We fit mixtures of Kent distributions to different subsets of the hydrogen bond data to gain insight into how the secondary structure elements bond together, since the distribution of hydrogen bonds depends on which secondary structure elements are involved. △ Less

Submitted 27 April, 2021; originally announced April 2021.

Comments: 17 pages, 9 figures

MSC Class: 62H11 (Primary) 62P10; 92-08 (Secondary)
arXiv:1311.2038 [pdf, other]

math.ST

The Rate of Convergence for Approximate Bayesian Computation

Authors: Stuart Barber, Jochen Voss, Mark Webster

Abstract: Approximate Bayesian Computation (ABC) is a popular computational method for likelihood-free Bayesian inference. The term "likelihood-free" refers to problems where the likelihood is intractable to compute or estimate directly, but where it is possible to generate simulated data $X$ relatively easily given a candidate set of parameters $θ$ simulated from a prior distribution. Parameters which gene… ▽ More Approximate Bayesian Computation (ABC) is a popular computational method for likelihood-free Bayesian inference. The term "likelihood-free" refers to problems where the likelihood is intractable to compute or estimate directly, but where it is possible to generate simulated data $X$ relatively easily given a candidate set of parameters $θ$ simulated from a prior distribution. Parameters which generate simulated data within some tolerance $δ$ of the observed data $x^*$ are regarded as plausible, and a collection of such $θ$ is used to estimate the posterior distribution $θ\,|\,X\!=\!x^*$. Suitable choice of $δ$ is vital for ABC methods to return good approximations to $θ$ in reasonable computational time. While ABC methods are widely used in practice, particularly in population genetics, study of the mathematical properties of ABC estimators is still in its infancy. We prove that ABC estimates converge to the exact solution under very weak assumptions and, under slightly stronger assumptions, quantify the rate of this convergence. Our results can be used to guide the choice of the tolerance parameter $δ$. △ Less

Submitted 18 July, 2014; v1 submitted 8 November, 2013; originally announced November 2013.

Comments: 25 pages, 3 figures; address the distinction between fixed number of proposals and fixed number of accepted samples more explicitly

MSC Class: 62F12; 62F15; 65C05

Search v0.5.6 released 2020-02-24