-
Predictive Bayesian selection of multistep Markov chains, applied to the detection of the hot hand and other statistical dependencies in free throws
Authors:
Joshua C. Chang
Abstract:
Consider the problem of modeling memory effects in discrete-state random walks using higher-order Markov chains. This paper explores cross validation and information criteria as proxies for a model's predictive accuracy. Our objective is to select, from data, the number of prior states of recent history upon which a trajectory is statistically dependent. Through simulations, I evaluate these crite…
▽ More
Consider the problem of modeling memory effects in discrete-state random walks using higher-order Markov chains. This paper explores cross validation and information criteria as proxies for a model's predictive accuracy. Our objective is to select, from data, the number of prior states of recent history upon which a trajectory is statistically dependent. Through simulations, I evaluate these criteria in the case where data are drawn from systems with fixed orders of history, noting trends in the relative performance of the criteria. As a real-world illustrative example of these methods, this manuscript evaluates the problem of detecting statistical dependencies in shot outcomes in free throw shooting. Over three NBA seasons analyzed, several players exhibited statistical dependencies in free throw hitting probability of various types - hot handedness, cold handedness, and error correction. For the 2013-2014 through 2015-2016 NBA seasons, I detected statistical dependencies in 23% of all player-seasons. Focusing on a single player, in two of these three seasons, LeBron James shot a better percentage after an immediate miss than otherwise. In those seasons, conditioning on the previous outcome makes for a more predictive model than treating free throw makes as independent. When extended to data from the 2016-2017 NBA season specifically for LeBron James, a model depending on the previous shot (single-step Markovian) does not clearly beat a model with independent outcomes. An error-correcting variable length model of two parameters, where James shoots a higher percentage after a missed free throw than otherwise, is more predictive than either model.
△ Less
Submitted 20 February, 2019; v1 submitted 26 June, 2017;
originally announced June 2017.
-
Determination of hysteresis in finite-state random walks using Bayesian cross validation
Authors:
Joshua C. Chang
Abstract:
Consider the problem of modeling hysteresis for finite-state random walks using higher-order Markov chains. This Letter introduces a Bayesian framework to determine, from data, the number of prior states of recent history upon which a trajectory is statistically dependent. The general recommendation is to use leave-one-out cross validation, using an easily-computable formula that is provided in cl…
▽ More
Consider the problem of modeling hysteresis for finite-state random walks using higher-order Markov chains. This Letter introduces a Bayesian framework to determine, from data, the number of prior states of recent history upon which a trajectory is statistically dependent. The general recommendation is to use leave-one-out cross validation, using an easily-computable formula that is provided in closed form. Importantly, Bayes factors using flat model priors are biased in favor of too-complex a model (more hysteresis) when a large amount of data is present and the Akaike information criterion (AIC) is biased in favor of too-sparse a model (less hysteresis) when few data are present.
△ Less
Submitted 20 July, 2018; v1 submitted 20 February, 2017;
originally announced February 2017.
-
Bayesian field theoretic reconstruction of bond potential and bond mobility in single molecule force spectroscopy
Authors:
Joshua C. Chang,
Pak-Wing Fok,
Tom Chou
Abstract:
Quantifying the forces between and within macromolecules is a necessary first step in understanding the mechanics of molecular structure, protein folding, and enzyme function and performance. In such macromolecular settings, dynamic single-molecule force spectroscopy (DFS) has been used to distort bonds. The resulting responses, in the form of rupture forces, work applied, and trajectories of disp…
▽ More
Quantifying the forces between and within macromolecules is a necessary first step in understanding the mechanics of molecular structure, protein folding, and enzyme function and performance. In such macromolecular settings, dynamic single-molecule force spectroscopy (DFS) has been used to distort bonds. The resulting responses, in the form of rupture forces, work applied, and trajectories of displacements, have been used to reconstruct bond potentials. Such approaches often rely on simple parameterizations of one-dimensional bond potentials, assumptions on equilibrium starting states, and/or large amounts of trajectory data. Parametric approaches typically fail at inferring complex-shaped bond potentials with multiple minima, while piecewise estimation may not guarantee smooth results with the appropriate behavior at large distances. Existing techniques, particularly those based on work theorems, also do not address spatial variations in the diffusivity that may arise from spatially inhomogeneous coupling to other degrees of freedom in the macromolecule, thereby presenting an incomplete picture of the overall bond dynamics. To solve these challenges, we have developed a comprehensive empirical Bayesian approach that incorporates data and regularization terms directly into a path integral. All experiemental and statistical parameters in our method are estimated empirically directly from the data. Upon testing our method on simulated data, our regularized approach requires fewer data and allows simultaneous inference of both complex bond potentials and diffusivity profiles.
△ Less
Submitted 23 February, 2015;
originally announced February 2015.
-
Regulatory inhibition of biological tissue mineralization by calcium phosphate through post-nucleation shielding by Fetuin-A
Authors:
Joshua C. Chang,
Robert M. Miura
Abstract:
In vertebrates, insufficient availability of calcium and phosphate ions in extracellular fluids leads to loss of bone density and neuronal hyper-excitability. To counteract this problem, calcium ions are present at high concentrations throughout body fluids -- at concentrations exceeding the saturation point. This condition leads to the opposite situation where unwanted mineral sedimentation may o…
▽ More
In vertebrates, insufficient availability of calcium and phosphate ions in extracellular fluids leads to loss of bone density and neuronal hyper-excitability. To counteract this problem, calcium ions are present at high concentrations throughout body fluids -- at concentrations exceeding the saturation point. This condition leads to the opposite situation where unwanted mineral sedimentation may occur. Remarkably, ectopic or out-of-place sedimentation into soft tissues is rare, in spite of the thermodynamic driving factors. This fortunate fact is due to the presence of auto-regulatory proteins that are found in abundance in bodily fluids. Yet, many important inflammatory disorders such as atherosclerosis and osteoarthritis are associated with this undesired calcification. Hence, it is important to gain an understanding of the regulatory process and the conditions under which it can go awry. In this manuscript, we adapt mean-field classical nucleation theory to the case of surface-shielding in order to study the regulation of sedimentation of calcium phosphate salts in biological tissues through the mechanism of post-nuclear shielding of nascent mineral particles by binding proteins. We develop a mathematical description of this phenomenon using a countable system of hyperbolic partial differential equations. A critical concentration of regulatory protein is identified as a function of the physical parameters that describe the system.
△ Less
Submitted 23 May, 2016; v1 submitted 11 January, 2015;
originally announced January 2015.
-
A path-integral approach to Bayesian inference for inverse problems using the semiclassical approximation
Authors:
Joshua C Chang,
Van Savage,
Tom Chou
Abstract:
We demonstrate how path integrals often used in problems of theoretical physics can be adapted to provide a machinery for performing Bayesian inference in function spaces. Such inference comes about naturally in the study of inverse problems of recovering continuous (infinite dimensional) coefficient functions from ordinary or partial differential equations (ODE, PDE), a problem which is typically…
▽ More
We demonstrate how path integrals often used in problems of theoretical physics can be adapted to provide a machinery for performing Bayesian inference in function spaces. Such inference comes about naturally in the study of inverse problems of recovering continuous (infinite dimensional) coefficient functions from ordinary or partial differential equations (ODE, PDE), a problem which is typically ill-posed. Regularization of these problems using $L^2$ function spaces (Tikhonov regularization) is equivalent to Bayesian probabilistic inference, using a Gaussian prior. The Bayesian interpretation of inverse problem regularization is useful since it allows one to quantify and characterize error and degree of precision in the solution of inverse problems, as well as examine assumptions made in solving the problem -- namely whether the subjective choice of regularization is compatible with prior knowledge. Using path-integral formalism, Bayesian inference can be explored through various perturbative techniques, such as the semiclassical approximation, which we use in this manuscript. Perturbative path-integral approaches, while offering alternatives to computational approaches like Markov-Chain-Monte-Carlo (MCMC), also provide natural starting points for MCMC methods that can be used to refine approximations.
In this manuscript, we illustrate a path-integral formulation for inverse problems and demonstrate it on an inverse problem in membrane biophysics as well as inverse problems in potential theories involving the Poisson equation.
△ Less
Submitted 22 July, 2014; v1 submitted 10 December, 2013;
originally announced December 2013.
-
Iterative graph cuts for image segmentation with a nonlinear statistical shape prior
Authors:
Joshua C. Chang,
Tom Chou
Abstract:
Shape-based regularization has proven to be a useful method for delineating objects within noisy images where one has prior knowledge of the shape of the targeted object. When a collection of possible shapes is available, the specification of a shape prior using kernel density estimation is a natural technique. Unfortunately, energy functionals arising from kernel density estimation are of a form…
▽ More
Shape-based regularization has proven to be a useful method for delineating objects within noisy images where one has prior knowledge of the shape of the targeted object. When a collection of possible shapes is available, the specification of a shape prior using kernel density estimation is a natural technique. Unfortunately, energy functionals arising from kernel density estimation are of a form that makes them impossible to directly minimize using efficient optimization algorithms such as graph cuts. Our main contribution is to show how one may recast the energy functional into a form that is minimizable iteratively and efficiently using graph cuts.
△ Less
Submitted 22 February, 2013; v1 submitted 21 August, 2012;
originally announced August 2012.