-
Arcsine laws for random walks generated from random permutations with applications to genomics
Authors:
Xiao Fang,
Han Liang Gan,
Susan Holmes,
Haiyan Huang,
Erol Peköz,
Adrian Röllin,
Wenpin Tang
Abstract:
A classical result for the simple symmetric random walk with $2n$ steps is that the number of steps above the origin, the time of the last visit to the origin, and the time of the maximum height all have exactly the same distribution and converge when scaled to the arcsine law. Motivated by applications in genomics, we study the distributions of these statistics for the non-Markovian random walk generated from the ascents and descents of a uniform random permutation and a Mallows($q$) permutation and show that they have the same asymptotic distributions as for the simple random walk. We also give an unexpected conjecture, along with numerical evidence and a partial proof in special cases, for the result that the number of steps above the origin by step $2n$ for the uniform permutation generated walk has exactly the same discrete arcsine distribution as for the simple random walk, even though the other statistics for these walks have very different laws. We also give explicit error bounds to the limit theorems using Stein's method for the arcsine distribution, as well as functional central limit theorems and a strong embedding of the Mallows$(q)$ permutation which is of independent interest.
Submitted 23 January, 2020;
originally announced January 2020.
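As a rough illustration of the walk described in this abstract, the sketch below builds the $\pm 1$ walk from the ascents and descents of a uniform random permutation of length $2n+1$ and tabulates how many of its $2n$ steps lie above the origin, next to the discrete arcsine probabilities for the simple random walk. The function names, and the convention that a step counts as "above" when either endpoint is strictly positive, are assumptions made for this example and are not taken from the paper.

```python
import random
from math import comb


def permutation_walk_steps(perm):
    """Return +1 for each ascent and -1 for each descent of the permutation."""
    return [1 if b > a else -1 for a, b in zip(perm, perm[1:])]


def steps_above_origin(steps):
    """Count steps above the origin (assumed convention: a step from S_{k-1}
    to S_k counts when S_{k-1} > 0 or S_k > 0)."""
    count, pos = 0, 0
    for s in steps:
        new = pos + s
        if pos > 0 or new > 0:
            count += 1
        pos = new
    return count


def discrete_arcsine_pmf(n):
    """Discrete arcsine law for the simple walk with 2n steps:
    P(L_{2n} = 2k) = C(2k, k) * C(2n - 2k, n - k) / 4^n, k = 0..n."""
    return [comb(2 * k, k) * comb(2 * (n - k), n - k) / 4 ** n
            for k in range(n + 1)]


def simulate_permutation_walk(n, trials, seed=0):
    """Empirical distribution of (steps above origin) / 2 for the walk
    generated by a uniform random permutation of length 2n + 1."""
    rng = random.Random(seed)
    counts = [0] * (n + 1)
    for _ in range(trials):
        perm = list(range(2 * n + 1))
        rng.shuffle(perm)
        above = steps_above_origin(permutation_walk_steps(perm))
        counts[above // 2] += 1  # the count is always even for 2n steps
    return [c / trials for c in counts]


if __name__ == "__main__":
    n = 5
    empirical = simulate_permutation_walk(n, trials=20000)
    exact = discrete_arcsine_pmf(n)
    for k, (e, p) in enumerate(zip(empirical, exact)):
        print(f"2k={2 * k:2d}  simulated={e:.4f}  arcsine={p:.4f}")
```

Comparing the two printed columns for a few values of `n` gives an informal check of the conjecture stated in the abstract; it is numerical evidence only, not a proof.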
-
Learning Weighted Submanifolds with Variational Autoencoders and Riemannian Variational Autoencoders
Authors:
Nina Miolane,
Susan Holmes
Abstract:
Manifold-valued data naturally arises in medical imaging. In cognitive neuroscience, for instance, brain connectomes base the analysis of coactivation patterns between different brain regions on the analysis of the correlations of their functional Magnetic Resonance Imaging (fMRI) time series - an object thus constrained by construction to belong to the manifold of symmetric positive definite matrices. One of the challenges that naturally arises consists of finding a lower-dimensional subspace for representing such manifold-valued data. Traditional techniques, like principal component analysis, are ill-adapted to tackle non-Euclidean spaces and may fail to achieve a lower-dimensional representation of the data - thus potentially pointing to the absence of lower-dimensional representation of the data. However, these techniques are restricted in that: (i) they do not leverage the assumption that the connectomes belong to a pre-specified manifold, therefore discarding information; (ii) they can only fit a linear subspace to the data. In this paper, we are interested in variants that learn potentially highly curved submanifolds of manifold-valued data. Motivated by the brain connectomes example, we investigate a latent variable generative model, which has the added benefit of providing us with uncertainty estimates - a crucial quantity in the medical applications we are considering. While latent variable models have been proposed to learn linear and nonlinear spaces for Euclidean data, or geodesic subspaces for manifold data, no intrinsic latent variable model exists to learn nongeodesic subspaces for manifold data. This paper fills this gap and formulates a Riemannian variational autoencoder with an intrinsic generative model of manifold-valued data. We evaluate its performance on synthetic and real datasets by introducing the formalism of weighted Riemannian submanifolds.
Submitted 19 November, 2019;
originally announced November 2019.
-
Template shape estimation: correcting an asymptotic bias
Authors:
Nina Miolane,
Susan Holmes,
Xavier Pennec
Abstract:
We use tools from geometric statistics to analyze the usual estimation procedure of a template shape. This applies to shapes from landmarks, curves, surfaces, images, etc. We demonstrate the asymptotic bias of the template shape estimation using the stratified geometry of the shape space. We give a Taylor expansion of the bias with respect to a parameter $σ$ describing the measurement error on the data. We propose two bootstrap procedures that quantify the bias and correct it, if needed. They are applicable to any type of shape data. We give a rule of thumb to provide intuition on whether the bias has to be corrected. This exhibits the parameters that control the magnitude of the bias. We illustrate our results on simulated and real shape data.
Submitted 2 February, 2017; v1 submitted 6 September, 2016;
originally announced October 2016.
-
Curvature and Concentration of Hamiltonian Monte Carlo in High Dimensions
Authors:
Susan Holmes,
Simon Rubinstein-Salzedo,
Christof Seiler
Abstract:
In this article, we analyze Hamiltonian Monte Carlo (HMC) by placing it in the setting of Riemannian geometry using the Jacobi metric, so that each step corresponds to a geodesic on a suitable Riemannian manifold. We then combine the notion of curvature of a Markov chain due to Joulin and Ollivier with the classical sectional curvature from Riemannian geometry to derive error bounds for HMC in important cases, where we have positive curvature. These cases include several classical distributions such as multivariate Gaussians, and also distributions arising in the study of Bayesian image registration. The theoretical development suggests the sectional curvature as a new diagnostic tool for convergence for certain Markov chains.
Submitted 19 May, 2015; v1 submitted 3 July, 2014;
originally announced July 2014.
-
Solitary Matter Waves in Combined Linear and Nonlinear Potentials: Detection, Stability, and Dynamics
Authors:
Scott Holmes,
Mason A. Porter,
Peter Krüger,
Panayotis G. Kevrekidis
Abstract:
We study statically homogeneous Bose-Einstein condensates with spatially inhomogeneous interactions and outline an experimental realization of compensating linear and nonlinear potentials that can yield constant-density solutions. We illustrate how the presence of a step in the nonlinearity coefficient can only be revealed dynamically and consider, in particular, how to reveal it by exploiting the inhomogeneity of the sound speed with a defect-dragging experiment. We conduct computational experiments and observe the spontaneous emergence of dark solitary waves. We use effective-potential theory to perform a detailed analytical investigation of the existence and stability of solitary waves in this setting, and we corroborate these results computationally using a Bogoliubov-de Gennes linear stability analysis. We find that dark solitary waves are unstable for all step widths, whereas bright solitary waves can become stable through a symmetry-breaking bifurcation as one varies the step width. Using phase-plane analysis, we illustrate the scenarios that permit this bifurcation and explore the dynamical outcomes of the interaction between the solitary wave and the step.
Submitted 24 September, 2013; v1 submitted 8 January, 2013;
originally announced January 2013.
-
Sampling From A Manifold
Authors:
Persi Diaconis,
Susan Holmes,
Mehrdad Shahshahani
Abstract:
We develop algorithms for sampling from a probability distribution on a submanifold embedded in $\mathbb{R}^n$. Applications are given to the evaluation of algorithms in 'Topological Statistics'; to goodness of fit tests in exponential families and to Neyman's smooth test. This article is partially expository, giving an introduction to the tools of geometric measure theory.
Submitted 4 July, 2012; v1 submitted 28 June, 2012;
originally announced June 2012.
-
Analysis of casino shelf shuffling machines
Authors:
Persi Diaconis,
Jason Fulman,
Susan Holmes
Abstract:
Many casinos routinely use mechanical card shuffling machines. We were asked to evaluate a new product, a shelf shuffler. This leads to new probability, new combinatorics and to some practical advice which was adopted by the manufacturer. The interplay between theory, computing, and real-world application is developed.
Submitted 23 July, 2013; v1 submitted 14 July, 2011;
originally announced July 2011.
-
Interval graph limits
Authors:
Persi Diaconis,
Susan Holmes,
Svante Janson
Abstract:
We work out the graph limit theory for dense interval graphs. The theory developed departs from the usual description of a graph limit as a symmetric function $W(x,y)$ on the unit square, with $x$ and $y$ uniform on the interval $(0,1)$. Instead, we fix a $W$ and change the underlying distribution of the coordinates $x$ and $y$. We find choices such that our limits are continuous. Connections to random interval graphs are given, including some examples. We also show a continuity result for the chromatic number and clique number of interval graphs. Some results on uniqueness of the limit description are given for general graph limits.
Submitted 14 February, 2011;
originally announced February 2011.
-
An Exposition of Götze's Estimation of the Rate of Convergence in the Multivariate Central Limit Theorem
Authors:
Rabi Bhattacharya,
Susan Holmes
Abstract:
We provide an explanation of the main ideas underlying Götze's main result in using Stein's method. We also provide detailed derivations of various intermediate estimates. Curiously, we are led to a different dimensional dependence of the constant than that given in Götze's paper. We would like to dedicate this to Charles Stein on the occasion of his 90th birthday.
Submitted 22 March, 2010;
originally announced March 2010.
-
Threshold graph limits and random threshold graphs
Authors:
Persi Diaconis,
Susan Holmes,
Svante Janson
Abstract:
We study the limit theory of large threshold graphs and apply this to a variety of models for random threshold graphs. The results give a nice set of examples for the emerging theory of graph limits.
Submitted 17 August, 2009;
originally announced August 2009.
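For readers unfamiliar with threshold graphs, the sketch below shows one common way to generate a random one: give each vertex an independent uniform weight and join two vertices when their weights sum to more than a threshold. This particular model, the default threshold of 1, and the function names are assumptions chosen for illustration; the abstract itself only refers to "a variety of models". The helper also checks the nested-neighborhood property that characterizes threshold graphs.

```python
import random


def random_threshold_graph(n, threshold=1.0, seed=0):
    """Sample i.i.d. uniform(0, 1) vertex weights and connect i and j
    whenever weights[i] + weights[j] > threshold (illustrative model)."""
    rng = random.Random(seed)
    weights = [rng.random() for _ in range(n)]
    edges = {(i, j)
             for i in range(n)
             for j in range(i + 1, n)
             if weights[i] + weights[j] > threshold}
    return weights, edges


def neighborhoods_are_nested(n, edges):
    """Check that for every pair of vertices i, j, one of N(i) minus {j}
    and N(j) minus {i} contains the other."""
    nbrs = [set() for _ in range(n)]
    for i, j in edges:
        nbrs[i].add(j)
        nbrs[j].add(i)
    for i in range(n):
        for j in range(i + 1, n):
            a = nbrs[i] - {j}
            b = nbrs[j] - {i}
            if not (a <= b or b <= a):
                return False
    return True


if __name__ == "__main__":
    weights, edges = random_threshold_graph(8, seed=42)
    print(len(edges), "edges; nested neighborhoods:",
          neighborhoods_are_nested(8, edges))
```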