arXiv:2012.04245 [pdf, ps, other]

Efficient Numerical Algorithms for the Generalized Langevin Equation

Authors: Benedict Leimkuhler, Matthias Sachs

Abstract: We study the design and implementation of numerical methods to solve the generalized Langevin equation (GLE), focusing on canonical sampling properties of numerical integrators. For this purpose, we cast the GLE in an extended phase space formulation and derive a family of splitting methods which generalize existing Langevin dynamics integration methods. We show exponential convergence in law and the validity of a central limit theorem for the Markov chains obtained via these integration methods, and we show that the dynamics of a suggested integration scheme is consistent with asymptotic limits of the exact dynamics and can reproduce (in the short memory limit) a superconvergence property for the analogous splitting of underdamped Langevin dynamics. We then apply our proposed integration method to several model systems, including a Bayesian inference problem. We demonstrate in numerical experiments that our method outperforms other proposed GLE integration schemes in terms of the accuracy of sampling. Moreover, using a parameterization of the memory kernel in the GLE as proposed by Ceriotti et al. [9], our experiments indicate that the obtained GLE-based sampling scheme outperforms state-of-the-art sampling schemes based on underdamped Langevin dynamics in terms of robustness and efficiency.

Submitted 8 December, 2020; originally announced December 2020.

MSC Class: 65C30; 65C40; 82M37

arXiv:2010.05220 [pdf, other]

Nonparametric bounds for causal effects in imperfect randomized experiments

Authors: Erin E. Gabriel, Arvid Sjölander, Michael C. Sachs

Abstract: Nonignorable missingness and noncompliance can occur even in well-designed randomized experiments, making the intervention effect that the experiment was designed to estimate nonidentifiable. Nonparametric causal bounds provide a way to narrow the range of possible values for a nonidentifiable causal effect with minimal assumptions. We derive novel bounds for the causal risk difference for a binary outcome and intervention in randomized experiments with nonignorable missingness caused by a variety of mechanisms and with or without noncompliance. We illustrate the use of the proposed bounds in our motivating data example of the effect of peanut consumption on the development of peanut allergies in infants.

Submitted 11 October, 2020; originally announced October 2020.

Comments: 35 pages, 5 figures, includes supplementary materials

arXiv:2008.07843 [pdf, other]

Non-reversible Markov chain Monte Carlo for sampling of districting maps

Authors: Gregory Herschlag, Jonathan C. Mattingly, Matthias Sachs, Evan Wyse

Abstract: Evaluating the degree of partisan districting (gerrymandering) in a statistical framework typically requires an ensemble of districting plans which are drawn from a prescribed probability distribution that adheres to realistic and non-partisan criteria. In this article we introduce novel non-reversible Markov chain Monte Carlo (MCMC) methods for the sampling of such districting plans which have improved mixing properties in comparison to previously used (reversible) MCMC algorithms. In doing so we extend the current framework for construction of non-reversible Markov chains on discrete sampling spaces by considering a generalization of skew detailed balance. We provide a detailed description of the proposed algorithms and evaluate their performance in numerical experiments.

Submitted 18 August, 2020; originally announced August 2020.

Comments: 38 pages

MSC Class: 60J10; 60J20; 62P99

ACM Class: G.3; G.2

arXiv:2004.04254 [pdf, other]

Posterior computation with the Gibbs zig-zag sampler

Authors: Matthias Sachs, Deborshee Sen, Jianfeng Lu, David Dunson

Abstract: An intriguing new class of piecewise deterministic Markov processes (PDMPs) has recently been proposed as an alternative to Markov chain Monte Carlo (MCMC). In order to facilitate the application to a larger class of problems, we propose a new class of PDMPs termed Gibbs zig-zag samplers, which allow parameters to be updated in blocks with a zig-zag sampler applied to certain parameters and traditional MCMC-style updates to others. We demonstrate the flexibility of this framework on posterior sampling for logistic models with shrinkage priors for high-dimensional regression and random effects and provide conditions for geometric ergodicity and the validity of a central limit theorem.

Submitted 22 May, 2022; v1 submitted 8 April, 2020; originally announced April 2020.

arXiv:2002.10519 [pdf, other]

doi 10.1080/01621459.2020.1832502

Causal bounds for outcome-dependent sampling in observational studies

Authors: Erin E. Gabriel, Michael C. Sachs, Arvid Sjölander

Abstract: Outcome-dependent sampling designs are common in many different scientific fields including epidemiology, ecology, and economics. As with all observational studies, such designs often suffer from unmeasured confounding, which generally precludes the nonparametric identification of causal effects. Nonparametric bounds can provide a way to narrow the range of possible values for a nonidentifiable causal effect without making additional untestable assumptions. The nonparametric bounds literature has almost exclusively focused on settings with random sampling, and the bounds have often been derived with a particular linear programming method. We derive novel bounds for the causal risk difference, often referred to as the average treatment effect, in six settings with outcome-dependent sampling and unmeasured confounding for a binary outcome and exposure. Our derivations of the bounds illustrate two approaches that may be applicable in other settings where the bounding problem cannot be directly stated as a system of linear constraints. We illustrate our derived bounds in a real data example involving the effect of vitamin D concentration on mortality.

Submitted 11 October, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

Comments: 36 pages, 3 figures. Updated to include revisions after peer review. In press at the Journal of the American Statistical Association, Theory and Methods

arXiv:1908.09363 [pdf, other]

Hypocoercivity properties of adaptive Langevin dynamics

Authors: Benedict Leimkuhler, Matthias Sachs, Gabriel Stoltz

Abstract: Adaptive Langevin dynamics is a method for sampling the Boltzmann-Gibbs distribution at prescribed temperature in cases where the potential gradient is subject to stochastic perturbation of unknown magnitude. The method replaces the friction in underdamped Langevin dynamics with a dynamical variable, updated according to a negative feedback loop control law as in the Nosé-Hoover thermostat. Using a hypocoercivity analysis we show that the law of Adaptive Langevin dynamics converges exponentially rapidly to the stationary distribution, with a rate that can be quantified in terms of the key parameters of the dynamics. This allows us in particular to obtain a central limit theorem with respect to the time averages computed along a stochastic path. Our theoretical findings are illustrated by numerical simulations involving classification of the MNIST data set of handwritten digits using Bayesian logistic regression.

Submitted 11 November, 2023; v1 submitted 25 August, 2019; originally announced August 2019.

MSC Class: 60J70; 35B40; 46N30; 35Q84; 65C30

arXiv:1804.04029 [pdf, other]

Ergodic properties of quasi-Markovian generalized Langevin equations with configuration dependent noise and non-conservative force

Authors: Benedict Leimkuhler, Matthias Sachs

Abstract: We discuss the ergodic properties of quasi-Markovian stochastic differential equations, providing general conditions that ensure existence and uniqueness of a smooth invariant distribution and exponential convergence of the evolution operator in suitably weighted $L^{\infty}$ spaces, which implies the validity of a central limit theorem for the respective solution processes. The main new result is an ergodicity condition for the generalized Langevin equation with configuration-dependent noise and (non-)conservative force.

Submitted 10 November, 2018; v1 submitted 11 April, 2018; originally announced April 2018.

arXiv:1804.02327 [pdf, other]

Quadrature Points via Heat Kernel Repulsion

Authors: Jianfeng Lu, Matthias Sachs, Stefan Steinerberger

Abstract: We discuss the classical problem of how to pick $N$ weighted points on a $d$-dimensional manifold so as to obtain a reasonable quadrature rule $$ \frac{1}{|M|}\int_{M}{f(x) dx} \simeq \frac{1}{N} \sum_{i=1}^{N}{a_i f(x_i)}.$$ This problem, naturally, has a long history; the purpose of our paper is to propose selecting points and weights so as to minimize the energy functional $$ \sum_{i,j =1}^{N}{ a_i a_j \exp\left(-\frac{d(x_i,x_j)^2}{4t}\right) } \rightarrow \min, \quad \mbox{where}~t \sim N^{-2/d},$$ $d(x,y)$ is the geodesic distance and $d$ is the dimension of the manifold. This yields point sets that are theoretically guaranteed, via spectral theoretic properties of the Laplacian $-\Delta$, to have good properties. One nice aspect is that the energy functional is universal and independent of the underlying manifold; we show several numerical examples.

Submitted 2 April, 2019; v1 submitted 6 April, 2018; originally announced April 2018.

Showing 1–8 of 8 results for author: Sachs, M