-
Efficient Numerical Algorithms for the Generalized Langevin Equation
Authors:
Benedict Leimkuhler,
Matthias Sachs
Abstract:
We study the design and implementation of numerical methods to solve the generalized Langevin equation (GLE), focusing on canonical sampling properties of numerical integrators. For this purpose, we cast the GLE in an extended phase space formulation and derive a family of splitting methods which generalize existing Langevin dynamics integration methods. We show exponential convergence in law and the validity of a central limit theorem for the Markov chains obtained via these integration methods, and we show that the dynamics of a suggested integration scheme is consistent with asymptotic limits of the exact dynamics and can reproduce (in the short memory limit) a superconvergence property for the analogous splitting of underdamped Langevin dynamics. We then apply our proposed integration method to several model systems, including a Bayesian inference problem. We demonstrate in numerical experiments that our method outperforms other proposed GLE integration schemes in terms of the accuracy of sampling. Moreover, using a parameterization of the memory kernel in the GLE as proposed by Ceriotti et al. [9], our experiments indicate that the obtained GLE-based sampling scheme outperforms state-of-the-art sampling schemes based on underdamped Langevin dynamics in terms of robustness and efficiency.
Submitted 8 December, 2020;
originally announced December 2020.
-
Nonparametric bounds for causal effects in imperfect randomized experiments
Authors:
Erin E. Gabriel,
Arvid Sjölander,
Michael C. Sachs
Abstract:
Nonignorable missingness and noncompliance can occur even in well-designed randomized experiments, making the intervention effect that the experiment was designed to estimate nonidentifiable. Nonparametric causal bounds provide a way to narrow the range of possible values for a nonidentifiable causal effect with minimal assumptions. We derive novel bounds for the causal risk difference for a binary outcome and intervention in randomized experiments with nonignorable missingness caused by a variety of mechanisms and with or without noncompliance. We illustrate the use of the proposed bounds in our motivating data example of the effect of peanut consumption on the development of peanut allergies in infants.
Submitted 11 October, 2020;
originally announced October 2020.
-
Non-reversible Markov chain Monte Carlo for sampling of districting maps
Authors:
Gregory Herschlag,
Jonathan C. Mattingly,
Matthias Sachs,
Evan Wyse
Abstract:
Evaluating the degree of partisan districting (gerrymandering) in a statistical framework typically requires an ensemble of districting plans which are drawn from a prescribed probability distribution that adheres to realistic and non-partisan criteria. In this article we introduce novel non-reversible Markov chain Monte Carlo (MCMC) methods for the sampling of such districting plans which have improved mixing properties in comparison to previously used (reversible) MCMC algorithms. In doing so we extend the current framework for construction of non-reversible Markov chains on discrete sampling spaces by considering a generalization of skew detailed balance. We provide a detailed description of the proposed algorithms and evaluate their performance in numerical experiments.
Submitted 18 August, 2020;
originally announced August 2020.
-
Posterior computation with the Gibbs zig-zag sampler
Authors:
Matthias Sachs,
Deborshee Sen,
Jianfeng Lu,
David Dunson
Abstract:
An intriguing new class of piecewise deterministic Markov processes (PDMPs) has recently been proposed as an alternative to Markov chain Monte Carlo (MCMC). In order to facilitate the application to a larger class of problems, we propose a new class of PDMPs termed Gibbs zig-zag samplers, which allow parameters to be updated in blocks with a zig-zag sampler applied to certain parameters and traditional MCMC-style updates to others. We demonstrate the flexibility of this framework on posterior sampling for logistic models with shrinkage priors for high-dimensional regression and random effects and provide conditions for geometric ergodicity and the validity of a central limit theorem.
Submitted 22 May, 2022; v1 submitted 8 April, 2020;
originally announced April 2020.
-
Causal bounds for outcome-dependent sampling in observational studies
Authors:
Erin E. Gabriel,
Michael C. Sachs,
Arvid Sjölander
Abstract:
Outcome-dependent sampling designs are common in many different scientific fields including epidemiology, ecology, and economics. As with all observational studies, such designs often suffer from unmeasured confounding, which generally precludes the nonparametric identification of causal effects. Nonparametric bounds can provide a way to narrow the range of possible values for a nonidentifiable causal effect without making additional untestable assumptions. The nonparametric bounds literature has almost exclusively focused on settings with random sampling, and the bounds have often been derived with a particular linear programming method. We derive novel bounds for the causal risk difference, often referred to as the average treatment effect, in six settings with outcome-dependent sampling and unmeasured confounding for a binary outcome and exposure. Our derivations of the bounds illustrate two approaches that may be applicable in other settings where the bounding problem cannot be directly stated as a system of linear constraints. We illustrate our derived bounds in a real data example involving the effect of vitamin D concentration on mortality.
Submitted 11 October, 2020; v1 submitted 24 February, 2020;
originally announced February 2020.
-
Hypocoercivity properties of adaptive Langevin dynamics
Authors:
Benedict Leimkuhler,
Matthias Sachs,
Gabriel Stoltz
Abstract:
Adaptive Langevin dynamics is a method for sampling the Boltzmann-Gibbs distribution at prescribed temperature in cases where the potential gradient is subject to stochastic perturbation of unknown magnitude. The method replaces the friction in underdamped Langevin dynamics with a dynamical variable, updated according to a negative feedback loop control law as in the Nosé-Hoover thermostat. Using a hypocoercivity analysis we show that the law of Adaptive Langevin dynamics converges exponentially rapidly to the stationary distribution, with a rate that can be quantified in terms of the key parameters of the dynamics. This allows us in particular to obtain a central limit theorem with respect to the time averages computed along a stochastic path. Our theoretical findings are illustrated by numerical simulations involving classification of the MNIST data set of handwritten digits using Bayesian logistic regression.
Submitted 11 November, 2023; v1 submitted 25 August, 2019;
originally announced August 2019.
-
Ergodic properties of quasi-Markovian generalized Langevin equations with configuration dependent noise and non-conservative force
Authors:
Benedict Leimkuhler,
Matthias Sachs
Abstract:
We discuss the ergodic properties of quasi-Markovian stochastic differential equations, providing general conditions that ensure existence and uniqueness of a smooth invariant distribution and exponential convergence of the evolution operator in suitably weighted $L^{\infty}$ spaces, which implies the validity of a central limit theorem for the respective solution processes. The main new result is an ergodicity condition for the generalized Langevin equation with configuration-dependent noise and (non-)conservative force.
Submitted 10 November, 2018; v1 submitted 11 April, 2018;
originally announced April 2018.
-
Quadrature Points via Heat Kernel Repulsion
Authors:
Jianfeng Lu,
Matthias Sachs,
Stefan Steinerberger
Abstract:
We discuss the classical problem of how to pick $N$ weighted points on a $d$-dimensional manifold so as to obtain a reasonable quadrature rule $$ \frac{1}{|M|}\int_{M}{f(x) dx} \simeq \frac{1}{N} \sum_{i=1}^{N}{a_i f(x_i)}.$$ This problem, naturally, has a long history; the purpose of our paper is to propose selecting points and weights so as to minimize the energy functional $$ \sum_{i,j =1}^{N}{ a_i a_j \exp\left(-\frac{d(x_i,x_j)^2}{4t}\right) } \rightarrow \min, \quad \mbox{where}~t \sim N^{-2/d},$$ $d(x,y)$ is the geodesic distance and $d$ is the dimension of the manifold. This yields point sets that are theoretically guaranteed, via spectral theoretic properties of the Laplacian $-\Delta$, to have good properties. One nice aspect is that the energy functional is universal and independent of the underlying manifold; we show several numerical examples.
Submitted 2 April, 2019; v1 submitted 6 April, 2018;
originally announced April 2018.
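The energy functional quoted in the abstract above can be evaluated directly once pairwise geodesic distances are available. The following is a minimal sketch, not the authors' implementation: it assumes the manifold is the unit circle (so $d = 1$ and the geodesic distance is arc length), takes all weights $a_i = 1$, and sets $t = N^{-2/d}$ with the proportionality constant taken as 1. The function names `heat_kernel_energy` and `circle_geodesic_distances` are hypothetical and introduced only for illustration.

```python
import numpy as np

def heat_kernel_energy(dist, weights, t):
    """Evaluate sum_{i,j} a_i a_j exp(-d(x_i, x_j)^2 / (4 t))."""
    gram = np.exp(-dist**2 / (4.0 * t))
    return float(weights @ gram @ weights)

def circle_geodesic_distances(angles):
    """Pairwise arc-length distances between points on the unit circle."""
    diff = np.abs(angles[:, None] - angles[None, :]) % (2.0 * np.pi)
    return np.minimum(diff, 2.0 * np.pi - diff)

# Assumed setup: N points on the unit circle, equal weights, t = N^(-2/d).
N = 16
d = 1
t = N ** (-2.0 / d)
weights = np.ones(N)

# Compare an equispaced configuration with uniformly random points.
equispaced = 2.0 * np.pi * np.arange(N) / N
rng = np.random.default_rng(0)
random_pts = rng.uniform(0.0, 2.0 * np.pi, size=N)

e_equi = heat_kernel_energy(circle_geodesic_distances(equispaced), weights, t)
e_rand = heat_kernel_energy(circle_geodesic_distances(random_pts), weights, t)
print(f"equispaced energy: {e_equi:.4f}")
print(f"random energy:     {e_rand:.4f}")
```

In this toy setting the well-separated equispaced points give the lower energy, which matches the repulsion intuition behind the functional; minimizing over both points and weights, as the abstract proposes, would require an optimizer on top of this evaluation.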