-
Particle exchange Monte Carlo methods for eigenfunction and related nonlinear problems
Authors:
Paul Dupuis,
Benjamin J. Zhang
Abstract:
We introduce and develop a novel particle exchange Monte Carlo method. Whereas existing methods apply to eigenfunction problems where the eigenvalue is known (e.g., integrals with respect to a Gibbs measure, which can be interpreted as corresponding to eigenvalue zero), here the focus is on problems where the eigenvalue is not known a priori. To obtain an appropriate particle exchange rule we must…
▽ More
We introduce and develop a novel particle exchange Monte Carlo method. Whereas existing methods apply to eigenfunction problems where the eigenvalue is known (e.g., integrals with respect to a Gibbs measure, which can be interpreted as corresponding to eigenvalue zero), here the focus is on problems where the eigenvalue is not known a priori. To obtain an appropriate particle exchange rule we must consider a pair of processes, with one evolving forward in time and the other backward. Applications to eigenfunction problems corresponding to quasistationary distributions and ergodic stochastic control are discussed.
△ Less
Submitted 29 May, 2025;
originally announced May 2025.
-
Function-space regularized Rényi divergences
Authors:
Jeremiah Birrell,
Yannis Pantazis,
Paul Dupuis,
Markos A. Katsoulakis,
Luc Rey-Bellet
Abstract:
We propose a new family of regularized Rényi divergences parametrized not only by the order $α$ but also by a variational function space. These new objects are defined by taking the infimal convolution of the standard Rényi divergence with the integral probability metric (IPM) associated with the chosen function space. We derive a novel dual variational representation that can be used to construct…
▽ More
We propose a new family of regularized Rényi divergences parametrized not only by the order $α$ but also by a variational function space. These new objects are defined by taking the infimal convolution of the standard Rényi divergence with the integral probability metric (IPM) associated with the chosen function space. We derive a novel dual variational representation that can be used to construct numerically tractable divergence estimators. This representation avoids risk-sensitive terms and therefore exhibits lower variance, making it well-behaved when $α>1$; this addresses a notable weakness of prior approaches. We prove several properties of these new divergences, showing that they interpolate between the classical Rényi divergences and IPMs. We also study the $α\to\infty$ limit, which leads to a regularized worst-case-regret and a new variational representation in the classical case. Moreover, we show that the proposed regularized Rényi divergences inherit features from IPMs such as the ability to compare distributions that are not absolutely continuous, e.g., empirical measures and distributions with low-dimensional support. We present numerical results on both synthetic and real datasets, showing the utility of these new divergences in both estimation and GAN training applications; in particular, we demonstrate significantly reduced variance and improved training performance.
△ Less
Submitted 14 February, 2023; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Quasistationary Distributions and Ergodic Control Problems
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Pierre Nyquist,
Guo-Jhen Wu
Abstract:
We introduce and study the basic properties of two ergodic stochastic control problems associated with the quasistationary distribution (QSD) of a diffusion process $X$ relative to a bounded domain. The two problems are in some sense dual, with one defined in terms of the generator associated with $X$ and the other in terms of its adjoint. Besides proving wellposedness of the associated Hamilton-J…
▽ More
We introduce and study the basic properties of two ergodic stochastic control problems associated with the quasistationary distribution (QSD) of a diffusion process $X$ relative to a bounded domain. The two problems are in some sense dual, with one defined in terms of the generator associated with $X$ and the other in terms of its adjoint. Besides proving wellposedness of the associated Hamilton-Jacobi-Bellman equations, we describe how they can be used to characterize important properties of the QSD. Of particular note is that the QSD itself can be identified, up to normalization, in terms of the cost potential of the control problem associated with the adjoint.
△ Less
Submitted 27 February, 2021;
originally announced March 2021.
-
$(f,Γ)$-Divergences: Interpolating between $f$-Divergences and Integral Probability Metrics
Authors:
Jeremiah Birrell,
Paul Dupuis,
Markos A. Katsoulakis,
Yannis Pantazis,
Luc Rey-Bellet
Abstract:
We develop a rigorous and general framework for constructing information-theoretic divergences that subsume both $f$-divergences and integral probability metrics (IPMs), such as the $1$-Wasserstein distance. We prove under which assumptions these divergences, hereafter referred to as $(f,Γ)$-divergences, provide a notion of `distance' between probability measures and show that they can be expresse…
▽ More
We develop a rigorous and general framework for constructing information-theoretic divergences that subsume both $f$-divergences and integral probability metrics (IPMs), such as the $1$-Wasserstein distance. We prove under which assumptions these divergences, hereafter referred to as $(f,Γ)$-divergences, provide a notion of `distance' between probability measures and show that they can be expressed as a two-stage mass-redistribution/mass-transport process. The $(f,Γ)$-divergences inherit features from IPMs, such as the ability to compare distributions which are not absolutely continuous, as well as from $f$-divergences, namely the strict concavity of their variational representations and the ability to control heavy-tailed distributions for particular choices of $f$. When combined, these features establish a divergence with improved properties for estimation, statistical learning, and uncertainty quantification applications. Using statistical learning as an example, we demonstrate their advantage in training generative adversarial networks (GANs) for heavy-tailed, not-absolutely continuous sample distributions. We also show improved performance and stability over gradient-penalized Wasserstein GAN in image generation.
△ Less
Submitted 15 September, 2021; v1 submitted 11 November, 2020;
originally announced November 2020.
-
Analysis and optimization of certain parallel Monte Carlo methods in the low temperature limit
Authors:
Paul Dupuis,
Guo-Jhen Wu
Abstract:
Metastability is a formidable challenge to Markov chain Monte Carlo methods. In this paper we present methods for algorithm design to meet this challenge. The design problem we consider is temperature selection for the infinite swapping scheme, which is the limit of the widely used parallel tempering scheme obtained when the swap rate tends to infinity. We use a recently developed tool for the ana…
▽ More
Metastability is a formidable challenge to Markov chain Monte Carlo methods. In this paper we present methods for algorithm design to meet this challenge. The design problem we consider is temperature selection for the infinite swapping scheme, which is the limit of the widely used parallel tempering scheme obtained when the swap rate tends to infinity. We use a recently developed tool for the analysis of the empirical measure of a small noise diffusion to transform the variance reduction problem into an explicit optimization problem. Our first analysis of the optimization problem is in the setting of a double well model, and it shows that the optimal selection of temperature ratios is a geometric sequence except possibly the highest temperature. In the same setting we identify two different sources of variance reduction, and show how their competition determines the optimal highest temperature. In the general multi-well setting we prove that a pure geometric sequence of temperature ratios is always nearly optimal, with a performance gap that decays geometrically in the number of temperatures.
△ Less
Submitted 10 November, 2020;
originally announced November 2020.
-
The large deviation principle for interacting dynamical systems on random graphs
Authors:
Paul Dupuis,
Georgi Medvedev
Abstract:
Using the weak convergence approach to large deviations, we formulate and prove the large deviation principle (LDP) for W-random graphs in the cut-norm topology. This generalizes the LDP for Erdős-R{\' e}nyi random graphs by Chatterjee and Varadhan. Furthermore, we translate the LDP for random graphs to a class of interacting dynamical systems on such graphs. To this end, we demonstrate that the s…
▽ More
Using the weak convergence approach to large deviations, we formulate and prove the large deviation principle (LDP) for W-random graphs in the cut-norm topology. This generalizes the LDP for Erdős-R{\' e}nyi random graphs by Chatterjee and Varadhan. Furthermore, we translate the LDP for random graphs to a class of interacting dynamical systems on such graphs. To this end, we demonstrate that the solutions of the dynamical models depend continuously on the underlying graphs with respect to the cut-norm and apply the contraction principle.
△ Less
Submitted 15 August, 2021; v1 submitted 27 July, 2020;
originally announced July 2020.
-
Variational Representations and Neural Network Estimation of Rényi Divergences
Authors:
Jeremiah Birrell,
Paul Dupuis,
Markos A. Katsoulakis,
Luc Rey-Bellet,
Jie Wang
Abstract:
We derive a new variational formula for the Rényi family of divergences, $R_α(Q\|P)$, between probability measures $Q$ and $P$. Our result generalizes the classical Donsker-Varadhan variational formula for the Kullback-Leibler divergence. We further show that this Rényi variational formula holds over a range of function spaces; this leads to a formula for the optimizer under very weak assumptions…
▽ More
We derive a new variational formula for the Rényi family of divergences, $R_α(Q\|P)$, between probability measures $Q$ and $P$. Our result generalizes the classical Donsker-Varadhan variational formula for the Kullback-Leibler divergence. We further show that this Rényi variational formula holds over a range of function spaces; this leads to a formula for the optimizer under very weak assumptions and is also key in our development of a consistency theory for Rényi divergence estimators. By applying this theory to neural-network estimators, we show that if a neural network family satisfies one of several strengthened versions of the universal approximation property then the corresponding Rényi divergence estimator is consistent. In contrast to density-estimator based methods, our estimators involve only expectations under $Q$ and $P$ and hence are more effective in high dimensional systems. We illustrate this via several numerical examples of neural network estimation in systems of up to 5000 dimensions.
△ Less
Submitted 20 July, 2021; v1 submitted 7 July, 2020;
originally announced July 2020.
-
Large deviation properties of the empirical measure of a metastable small noise diffusion
Authors:
Paul Dupuis,
Guo-Jhen Wu
Abstract:
The aim of this paper is to develop tractable large deviation approximations for the empirical measure of a small noise diffusion. The starting point is the Freidlin-Wentzell theory, which shows how to approximate via a large deviation principle the invariant distribution of such a diffusion. The rate function of the invariant measure is formulated in terms of quasipotentials, quantities that meas…
▽ More
The aim of this paper is to develop tractable large deviation approximations for the empirical measure of a small noise diffusion. The starting point is the Freidlin-Wentzell theory, which shows how to approximate via a large deviation principle the invariant distribution of such a diffusion. The rate function of the invariant measure is formulated in terms of quasipotentials, quantities that measure the difficulty of a transition from the neighborhood of one metastable set to another. The theory provides an intuitive and useful approximation for the invariant measure, and along the way many useful related results (e.g., transition rates between metastable states) are also developed. With the specific goal of design of Monte Carlo schemes in mind, we prove large deviation limits for integrals with respect to the empirical measure, where the process is considered over a time interval whose length grows as the noise decreases to zero. In particular, we show how the first and second moments of these integrals can be expressed in terms of quasipotentials. When the dynamics of the process depend on parameters, these approximations can be used for algorithm design, and applications of this sort will appear elsewhere. The use of a small noise limit is well motivated, since in this limit good sampling of the state space becomes most challenging. The proof exploits a regenerative structure, and a number of new techniques are needed to turn large deviation estimates over a regenerative cycle into estimates for the empirical measure and its moments.
△ Less
Submitted 8 January, 2021; v1 submitted 28 February, 2020;
originally announced February 2020.
-
Robust bounds and optimization at the large deviations scale for queueing models via Rényi divergence
Authors:
Rami Atar,
Amarjit Budhiraja,
Paul Dupuis,
Ruoyu Wu
Abstract:
This paper develops tools to obtain robust probabilistic estimates for queueing models at the large deviations (LD) scale. These tools are based on the recently introduced robust Rényi bounds, which provide LD estimates (and more generally risk-sensitive (RS) cost estimates) that hold uniformly over an uncertainty class of models, provided that the class is defined in terms of Rényi divergence wit…
▽ More
This paper develops tools to obtain robust probabilistic estimates for queueing models at the large deviations (LD) scale. These tools are based on the recently introduced robust Rényi bounds, which provide LD estimates (and more generally risk-sensitive (RS) cost estimates) that hold uniformly over an uncertainty class of models, provided that the class is defined in terms of Rényi divergence with respect to a reference model and that estimates are available for the reference model. One very attractive quality of the approach is that the class to which the estimates apply may consist of hard models, such as highly non-Markovian models and ones for which the LD principle is not available. Our treatment provides exact expressions as well as bounds on the Rényi divergence rate on families of marked point processes, including as a special case renewal processes. Another contribution is a general result that translates robust RS control problems, where robustness is formulated via Rényi divergence, to finite dimensional convex optimization problems, when the control set is a finite dimensional convex set. The implications to queueing are vast, as they apply in great generality. This is demonstrated on two non-Markovian queueing models. One is the multiclass single-server queue considered as a RS control problem, with scheduling as the control process and exponential weighted queue length as cost. The second is the many-server queue with reneging, with the probability of atypically large reneging count as performance criterion. As far as LD analysis is concerned, no robust estimates or non-Markovian treatment were previously available for either of these models.
△ Less
Submitted 13 August, 2020; v1 submitted 7 January, 2020;
originally announced January 2020.
-
Rare event asymptotics for exploration processes for random graphs
Authors:
Shankar Bhamidi,
Amarjit Budhiraja,
Paul Dupuis,
Ruoyu Wu
Abstract:
Much work in the study of large deviations for random graph models is focused on the dense regime where the theory of graphons has emerged as a principal tool. These tools do not give a good approach to large deviation problems for random graph models in the sparse regime. The aim of this paper is to study an approach for large deviation problems in this regime by establishing Large Deviation Prin…
▽ More
Much work in the study of large deviations for random graph models is focused on the dense regime where the theory of graphons has emerged as a principal tool. These tools do not give a good approach to large deviation problems for random graph models in the sparse regime. The aim of this paper is to study an approach for large deviation problems in this regime by establishing Large Deviation Principles (LDP) on suitable path spaces for certain exploration processes of the associated random graph sequence.
Our work focuses on the study of one particular class of random graph models, namely the configuration model; however the general approach of using exploration processes for studying large deviation properties of sparse random graph models has broader applicability. The goal is to study asymptotics of probabilities of non-typical behavior in the large network limit. The first key step for this is to establish a LDP for an exploration process associated with the configuration model. A suitable exploration process here turns out to be an infinite dimensional Markov process with transition probability rates that diminish to zero in certain parts of the state space. Large deviation properties of such Markovian models is challenging due to poor regularity behavior of the associated local rate functions. Next, using the rate function in the LDP for the exploration process we formulate a calculus of variations problem associated with the asymptotics of component degree distributions. The second key ingredient in our study is a careful analysis of the infinite dimensional Euler-Lagrange equations associated with this calculus of variations problem. Exact solutions are identified which then provide explicit formulas for decay rates of probabilities of non-typical component degree distributions and related quantities.
Please see the paper for the complete abstract.
△ Less
Submitted 3 July, 2020; v1 submitted 8 December, 2019;
originally announced December 2019.
-
Distributional Robustness and Uncertainty Quantification for Rare Events
Authors:
Jeremiah Birrell,
Paul Dupuis,
Markos A. Katsoulakis,
Luc Rey-Bellet,
Jie Wang
Abstract:
Rare events, and more general risk-sensitive quantities-of-interest (QoIs), are significantly impacted by uncertainty in the tail behavior of a distribution. Uncertainty in the tail can take many different forms, each of which leads to a particular ambiguity set of alternative models. Distributional robustness bounds over such an ambiguity set constitute a stress-test of the model. In this paper w…
▽ More
Rare events, and more general risk-sensitive quantities-of-interest (QoIs), are significantly impacted by uncertainty in the tail behavior of a distribution. Uncertainty in the tail can take many different forms, each of which leads to a particular ambiguity set of alternative models. Distributional robustness bounds over such an ambiguity set constitute a stress-test of the model. In this paper we develop a method, utilizing Rényi-divergences, of constructing the ambiguity set that captures a user-specified form of tail-perturbation. We then obtain distributional robustness bounds (performance guarantees) for risk-sensitive QoIs over these ambiguity sets, using the known connection between Rényi-divergences and robustness for risk-sensitive QoIs. We also expand on this connection in several ways, including a generalization of the Donsker-Varadhan variational formula to Rényi divergences, and various tightness results. These ideas are illustrated through applications to uncertainty quantification in a model of lithium-ion battery failure, robustness of large deviations rate functions, and risk-sensitive distributionally robust optimization for option pricing.
△ Less
Submitted 21 November, 2019;
originally announced November 2019.
-
Formulation and properties of a divergence used to compare probability measures without absolute continuity
Authors:
Paul Dupuis,
Yixiang Mao
Abstract:
This paper develops a new divergence that generalizes relative entropy and can be used to compare probability measures without a requirement of absolute continuity. We establish properties of the divergence, and in particular derive and exploit a representation as an infimum convolution of optimal transport cost and relative entropy. Also included are examples of computation and approximation of t…
▽ More
This paper develops a new divergence that generalizes relative entropy and can be used to compare probability measures without a requirement of absolute continuity. We establish properties of the divergence, and in particular derive and exploit a representation as an infimum convolution of optimal transport cost and relative entropy. Also included are examples of computation and approximation of the divergence, and the demonstration of properties that are useful when one quantifies model uncertainty.
△ Less
Submitted 17 November, 2019;
originally announced November 2019.
-
Large Deviations for the Single Server Queue and the Reneging Paradox
Authors:
Rami Atar,
Amarjit Budhiraja,
Paul Dupuis,
Ruoyu Wu
Abstract:
For the M/M/1+M model at the law-of-large-numbers scale, the long run reneging count per unit time does not depend on the individual (i.e., per customer) reneging rate. This paradoxical statement has a simple proof. Less obvious is a large deviations analogue of this fact, stated as follows: The decay rate of the probability that the long run reneging count per unit time is atypically large or aty…
▽ More
For the M/M/1+M model at the law-of-large-numbers scale, the long run reneging count per unit time does not depend on the individual (i.e., per customer) reneging rate. This paradoxical statement has a simple proof. Less obvious is a large deviations analogue of this fact, stated as follows: The decay rate of the probability that the long run reneging count per unit time is atypically large or atypically small does not depend on the individual reneging rate. In this paper, the sample path large deviations principle for the model is proved and the rate function is computed. Next, large time asymptotics for the reneging rate are studied for the case when the arrival rate exceeds the service rate. The key ingredient is a calculus of variations analysis of the variational problem associated with atypical reneging. A characterization of the aforementioned decay rate, given explicitly in terms of the arrival and service rate parameters of the model, is provided yielding a precise mathematical description of this paradoxical behavior.
△ Less
Submitted 13 April, 2020; v1 submitted 15 March, 2019;
originally announced March 2019.
-
Infinite Swapping using IID Samples
Authors:
Paul Dupuis,
Guo-Jhen Wu,
Michael Snarski
Abstract:
We propose a new method for estimating rare event probabilities when independent samples are available. It is assumed that the underlying probability measures satisfy a large deviations principle with a scaling parameter $\varepsilon$ that we call temperature. We show how by combining samples at different temperatures, one can construct an estimator with greatly reduced variance. Although as prese…
▽ More
We propose a new method for estimating rare event probabilities when independent samples are available. It is assumed that the underlying probability measures satisfy a large deviations principle with a scaling parameter $\varepsilon$ that we call temperature. We show how by combining samples at different temperatures, one can construct an estimator with greatly reduced variance. Although as presented here the method is not as broadly applicable as other rare event simulation methods, such as splitting or importance sampling, it does not require any problem-dependent constructions.
△ Less
Submitted 4 November, 2018;
originally announced November 2018.
-
Exit Time Risk-Sensitive Control for Systems of Cooperative Agents
Authors:
Paul Dupuis,
Vaios Laschos,
Kavita Ramanan
Abstract:
We study sequences, parametrized by the number of agents, of many agent exit time stochastic control problems with risk-sensitive cost structure. We identify a fully characterizing assumption, under which each of such control problem corresponds to a risk-neutral stochastic control problem with additive cost, and sequentially to a risk-neutral stochastic control problem on the simplex, where the s…
▽ More
We study sequences, parametrized by the number of agents, of many agent exit time stochastic control problems with risk-sensitive cost structure. We identify a fully characterizing assumption, under which each of such control problem corresponds to a risk-neutral stochastic control problem with additive cost, and sequentially to a risk-neutral stochastic control problem on the simplex, where the specific information about the state of each agent can be discarded. We also prove that, under some additional assumptions, the sequence of value functions converges to the value function of a deterministic control problem, which can be used for the design of nearly optimal controls for the original problem, when the number of agents is sufficiently large.
△ Less
Submitted 22 August, 2018; v1 submitted 17 July, 2018;
originally announced July 2018.
-
Digital coherent control of a superconducting qubit
Authors:
Edward Leonard Jr.,
Matthew A. Beck,
JJ Nelson,
Brad G. Christensen,
Ted Thorbeck,
Caleb Howington,
Alexander Opremcak,
Ivan V. Pechenezhskiy,
Kenneth Dodge,
Nicholas P. Dupuis,
Jaseung Ku,
Francisco Schlenker,
Joseph Suttle,
Christopher Wilen,
Shaojiang Zhu,
Maxim G. Vavilov,
Britton L. T. Plourde,
Robert McDermott
Abstract:
High-fidelity gate operations are essential to the realization of a fault-tolerant quantum computer. In addition, the physical resources required to implement gates must scale efficiently with system size. A longstanding goal of the superconducting qubit community is the tight integration of a superconducting quantum circuit with a proximal classical cryogenic control system. Here we implement coh…
▽ More
High-fidelity gate operations are essential to the realization of a fault-tolerant quantum computer. In addition, the physical resources required to implement gates must scale efficiently with system size. A longstanding goal of the superconducting qubit community is the tight integration of a superconducting quantum circuit with a proximal classical cryogenic control system. Here we implement coherent control of a superconducting transmon qubit using a Single Flux Quantum (SFQ) pulse driver cofabricated on the qubit chip. The pulse driver delivers trains of quantized flux pulses to the qubit through a weak capacitive coupling; coherent rotations of the qubit state are realized when the pulse-to-pulse timing is matched to a multiple of the qubit oscillation period. We measure the fidelity of SFQ-based gates to be ~95% using interleaved randomized benchmarking. Gate fidelities are limited by quasiparticle generation in the dissipative SFQ driver. We characterize the dissipative and dispersive contributions of the quasiparticle admittance and discuss mitigation strategies to suppress quasiparticle poisoning. These results open the door to integration of large-scale superconducting qubit arrays with SFQ control elements for low-latency feedback and stabilization.
△ Less
Submitted 20 June, 2018;
originally announced June 2018.
-
Sensitivity Analysis for Rare Events based on Rényi Divergence
Authors:
Paul Dupuis,
Markos A. Katsoulakis,
Yannis Pantazis,
Luc Rey-Bellet
Abstract:
Rare events play a key role in many applications and numerous algorithms have been proposed for estimating the probability of a rare event. However, relatively little is known on how to quantify the sensitivity of the probability with respect to model parameters. In this paper, instead of the direct statistical estimation of rare event sensitivities, we develop novel and general uncertainty quanti…
▽ More
Rare events play a key role in many applications and numerous algorithms have been proposed for estimating the probability of a rare event. However, relatively little is known on how to quantify the sensitivity of the probability with respect to model parameters. In this paper, instead of the direct statistical estimation of rare event sensitivities, we develop novel and general uncertainty quantification and sensitivity bounds which are not tied to specific rare event simulation methods and which apply to families of rare events. Our method is based on a recently derived variational representation for the family of Rényi divergences in terms of risk sensitive functionals associated with the rare events under consideration. Based on the derived bounds, we propose new sensitivity indices for rare events and relate them to the moment generating function of the score function. The bounds scale in such a way that we additionally develop sensitivity indices for large deviation rate functions.
△ Less
Submitted 4 February, 2019; v1 submitted 17 May, 2018;
originally announced May 2018.
-
Uniform large deviation principles for Banach space valued stochastic differential equations
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Michael Salins
Abstract:
We prove a large deviation principle (LDP) for a general class of Banach space valued stochastic differential equations (SDE) that is uniform with respect to initial conditions in bounded subsets of the Banach space. A key step in the proof is showing that a uniform large deviation principle over compact sets is implied by a uniform over compact sets Laplace principle. Because bounded subsets of i…
▽ More
We prove a large deviation principle (LDP) for a general class of Banach space valued stochastic differential equations (SDE) that is uniform with respect to initial conditions in bounded subsets of the Banach space. A key step in the proof is showing that a uniform large deviation principle over compact sets is implied by a uniform over compact sets Laplace principle. Because bounded subsets of infinite dimensional Banach spaces are in general not relatively compact in the norm topology, we embed the Banach space into its double dual and utilize the weak-$\star $ compactness of closed bounded sets in the double dual space. We prove that a modified version of our stochastic differential equation satisfies a uniform Laplace principle over weak-$\star $ compact sets and consequently a uniform over bounded sets large deviation principle. We then transfer this result back to the original equation using a contraction principle. The main motivation for this uniform LDP is to generalize results of Freidlin and Wentzell concerning the behavior of finite dimensional SDEs. Here we apply the uniform LDP to study the asymptotics of exit times from bounded sets of Banach space valued small noise SDE, including reaction diffusion equations with multiplicative noise and $2$-dimensional stochastic Navier-Stokes equations with multiplicative noise.
△ Less
Submitted 1 March, 2018;
originally announced March 2018.
-
Large Deviation Principle for the Exploration Process of the Configuration Model
Authors:
Shankar Bhamidi,
Amarjit Budhiraja,
Paul Dupuis,
Ruoyu Wu
Abstract:
The configuration model is a sequence of random graphs constructed such that in the large network limit the degree distribution converges to a pre-specified probability distribution. The component structure of such random graphs can be obtained from an infinite dimensional Markov chain referred to as the exploration process. We establish a large deviation principle for the exploration process asso…
▽ More
The configuration model is a sequence of random graphs constructed such that in the large network limit the degree distribution converges to a pre-specified probability distribution. The component structure of such random graphs can be obtained from an infinite dimensional Markov chain referred to as the exploration process. We establish a large deviation principle for the exploration process associated with the configuration model. Proofs rely on a representation of the exploration process as a system of stochastic differential equations driven by Poisson random measures and variational formulas for moments of nonnegative functionals of Poisson random measures. Uniqueness results for certain controlled systems of deterministic equations play a key role in the analysis. Applications of the large deviation results, for studying asymptotic behavior of the degree sequence in large components of the random graphs, are discussed.
△ Less
Submitted 10 December, 2019; v1 submitted 5 August, 2017;
originally announced August 2017.
-
Large Deviations for Small Noise Diffusions in a Fast Markovian Environment
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Arnab Ganguly
Abstract:
A large deviation principle is established for a two-scale stochastic system in which the slow component is a continuous process given by a small noise finite dimensional Itô stochastic differential equation, and the fast component is a finite state pure jump process. Previous works have considered settings where the coupling between the components is weak in a certain sense. In the current work w…
▽ More
A large deviation principle is established for a two-scale stochastic system in which the slow component is a continuous process given by a small noise finite dimensional Itô stochastic differential equation, and the fast component is a finite state pure jump process. Previous works have considered settings where the coupling between the components is weak in a certain sense. In the current work we study a fully coupled system in which the drift and diffusion coefficient of the slow component and the jump intensity function and jump distribution of the fast process depend on the states of both components. In addition, the diffusion can be degenerate. Our proofs use certain stochastic control representations for expectations of exponential functionals of finite dimensional Brownian motions and Poisson random measures together with weak convergence arguments. A key challenge is in the proof of the large deviation lower bound where, due to the interplay between the degeneracy of the diffusion and the full dependence of the coefficients on the two components, the associated local rate function has poor regularity properties.
△ Less
Submitted 8 May, 2017;
originally announced May 2017.
-
Thermodynamic Integration Methods, Infinite Swapping and the Calculation of Generalized Averages
Authors:
J. D. Doll,
P. Dupuis,
P. Nyquist
Abstract:
In the present paper we examine the risk-sensitive and sampling issues associated with the problem of calculating generalized averages. By combining thermodynamic integration and Stationary Phase Monte Carlo techniques, we develop an approach for such problems and explore its utility for a prototypical class of applications.
In the present paper we examine the risk-sensitive and sampling issues associated with the problem of calculating generalized averages. By combining thermodynamic integration and Stationary Phase Monte Carlo techniques, we develop an approach for such problems and explore its utility for a prototypical class of applications.
△ Less
Submitted 31 October, 2016;
originally announced October 2016.
-
A large deviations analysis of certain qualitative properties of parallel tempering and infinite swapping algorithms
Authors:
J. D. Doll,
Paul Dupuis,
Pierre Nyquist
Abstract:
Parallel tempering, or replica exchange, is a popular method for simulating complex systems. The idea is to run parallel simulations at different temperatures, and at a given swap rate exchange configurations between the parallel simulations. From the perspective of large deviations it is optimal to let the swap rate tend to infinity and it is possible to construct a corresponding simulation schem…
▽ More
Parallel tempering, or replica exchange, is a popular method for simulating complex systems. The idea is to run parallel simulations at different temperatures, and at a given swap rate exchange configurations between the parallel simulations. From the perspective of large deviations it is optimal to let the swap rate tend to infinity and it is possible to construct a corresponding simulation scheme, known as infinite swapping. In this paper we propose a novel use of large deviations for empirical measures for a more detailed analysis of the infinite swapping limit in the setting of continuous time jump Markov processes. Using the large deviations rate function and associated stochastic control problems we consider a diagnostic based on temperature assignments, which can be easily computed during a simulation. We show that the convergence of this diagnostic to its a priori known limit is a necessary condition for the convergence of infinite swapping. The rate function is also used to investigate the impact of asymmetries in the underlying potential landscape, and where in the state space poor sampling is most likely to occur.
△ Less
Submitted 19 April, 2016;
originally announced April 2016.
-
Large Deviation Principle For Finite-State Mean Field Interacting Particle Systems
Authors:
Paul Dupuis,
Kavita Ramanan,
Wei Wu
Abstract:
We establish a large deviation principle for the empirical measure process associated with a general class of finite-state mean field interacting particle systems with Lipschitz continuous transition rates that satisfy a certain ergodicity condition. The approach is based on a variational representation for functionals of a Poisson random measure. Under an appropriate strengthening of the ergodici…
▽ More
We establish a large deviation principle for the empirical measure process associated with a general class of finite-state mean field interacting particle systems with Lipschitz continuous transition rates that satisfy a certain ergodicity condition. The approach is based on a variational representation for functionals of a Poisson random measure. Under an appropriate strengthening of the ergodicity condition, we also prove a locally uniform large deviation principle. The main novelty is that more than one particle is allowed to change its state simultaneously, and so a standard approach to the proof based on a change of measure with respect to a system of independent particles is not possible. The result is shown to be applicable to a wide range of models arising from statistical physics, queueing systems and communication networks. Along the way, we establish a large deviation principle for a class of jump Markov processes on the simplex, whose rates decay to zero as they approach the boundary of the domain. This result may be of independent interest.
△ Less
Submitted 22 January, 2016;
originally announced January 2016.
-
Large deviations for configurations generated by Gibbs distributions with energy functionals consisting of singular interaction and weakly confining potentials
Authors:
Paul Dupuis,
Vaios Laschos,
Kavita Ramanan
Abstract:
We establish large deviation principles (LDPs) for empirical measures associated with a sequence of Gibbs distributions on $n$-particle configurations, each of which is defined in terms of an inverse temperature $% β_n$ and an energy functional consisting of a (possibly singular) interaction potential and a (possibly weakly) confining potential. Under fairly general assumptions on the potentials,…
▽ More
We establish large deviation principles (LDPs) for empirical measures associated with a sequence of Gibbs distributions on $n$-particle configurations, each of which is defined in terms of an inverse temperature $% β_n$ and an energy functional consisting of a (possibly singular) interaction potential and a (possibly weakly) confining potential. Under fairly general assumptions on the potentials, we use a common framework to establish LDPs both with speeds $β_n/n \rightarrow \infty$, in which case the rate function is expressed in terms of a functional involving the potentials, and with speed $β_n =n$, when the rate function contains an additional entropic term. Such LDPs are motivated by questions arising in random matrix theory, sampling, simulated annealing and asymptotic convex geometry. Our approach, which uses the weak convergence method developed by Dupuis and Ellis, establishes LDPs with respect to stronger Wasserstein-type topologies. Our results address several interesting examples not covered by previous works, including the case of a weakly confining potential, which allows for rate functions with minimizers that do not have compact support, thus resolving several open questions raised in a work of Chafaï et al.
△ Less
Submitted 5 January, 2020; v1 submitted 21 November, 2015;
originally announced November 2015.
-
Path-space information bounds for uncertainty quantification and sensitivity analysis of stochastic dynamics
Authors:
Paul Dupuis,
Markos A. Katsoulakis,
Yannis Pantazis,
Petr Plechac
Abstract:
Uncertainty quantification is a primary challenge for reliable modeling and simulation of complex stochastic dynamics. Such problems are typically plagued with incomplete information that may enter as uncertainty in the model parameters, or even in the model itself. Furthermore, due to their dynamic nature, we need to assess the impact of these uncertainties on the transient and long-time behavior…
▽ More
Uncertainty quantification is a primary challenge for reliable modeling and simulation of complex stochastic dynamics. Such problems are typically plagued with incomplete information that may enter as uncertainty in the model parameters, or even in the model itself. Furthermore, due to their dynamic nature, we need to assess the impact of these uncertainties on the transient and long-time behavior of the stochastic models and derive corresponding uncertainty bounds for observables of interest. A special class of such challenges is parametric uncertainties in the model and in particular sensitivity analysis along with the corresponding sensitivity bounds for stochastic dynamics. Moreover, sensitivity analysis can be further complicated in models with a high number of parameters that render straightforward approaches, such as gradient methods, impractical. In this paper, we derive uncertainty and sensitivity bounds for path-space observables of stochastic dynamics in terms of new goal-oriented divergences; the latter incorporate both observables and information theory objects such as the relative entropy rate. These bounds are tight, depend on the variance of the particular observable and are computable through Monte Carlo simulation. In the case of sensitivity analysis, the derived sensitivity bounds rely on the path Fisher Information Matrix, hence they depend only on local dynamics and are gradient-free. These features allow for computationally efficient implementation in systems with a high number of parameters, e.g., complex reaction networks and molecular simulations.
△ Less
Submitted 14 July, 2015; v1 submitted 17 March, 2015;
originally announced March 2015.
-
Local stability of Kolmogorov forward equations for finite state nonlinear Markov processes
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Markus Fischer,
Kavita Ramanan
Abstract:
The focus of this work is on local stability of a class of nonlinear ordinary differential equations (ODE) that describe limits of empirical measures associated with finite-state weakly interacting N-particle systems. Local Lyapunov functions are identified for several classes of such ODE, including those associated with systems with slow adaptation and Gibbs systems. Using results from [5] and la…
▽ More
The focus of this work is on local stability of a class of nonlinear ordinary differential equations (ODE) that describe limits of empirical measures associated with finite-state weakly interacting N-particle systems. Local Lyapunov functions are identified for several classes of such ODE, including those associated with systems with slow adaptation and Gibbs systems. Using results from [5] and large deviations heuristics, a partial differential equation (PDE) associated with the nonlinear ODE is introduced and it is shown that positive definite subsolutions of this PDE serve as local Lyapunov functions for the ODE. This PDE characterization is used to construct explicit Lyapunov functions for a broad class of models called locally Gibbs systems. This class of models is significantly larger than the family of Gibbs systems and several examples of such systems are presented, including models with nearest neighbor jumps and models with simultaneous jumps that arise in applications.
△ Less
Submitted 12 February, 2015; v1 submitted 17 December, 2014;
originally announced December 2014.
-
Limits of relative entropies associated with weakly interacting particle systems
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Markus Fischer,
Kavita Ramanan
Abstract:
The limits of scaled relative entropies between probability distributions associated with N-particle weakly interacting Markov processes are considered. The convergence of such scaled relative entropies is established in various settings. The analysis is motivated by the role relative entropy plays as a Lyapunov function for the (linear) Kolmogorov forward equation associated with an ergodic Marko…
▽ More
The limits of scaled relative entropies between probability distributions associated with N-particle weakly interacting Markov processes are considered. The convergence of such scaled relative entropies is established in various settings. The analysis is motivated by the role relative entropy plays as a Lyapunov function for the (linear) Kolmogorov forward equation associated with an ergodic Markov process, and Lyapunov function properties of these scaling limits with respect to nonlinear finite-state Markov processes are studied in the companion paper [6].
△ Less
Submitted 12 February, 2015; v1 submitted 17 December, 2014;
originally announced December 2014.
-
On Performance Measures for Infinite Swapping Monte Carlo Methods
Authors:
J. D. Doll,
Paul Dupuis
Abstract:
We introduce and illustrate a number of performance measures for rare-event sampling methods. These measures are designed to be of use in a variety of expanded ensemble techniques including parallel tempering as well as infinite and partial infinite swapping approaches. Using a variety of selected applications we address questions concerning the variation of sampling performance with respect to ke…
▽ More
We introduce and illustrate a number of performance measures for rare-event sampling methods. These measures are designed to be of use in a variety of expanded ensemble techniques including parallel tempering as well as infinite and partial infinite swapping approaches. Using a variety of selected applications we address questions concerning the variation of sampling performance with respect to key computational ensemble parameters.
△ Less
Submitted 14 October, 2014;
originally announced October 2014.
-
Moderate Deviation Principles for Stochastic Differential Equations with Jumps
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Arnab Ganguly
Abstract:
Moderate deviation principles for stochastic differential equations driven by a Poisson random measure (PRM) in finite and infinite dimensions are obtained. Proofs are based on a variational representation for expected values of positive functionals of a PRM.
Moderate deviation principles for stochastic differential equations driven by a Poisson random measure (PRM) in finite and infinite dimensions are obtained. Proofs are based on a variational representation for expected values of positive functionals of a PRM.
△ Less
Submitted 28 January, 2014;
originally announced January 2014.
-
Moderate deviations for recursive stochastic algorithms
Authors:
Paul Dupuis,
Dane Johnson
Abstract:
We prove a moderate deviation principle for the continuous time interpolation of discrete time recursive stochastic processes. The methods of proof are somewhat different from the corresponding large deviation result, and in particular the proof of the upper bound is more complicated. The results can be applied to the design of accelerated Monte Carlo algorithms for certain problems, where schemes…
▽ More
We prove a moderate deviation principle for the continuous time interpolation of discrete time recursive stochastic processes. The methods of proof are somewhat different from the corresponding large deviation result, and in particular the proof of the upper bound is more complicated. The results can be applied to the design of accelerated Monte Carlo algorithms for certain problems, where schemes based on moderate deviations are easier to construct and in certain situations provide performance comparable to those based on large deviations.
△ Less
Submitted 23 January, 2014;
originally announced January 2014.
-
Robust bounds on risk-sensitive functionals via Renyi divergence
Authors:
Rami Atar,
Kamaljit Chowdhary,
Paul Dupuis
Abstract:
We extend the duality between exponential integrals and relative entropy to a variational formula for exponential integrals involving the Renyi divergence. This formula characterizes the dependence of risk-sensitive functionals and related quantities determined by tail behavior to perturbations in the underlying distributions, in terms of the Renyi divergence. The characterization gives rise to up…
▽ More
We extend the duality between exponential integrals and relative entropy to a variational formula for exponential integrals involving the Renyi divergence. This formula characterizes the dependence of risk-sensitive functionals and related quantities determined by tail behavior to perturbations in the underlying distributions, in terms of the Renyi divergence. The characterization gives rise to upper and lower bounds that are meaningful for all values of a large deviation scaling parameter, allowing one to quantify in explicit terms the robustness of risk-sensitive costs. As applications we consider problems of uncertainty quantification when aspects of the model are not fully known, as well their use in bounding tail properties of an intractable model in terms of a tractable one.
△ Less
Submitted 23 October, 2013;
originally announced October 2013.
-
Escaping from an attractor: Importance sampling and rest points I
Authors:
Paul Dupuis,
Konstantinos Spiliopoulos,
Xiang Zhou
Abstract:
We discuss importance sampling schemes for the estimation of finite time exit probabilities of small noise diffusions that involve escape from an equilibrium. A factor that complicates the analysis is that rest points are included in the domain of interest. We build importance sampling schemes with provably good performance both pre-asymptotically, that is, for fixed size of the noise, and asympto…
▽ More
We discuss importance sampling schemes for the estimation of finite time exit probabilities of small noise diffusions that involve escape from an equilibrium. A factor that complicates the analysis is that rest points are included in the domain of interest. We build importance sampling schemes with provably good performance both pre-asymptotically, that is, for fixed size of the noise, and asymptotically, that is, as the size of the noise goes to zero, and that do not degrade as the time horizon gets large. Simulation studies demonstrate the theoretical results.
△ Less
Submitted 9 September, 2015; v1 submitted 2 March, 2013;
originally announced March 2013.
-
On the large deviation rate function for the empirical measures of reversible jump Markov processes
Authors:
Paul Dupuis,
Yufei Liu
Abstract:
The large deviations principle for the empirical measure for both continuous and discrete time Markov processes is well known. Various expressions are available for the rate function, but these expressions are usually as the solution to a variational problem, and in this sense not explicit. An interesting class of continuous time, reversible processes was identified in the original work of Donsker…
▽ More
The large deviations principle for the empirical measure for both continuous and discrete time Markov processes is well known. Various expressions are available for the rate function, but these expressions are usually as the solution to a variational problem, and in this sense not explicit. An interesting class of continuous time, reversible processes was identified in the original work of Donsker and Varadhan for which an explicit expression is possible. While this class includes many (reversible) processes of interest, it excludes the case of continuous time pure jump processes, such as a reversible finite state Markov chain. In this paper, we study the large deviations principle for the empirical measure of pure jump Markov processes and provide an explicit formula of the rate function under reversibility.
△ Less
Submitted 19 June, 2015; v1 submitted 26 February, 2013;
originally announced February 2013.
-
Rare-Event Sampling: Occupation-Based Performance Measures for Parallel Tempering and Infinite Swapping Monte Carlo Methods
Authors:
J. D. Doll,
Nuria Plattner,
David L. Freeman,
Yufei Liu,
Paul Dupuis
Abstract:
In the present paper we identify a rigorous property of a number of tempering-based Monte Carlo sampling methods, including parallel tempering as well as partial and infinite swapping. Based on this property we develop a variety of performance measures for such rare-event sampling methods that are broadly applicable, informative, and straightforward to implement. We illustrate the use of these per…
▽ More
In the present paper we identify a rigorous property of a number of tempering-based Monte Carlo sampling methods, including parallel tempering as well as partial and infinite swapping. Based on this property we develop a variety of performance measures for such rare-event sampling methods that are broadly applicable, informative, and straightforward to implement. We illustrate the use of these performance measures with a series of applications involving the equilibrium properties of simple Lennard-Jones clusters, applications for which the performance levels of partial and infinite swapping approaches are found to be higher than those of conventional parallel tempering.
△ Less
Submitted 30 August, 2012;
originally announced August 2012.
-
Large Deviations for Stochastic Partial Differential Equations Driven by a Poisson Random Measure
Authors:
Amarjit Budhiraja,
Jiang Chen,
Paul Dupuis
Abstract:
Stochastic partial differential equations driven by Poisson random measures (PRM) have been proposed as models for many different physical systems, where they are viewed as a refinement of a corresponding noiseless partial differential equations (PDE). A systematic framework for the study of probabilities of deviations of the stochastic PDE from the deterministic PDE is through the theory of large…
▽ More
Stochastic partial differential equations driven by Poisson random measures (PRM) have been proposed as models for many different physical systems, where they are viewed as a refinement of a corresponding noiseless partial differential equations (PDE). A systematic framework for the study of probabilities of deviations of the stochastic PDE from the deterministic PDE is through the theory of large deviations. The goal of this work is to develop the large deviation theory for small Poisson noise perturbations of a general class of deterministic infinite dimensional models. Although the analogous questions for finite dimensional systems have been well studied, there are currently no general results in the infinite dimensional setting. This is in part due to the fact that in this setting solutions may have little spatial regularity, and thus classical approximation methods for large deviation analysis become intractable. The approach taken here, which is based on a variational representation for nonnegative functionals of general PRM, reduces the proof of the large deviation principle to establishing basic qualitative properties for controlled analogues of the underlying stochastic system. As an illustration of the general theory, we consider a particular system that models the spread of a pollutant in a waterway.
△ Less
Submitted 21 September, 2012; v1 submitted 18 March, 2012;
originally announced March 2012.
-
On the Infinite Swapping Limit for Parallel Tempering
Authors:
Paul Dupuis,
Yufei Liu,
Nuria Plattner,
J. D. Doll
Abstract:
Parallel tempering, also known as replica exchange sampling, is an important method for simulating complex systems. In this algorithm simulations are conducted in parallel at a series of temperatures, and the key feature of the algorithm is a swap mechanism that exchanges configurations between the parallel simulations at a given rate. The mechanism is designed to allow the low temperature system…
▽ More
Parallel tempering, also known as replica exchange sampling, is an important method for simulating complex systems. In this algorithm simulations are conducted in parallel at a series of temperatures, and the key feature of the algorithm is a swap mechanism that exchanges configurations between the parallel simulations at a given rate. The mechanism is designed to allow the low temperature system of interest to escape from deep local energy minima where it might otherwise be trapped, via those swaps with the higher temperature components. In this paper we introduce a performance criteria for such schemes based on large deviation theory, and argue that the rate of convergence is a monotone increasing function of the swap rate. This motivates the study of the limit process as the swap rate goes to infinity. We construct a scheme which is equivalent to this limit in a distributional sense, but which involves no swapping at all. Instead, the effect of the swapping is captured by a collection of weights that influence both the dynamics and the empirical measure. While theoretically optimal, this limit is not computationally feasible when the number of temperatures is large, and so variations that are easy to implement and nearly optimal are also developed.
△ Less
Submitted 13 June, 2012; v1 submitted 22 October, 2011;
originally announced October 2011.
-
Importance Sampling for Multiscale Diffusions
Authors:
Paul Dupuis,
Konstantinos Spiliopoulos,
Hui Wang
Abstract:
We construct importance sampling schemes for stochastic differential equations with small noise and fast oscillating coefficients. Standard Monte Carlo methods perform poorly for these problems in the small noise limit. With multiscale processes there are additional complications, and indeed the straightforward adaptation of methods for standard small noise diffusions will not produce efficient sc…
▽ More
We construct importance sampling schemes for stochastic differential equations with small noise and fast oscillating coefficients. Standard Monte Carlo methods perform poorly for these problems in the small noise limit. With multiscale processes there are additional complications, and indeed the straightforward adaptation of methods for standard small noise diffusions will not produce efficient schemes. Using the subsolution approach we construct schemes and identify conditions under which the schemes will be asymptotically optimal. Examples and simulation results are provided.
△ Less
Submitted 27 July, 2011;
originally announced July 2011.
-
An Infinite Swapping Approach to the Rare-Event Sampling Problem
Authors:
Nuria Plattner,
J. D. Doll,
Paul Dupuis,
Hui Wang,
Yufei Liu,
J. E. Gubernatis
Abstract:
We describe a new approach to the rare-event Monte Carlo sampling problem. This technique utilizes a symmetrization strategy to create probability distributions that are more highly connected and thus more easily sampled than their original, potentially sparse counterparts. After discussing the formal outline of the approach and devising techniques for its practical implementation, we illustrate t…
▽ More
We describe a new approach to the rare-event Monte Carlo sampling problem. This technique utilizes a symmetrization strategy to create probability distributions that are more highly connected and thus more easily sampled than their original, potentially sparse counterparts. After discussing the formal outline of the approach and devising techniques for its practical implementation, we illustrate the utility of the technique with a series of numerical applications to Lennard-Jones clusters of varying complexity and rare-event character.
△ Less
Submitted 30 June, 2011;
originally announced June 2011.
-
Counting with Combined Splitting and Capture-Recapture Methods
Authors:
Paul Dupuis,
Bahar Kaynar,
Ad Ridder,
Reuven Rubinstein,
Radislav Vaisman
Abstract:
We apply the splitting method to three well-known counting problems, namely 3-SAT, random graphs with prescribed degrees, and binary contingency tables. We present an enhanced version of the splitting method based on the capture-recapture technique, and show by experiments the superiority of this technique for SAT problems in terms of variance of the associated estimators, and speed of the algorit…
▽ More
We apply the splitting method to three well-known counting problems, namely 3-SAT, random graphs with prescribed degrees, and binary contingency tables. We present an enhanced version of the splitting method based on the capture-recapture technique, and show by experiments the superiority of this technique for SAT problems in terms of variance of the associated estimators, and speed of the algorithms.
△ Less
Submitted 31 March, 2011;
originally announced March 2011.
-
Distinguishing and integrating aleatoric and epistemic variation in uncertainty quantification
Authors:
Kamaljit Chowdhary,
Paul Dupuis
Abstract:
Much of uncertainty quantification to date has focused on determining the effect of variables modeled probabilistically, and with a known distribution, on some physical or engineering system. We develop methods to obtain information on the system when the distributions of some variables are known exactly, others are known only approximately, and perhaps others are not modeled as random variables a…
▽ More
Much of uncertainty quantification to date has focused on determining the effect of variables modeled probabilistically, and with a known distribution, on some physical or engineering system. We develop methods to obtain information on the system when the distributions of some variables are known exactly, others are known only approximately, and perhaps others are not modeled as random variables at all. The main tool used is the duality between risk-sensitive integrals and relative entropy, and we obtain explicit bounds on standard performance measures (variances, exceedance probabilities) over families of distributions whose distance from a nominal distribution is measured by relative entropy. The evaluation of the risk-sensitive expectations is based on polynomial chaos expansions, which help keep the computational aspects tractable.
△ Less
Submitted 17 March, 2011; v1 submitted 9 March, 2011;
originally announced March 2011.
-
Large Deviations for Multiscale Diffusions via Weak Convergence Methods
Authors:
Paul Dupuis,
Konstantinos Spiliopoulos
Abstract:
We study the large deviations principle for locally periodic stochastic differential equations with small noise and fast oscillating coefficients. There are three possible regimes depending on how fast the intensity of the noise goes to zero relative to the homogenization parameter. We use weak convergence methods which provide convenient representations for the action functional for all three reg…
▽ More
We study the large deviations principle for locally periodic stochastic differential equations with small noise and fast oscillating coefficients. There are three possible regimes depending on how fast the intensity of the noise goes to zero relative to the homogenization parameter. We use weak convergence methods which provide convenient representations for the action functional for all three regimes. Along the way we study weak limits of related controlled SDEs with fast oscillating coefficients and derive, in some cases, a control that nearly achieves the large deviations lower bound at the prelimit level. This control is useful for designing efficient importance sampling schemes for multiscale diffusions driven by small noise.
△ Less
Submitted 26 November, 2010;
originally announced November 2010.
-
Large deviation properties of weakly interacting processes via weak convergence methods
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Markus Fischer
Abstract:
We study large deviation properties of systems of weakly interacting particles modeled by Itô stochastic differential equations (SDEs). It is known under certain conditions that the corresponding sequence of empirical measures converges, as the number of particles tends to infinity, to the weak solution of an associated McKean-Vlasov equation. We derive a large deviation principle via the weak con…
▽ More
We study large deviation properties of systems of weakly interacting particles modeled by Itô stochastic differential equations (SDEs). It is known under certain conditions that the corresponding sequence of empirical measures converges, as the number of particles tends to infinity, to the weak solution of an associated McKean-Vlasov equation. We derive a large deviation principle via the weak convergence approach. The proof, which avoids discretization arguments, is based on a representation theorem, weak convergence and ideas from stochastic optimal control. The method works under rather mild assumptions and also for models described by SDEs not of diffusion type. To illustrate this, we treat the case of SDEs with delay.
△ Less
Submitted 25 September, 2012; v1 submitted 29 September, 2010;
originally announced September 2010.
-
Large deviations for stochastic flows of diffeomorphisms
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Vasileios Maroulas
Abstract:
A large deviation principle is established for a general class of stochastic flows in the small noise limit. This result is then applied to a Bayesian formulation of an image matching problem, and an approximate maximum likelihood property is shown for the solution of an optimization problem involving the large deviations rate function.
A large deviation principle is established for a general class of stochastic flows in the small noise limit. This result is then applied to a Bayesian formulation of an image matching problem, and an approximate maximum likelihood property is shown for the solution of an optimization problem involving the large deviations rate function.
△ Less
Submitted 23 February, 2010;
originally announced February 2010.
-
Correction. SDEs with oblique reflections on nonsmooth domains
Authors:
Paul Dupuis,
Hitoshi Ishii
Abstract:
Correction to The Annals of Probability 21 (1993) 554--580 [http://projecteuclid.org/euclid.aop/1176989415]
Correction to The Annals of Probability 21 (1993) 554--580 [http://projecteuclid.org/euclid.aop/1176989415]
△ Less
Submitted 25 September, 2008;
originally announced September 2008.
-
Large deviations for infinite dimensional stochastic dynamical systems
Authors:
Amarjit Budhiraja,
Paul Dupuis,
Vasileios Maroulas
Abstract:
The large deviations analysis of solutions to stochastic differential equations and related processes is often based on approximation. The construction and justification of the approximations can be onerous, especially in the case where the process state is infinite dimensional. In this paper we show how such approximations can be avoided for a variety of infinite dimensional models driven by so…
▽ More
The large deviations analysis of solutions to stochastic differential equations and related processes is often based on approximation. The construction and justification of the approximations can be onerous, especially in the case where the process state is infinite dimensional. In this paper we show how such approximations can be avoided for a variety of infinite dimensional models driven by some form of Brownian noise. The approach is based on a variational representation for functionals of Brownian motion. Proofs of large deviations properties are reduced to demonstrating basic qualitative properties (existence, uniqueness and tightness) of certain perturbations of the original process.
△ Less
Submitted 27 August, 2008;
originally announced August 2008.
-
Splitting for Rare Event Simulation: A Large Deviation Approach to Design and Analysis
Authors:
Thomas Dean,
Paul Dupuis
Abstract:
Particle splitting methods are considered for the estimation of rare events. The probability of interest is that a Markov process first enters a set $B$ before another set $A$, and it is assumed that this probability satisfies a large deviation scaling. A notion of subsolution is defined for the related calculus of variations problem, and two main results are proved under mild conditions. The fi…
▽ More
Particle splitting methods are considered for the estimation of rare events. The probability of interest is that a Markov process first enters a set $B$ before another set $A$, and it is assumed that this probability satisfies a large deviation scaling. A notion of subsolution is defined for the related calculus of variations problem, and two main results are proved under mild conditions. The first is that the number of particles generated by the algorithm grows subexponentially if and only if a certain scalar multiple of the importance function is a subsolution. The second is that, under the same condition, the variance of the algorithm is characterized (asymptotically) in terms of the subsolution. The design of asymptotically optimal schemes is discussed, and numerical examples are presented.
△ Less
Submitted 13 November, 2007;
originally announced November 2007.
-
Dynamic importance sampling for queueing networks
Authors:
Paul Dupuis,
Ali Devin Sezer,
Hui Wang
Abstract:
Importance sampling is a technique that is commonly used to speed up Monte Carlo simulation of rare events. However, little is known regarding the design of efficient importance sampling algorithms in the context of queueing networks. The standard approach, which simulates the system using an a priori fixed change of measure suggested by large deviation analysis, has been shown to fail in even t…
▽ More
Importance sampling is a technique that is commonly used to speed up Monte Carlo simulation of rare events. However, little is known regarding the design of efficient importance sampling algorithms in the context of queueing networks. The standard approach, which simulates the system using an a priori fixed change of measure suggested by large deviation analysis, has been shown to fail in even the simplest network setting (e.g., a two-node tandem network). Exploiting connections between importance sampling, differential games, and classical subsolutions of the corresponding Isaacs equation, we show how to design and analyze simple and efficient dynamic importance sampling schemes for general classes of networks. The models used to illustrate the approach include $d$-node tandem Jackson networks and a two-node network with feedback, and the rare events studied are those of large queueing backlogs, including total population overflow and the overflow of individual buffers.
△ Less
Submitted 24 October, 2007;
originally announced October 2007.
-
On the convergence from discrete to continuous time in an optimal stopping problem
Authors:
Paul Dupuis,
Hui Wang
Abstract:
We consider the problem of optimal stopping for a one-dimensional diffusion process. Two classes of admissible stopping times are considered. The first class consists of all nonanticipating stopping times that take values in [0,\infty], while the second class further restricts the set of allowed values to the discrete grid {nh:n=0,1,2,...,\infty} for some parameter h>0. The value functions for t…
▽ More
We consider the problem of optimal stopping for a one-dimensional diffusion process. Two classes of admissible stopping times are considered. The first class consists of all nonanticipating stopping times that take values in [0,\infty], while the second class further restricts the set of allowed values to the discrete grid {nh:n=0,1,2,...,\infty} for some parameter h>0. The value functions for the two problems are denoted by V(x) and V^h(x), respectively. We identify the rate of convergence of V^h(x) to V(x) and the rate of convergence of the stopping regions, and provide simple formulas for the rate coefficients.
△ Less
Submitted 12 May, 2005;
originally announced May 2005.
-
Dynamic importance sampling for uniformly recurrent markov chains
Authors:
Paul Dupuis,
Hui Wang
Abstract:
Importance sampling is a variance reduction technique for efficient estimation of rare-event probabilities by Monte Carlo. In standard importance sampling schemes, the system is simulated using an a priori fixed change of measure suggested by a large deviation lower bound analysis. Recent work, however, has suggested that such schemes do not work well in many situations. In this paper we conside…
▽ More
Importance sampling is a variance reduction technique for efficient estimation of rare-event probabilities by Monte Carlo. In standard importance sampling schemes, the system is simulated using an a priori fixed change of measure suggested by a large deviation lower bound analysis. Recent work, however, has suggested that such schemes do not work well in many situations. In this paper we consider dynamic importance sampling in the setting of uniformly recurrent Markov chains. By ``dynamic'' we mean that in the course of a single simulation, the change of measure can depend on the outcome of the simulation up till that time. Based on a control-theoretic approach to large deviations, the existence of asymptotically optimal dynamic schemes is demonstrated in great generality. The implementation of the dynamic schemes is carried out with the help of a limiting Bellman equation. Numerical examples are presented to contrast the dynamic and standard schemes.
△ Less
Submitted 22 March, 2005;
originally announced March 2005.
-
Explicit solution for a network control problem in the large deviation regime
Authors:
Rami Atar,
Paul Dupuis,
Adam Shwartz
Abstract:
We consider optimal control of a stochastic network,where service is controlled to prevent buffer overflow. We use a risk-sensitive escape time criterion, which in comparison to the ordinary escape time criteria heavily penalizes exits which occur on short time intervals. A limit as the buffer sizes tend to infinity is considered. In [2] we showed that, for a large class of networks, the limit o…
▽ More
We consider optimal control of a stochastic network,where service is controlled to prevent buffer overflow. We use a risk-sensitive escape time criterion, which in comparison to the ordinary escape time criteria heavily penalizes exits which occur on short time intervals. A limit as the buffer sizes tend to infinity is considered. In [2] we showed that, for a large class of networks, the limit of the normalized cost agrees with the value function of a differential game. The game's value is characterized in [2] as the unique solution to a Hamilton-Jacobi-Bellman Partial Differential Equation (PDE). In the current paper we apply this general theory to the important case of a network of queues in tandem. Our main results are: (i) the construction of an explicit solution to the corresponding PDE, and (ii) drawing out the implications for optimal risk-sensitive and robust regulation of the network. In particular, the following general principle can be extracted. To avoid buffer overflow there is a natural competition between two tendencies. One may choose to serve a particular queue, since that will help prevent its own buffer from overflowing, or one may prefer to stop service, with the goal of preventing overflow of buffers further down the line. The solution to the PDE indicates the optimal choice between these two, specifying the parts of the state space where each queue must be served (so as not to lose optimality), and where it can idle.
△ Less
Submitted 3 January, 2005;
originally announced January 2005.