-
Optimal Bregman quantization : existence and uniqueness of optimal quantizers revisited
Authors:
Guillaume Boutoille,
Gilles Pagès
Abstract:
In this paper we revisit the exsistence theorem for $L^r$-optimal quantization, $r\ge 2$, with respect to a Bregman divergence: we establish the existence of optimal quantizaers under lighter assumptions onthe strictly convex function which generates the divergence, espcially in the quadratic case ($r=2$). We then prove a uniqueness theorem ``à la Trushkin'' in one dimension for strongly unimodal…
▽ More
In this paper we revisit the exsistence theorem for $L^r$-optimal quantization, $r\ge 2$, with respect to a Bregman divergence: we establish the existence of optimal quantizaers under lighter assumptions onthe strictly convex function which generates the divergence, espcially in the quadratic case ($r=2$). We then prove a uniqueness theorem ``à la Trushkin'' in one dimension for strongly unimodal distributions and divergences gerated by strictly convex functions whiose thire dervative is either stictly $\log$-convex or $\log$-concave.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
Locally optimal Functional Quantization
Authors:
Harald Luschgy,
Gilles Pagès
Abstract:
In this note we demonstrate that locally optimal functional quantizers for probability distributions on a Banach space lying in the support of $P$ behave exactly like globally optimal functional quantizers in terms of stationarity/self-consistency.
In this note we demonstrate that locally optimal functional quantizers for probability distributions on a Banach space lying in the support of $P$ behave exactly like globally optimal functional quantizers in terms of stationarity/self-consistency.
△ Less
Submitted 22 April, 2025; v1 submitted 11 April, 2025;
originally announced April 2025.
-
A note on the $\mathcal{W}_2$-convergence rate of the empirical measure of an ergodic $\mathbb{R}^d$-valued diffusion
Authors:
Jean-Francois Chassagneux,
Gilles Pagès
Abstract:
In this note, we consider a Stochastic Differential Equation under a strong confluence and Lipschitz continuity assumption of the coefficients. For the unique stationary solution, we study the rate of convergence of its empirical measure toward the invariant probability measure. We provide rate for the Wasserstein distance in the mean quadratic and almost sure sense.
In this note, we consider a Stochastic Differential Equation under a strong confluence and Lipschitz continuity assumption of the coefficients. For the unique stationary solution, we study the rate of convergence of its empirical measure toward the invariant probability measure. We provide rate for the Wasserstein distance in the mean quadratic and almost sure sense.
△ Less
Submitted 11 February, 2025;
originally announced February 2025.
-
Convex comparison of Gaussian mixtures
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
Motivated by the study of the propagation of convexity by semi-groups of stochastic differential equations and convex comparison between the distributions of solutions of two such equations, we study the comparison for the convex order between a Gaussian distribution and a Gaussian mixture. We give and discuss intrinsic necessary and sufficient conditions for convex ordering. On the examples that…
▽ More
Motivated by the study of the propagation of convexity by semi-groups of stochastic differential equations and convex comparison between the distributions of solutions of two such equations, we study the comparison for the convex order between a Gaussian distribution and a Gaussian mixture. We give and discuss intrinsic necessary and sufficient conditions for convex ordering. On the examples that we have worked out, the two conditions appear to be closely related.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Computing the invariant distribution of McKean-Vlasov SDEs by ergodic simulation
Authors:
Jean-François Chassagneux,
Gilles Pagès
Abstract:
We design a fully implementable scheme to compute the invariant distribution of ergodic McKean-Vlasov SDE satisfying a uniform confluence property. Under natural conditions, we prove various convergence results notably we obtain rates for the Wasserstein distance in quadratic mean and almost sure sense.
We design a fully implementable scheme to compute the invariant distribution of ergodic McKean-Vlasov SDE satisfying a uniform confluence property. Under natural conditions, we prove various convergence results notably we obtain rates for the Wasserstein distance in quadratic mean and almost sure sense.
△ Less
Submitted 12 February, 2025; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Volterra equations with affine drift: looking for stationarity
Authors:
Gilles Pagès
Abstract:
We investigate the properties of the solutions of scaled Volterra equations (i.e. with an affine mean-reverting drift) in terms of stationarity at both a finite horizon and on the long run. In particular we prove that such an equation never has a stationary regime, except if the kernel is constant (i.e. the equation is a standard Brownian diffusion) or in some fully degenerate pathological setting…
▽ More
We investigate the properties of the solutions of scaled Volterra equations (i.e. with an affine mean-reverting drift) in terms of stationarity at both a finite horizon and on the long run. In particular we prove that such an equation never has a stationary regime, except if the kernel is constant (i.e. the equation is a standard Brownian diffusion) or in some fully degenerate pathological settings. We introduce a deterministic stabilizer $ ς$ associated to the kernel which produces a {\em fake stationary regime} in the sense that all the marginals share the same expectation and variance. We also show that the marginals of such a process starting from when starting various initial values are confluent in $L^2$ as time goes to infinity. We establish that for some classes of diffusion coefficients (square root of positive quadratic polynomials) the time shifted solutions of such Volterra equations weakly functionally converges toward a family of $L^2$-stationary processes sharing the same covariance function. We apply these results to (stabilized) rough volatility models (when the kernel $K(t)= t^{H-\frac 12}$, $0<H<\frac 12$) which leads to produce a fake stationary quadratic rough Heston model.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Convex ordering of solutions to one-dimensional SDEs
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
In this paper, we are interested in the propagation of convexity by the strong solution to a one-dimensional Brownian stochastic differential equation with coefficients Lipschitz in the spatial variable uniformly in the time variable and in the convex ordering between the solutions of two such equations. We prove that while these properties hold without further assumptions for convex functions of…
▽ More
In this paper, we are interested in the propagation of convexity by the strong solution to a one-dimensional Brownian stochastic differential equation with coefficients Lipschitz in the spatial variable uniformly in the time variable and in the convex ordering between the solutions of two such equations. We prove that while these properties hold without further assumptions for convex functions of the processes at one instant only, an assumption almost amounting to spatial convexity of the diffusion coefficient is needed for the extension to convex functions at two instants. Under this spatial convexity of the diffusion coefficients, the two properties even hold for convex functionals of the whole path. For directionally convex functionals, the spatial convexity of the diffusion coefficient is no longer needed. Our method of proof consists in first establishing the results for time discretization schemes of Euler type and then transferring them to their limiting Brownian diffusions. We thus exhibit approximations which avoid {\em convexity arbitrages} by preserving convexity propagation and comparison and can be computed by Monte Carlo simulation.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Asymptotic Error Analysis of Multilevel Stochastic Approximations for the Value-at-Risk and Expected Shortfall
Authors:
Stéphane Crépey,
Noufel Frikha,
Azar Louzi,
Gilles Pagès
Abstract:
Crépey, Frikha, and Louzi (2023) introduced a nested stochastic approximation algorithm and its multilevel acceleration to compute the value-at-risk and expected shortfall of a random financial loss. We hereby establish central limit theorems for the renormalized estimation errors associated with both algorithms as well as their averaged versions. Our findings are substantiated through a numerical…
▽ More
Crépey, Frikha, and Louzi (2023) introduced a nested stochastic approximation algorithm and its multilevel acceleration to compute the value-at-risk and expected shortfall of a random financial loss. We hereby establish central limit theorems for the renormalized estimation errors associated with both algorithms as well as their averaged versions. Our findings are substantiated through a numerical example.
△ Less
Submitted 25 November, 2024; v1 submitted 26 November, 2023;
originally announced November 2023.
-
Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport
Authors:
Pierre Bras,
Gilles Pagès
Abstract:
We propose a new algorithm for variance reduction when estimating $f(X_T)$ where $X$ is the solution to some stochastic differential equation and $f$ is a test function. The new estimator is $(f(X^1_T) + f(X^2_T))/2$, where $X^1$ and $X^2$ have same marginal law as $X$ but are pathwise correlated so that to reduce the variance. The optimal correlation function $ρ$ is approximated by a deep neural…
▽ More
We propose a new algorithm for variance reduction when estimating $f(X_T)$ where $X$ is the solution to some stochastic differential equation and $f$ is a test function. The new estimator is $(f(X^1_T) + f(X^2_T))/2$, where $X^1$ and $X^2$ have same marginal law as $X$ but are pathwise correlated so that to reduce the variance. The optimal correlation function $ρ$ is approximated by a deep neural network and is calibrated along the trajectories of $(X^1, X^2)$ by policy gradient and reinforcement learning techniques. Finding an optimal coupling given marginal laws has links with maximum optimal transport.
△ Less
Submitted 15 September, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
From elephant to goldfish (and back): memory in stochastic Volterra processes
Authors:
Ofelia Bonesini,
Giorgia Callegaro,
Martino Grasselli,
Gilles Pagès
Abstract:
We propose a new theoretical framework that exploits convolution kernels to transform a Volterra path-dependent (non-Markovian) stochastic process into a standard (Markovian) diffusion process. This transformation is achieved by embedding a Markovian "memory process" within the dynamics of the non-Markovian process. We discuss existence and path-wise regularity of solutions for the stochastic Volt…
▽ More
We propose a new theoretical framework that exploits convolution kernels to transform a Volterra path-dependent (non-Markovian) stochastic process into a standard (Markovian) diffusion process. This transformation is achieved by embedding a Markovian "memory process" within the dynamics of the non-Markovian process. We discuss existence and path-wise regularity of solutions for the stochastic Volterra equations introduced and we provide a financial application to volatility modeling. We also propose a numerical scheme for simulating the processes. The numerical scheme exhibits a strong convergence rate of 1/2, which is independent of the roughness parameter of the volatility process. This is a significant improvement compared to Euler schemes used in similar models.
We propose a new theoretical framework that exploits convolution kernels to transform a Volterra path-dependent (non-Markovian) stochastic process into a standard (Markovian) diffusion process. This transformation is achieved by embedding a Markovian "memory process" (the goldfish) within the dynamics of the non-Markovian process (the elephant). Most notably, it is also possible to go back, i.e., the transformation is reversible. We discuss existence and path-wise regularity of solutions for the stochastic Volterra equations introduced and we propose a numerical scheme for simulating the processes, which exhibits a remarkable convergence rate of $1/2$. In particular, in the fractional kernel case, the strong convergence rate is independent of the roughness parameter, which is a positive novelty in contrast with what happens in the available Euler schemes in the literature in rough volatility models.
△ Less
Submitted 8 January, 2025; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Langevin algorithms for Markovian Neural Networks and Deep Stochastic control
Authors:
Pierre Bras,
Gilles Pagès
Abstract:
Stochastic Gradient Descent Langevin Dynamics (SGLD) algorithms, which add noise to the classic gradient descent, are known to improve the training of neural networks in some cases where the neural network is very deep. In this paper we study the possibilities of training acceleration for the numerical resolution of stochastic control problems through gradient descent, where the control is paramet…
▽ More
Stochastic Gradient Descent Langevin Dynamics (SGLD) algorithms, which add noise to the classic gradient descent, are known to improve the training of neural networks in some cases where the neural network is very deep. In this paper we study the possibilities of training acceleration for the numerical resolution of stochastic control problems through gradient descent, where the control is parametrized by a neural network. If the control is applied at many discretization times then solving the stochastic control problem reduces to minimizing the loss of a very deep neural network. We numerically show that Langevin algorithms improve the training on various stochastic control problems like hedging and resource management, and for different choices of gradient descent methods.
△ Less
Submitted 13 January, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
Convex ordering for stochastic Volterra equations and their Euler schemes
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
In this paper, we are interested in comparing solutions to stochastic Volterra equations for the convex order on the space of continuous $\R^d$-valued paths and for the monotonic convex order when $d=1$. Even if in general these solutions are neither semi-martingales nor Markov processes, we are able to exhibit conditions on their coefficients enabling the comparison. Our approach consists in firs…
▽ More
In this paper, we are interested in comparing solutions to stochastic Volterra equations for the convex order on the space of continuous $\R^d$-valued paths and for the monotonic convex order when $d=1$. Even if in general these solutions are neither semi-martingales nor Markov processes, we are able to exhibit conditions on their coefficients enabling the comparison. Our approach consists in first comparing their Euler schemes and then taking the limit as the time step vanishes. We consider two types of Euler schemes depending on the way the Volterra kernels are discretized. The conditions ensuring the comparison are slightly weaker for the first scheme than for the second one and this is the other way round for convergence. Moreover, we extend the integrability needed on the starting values in the existence and convergence results in the literature to be able to only assume finite first order moments, which is the natural framework for convex ordering.
△ Less
Submitted 18 November, 2022;
originally announced November 2022.
-
Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise II: Total Variation
Authors:
Pierre Bras,
Gilles Pagès
Abstract:
We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic differential equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptiv…
▽ More
We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic differential equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplicative) noise, where $a : \mathbb{R}^+ \to \mathbb{R}^+$ is a function decreasing to $0$ and where $Υ$ is a correction term. Allowing $σ$ to depend on the position brings faster convergence in comparison with the classical Langevin equation $dY_t = -\nabla V(Y_t)dt + σdW_t$. In a previous paper we established the convergence in $L^1$-Wasserstein distance of $Y_t$ and of its associated Euler scheme $\bar{Y}_t$ to $\text{argmin}(V)$ with the classical schedule $a(t) = A\log^{-1/2}(t)$. In the present paper we prove the convergence in total variation distance. The total variation case appears more demanding to deal with and requires regularization lemmas.
△ Less
Submitted 30 May, 2022;
originally announced May 2022.
-
Total variation distance between two diffusions in small time with unbounded drift: application to the Euler-Maruyama scheme
Authors:
Pierre Bras,
Gilles Pagès,
Fabien Panloup
Abstract:
We give bounds for the total variation distance between the solutions to two stochastic differential equations starting at the same point and with close coefficients, which applies in particular to the distance between an exact solution and its Euler-Maruyama scheme in small time. We show that for small $t$, the total variation distance is of order $t^{r/(2r+1)}$ if the noise coefficient $σ$ of th…
▽ More
We give bounds for the total variation distance between the solutions to two stochastic differential equations starting at the same point and with close coefficients, which applies in particular to the distance between an exact solution and its Euler-Maruyama scheme in small time. We show that for small $t$, the total variation distance is of order $t^{r/(2r+1)}$ if the noise coefficient $σ$ of the SDE is elliptic and $\mathcal{C}^{2r}_b$, $r\in \mathbb{N}$ and if the drift is $C^1$ with bounded derivatives, using multi-step Richardson-Romberg extrapolation. We do not require the drift to be bounded. Then we prove with a counterexample that we cannot achieve a bound better than $t^{1/2}$ in general.
△ Less
Submitted 9 December, 2022; v1 submitted 18 November, 2021;
originally announced November 2021.
-
Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise
Authors:
Pierre Bras,
Gilles Pagès
Abstract:
We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplica…
▽ More
We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplicative) noise, where $a : \mathbb{R}^+ \to \mathbb{R}^+$ is a function decreasing to $0$ and where $Υ$ is a correction term. This setting can be applied to optimization problems arising in Machine Learning. The case where $σ$ is a constant matrix has been extensively studied however little attention has been paid to the general case. We prove the convergence for the $L^1$-Wasserstein distance of $Y_t$ and of the associated Euler-scheme $\bar{Y}_t$ to some measure $ν^\star$ which is supported by $\text{argmin}(V)$ and give rates of convergence to the instantaneous Gibbs measure $ν_{a(t)}$ of density $\propto \exp(-2V(x)/a(t)^2)$. To do so, we first consider the case where $a$ is a piecewise constant function. We find again the classical schedule $a(t) = A\log^{-1/2}(t)$. We then prove the convergence for the general case by giving bounds for the Wasserstein distance to the stepwise constant case using ergodicity properties.
△ Less
Submitted 24 April, 2022; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Performance of a Markovian neural network versus dynamic programming on a fishing control problem
Authors:
Mathieu Laurière,
Gilles Pagès,
Olivier Pironneau
Abstract:
Fishing quotas are unpleasant but efficient to control the productivity of a fishing site. A popular model has a stochastic differential equation for the biomass on which a stochastic dynamic programming or a Hamilton-Jacobi-Bellman algorithm can be used to find the stochastic control -- the fishing quota. We compare the solutions obtained by dynamic programming against those obtained with a neura…
▽ More
Fishing quotas are unpleasant but efficient to control the productivity of a fishing site. A popular model has a stochastic differential equation for the biomass on which a stochastic dynamic programming or a Hamilton-Jacobi-Bellman algorithm can be used to find the stochastic control -- the fishing quota. We compare the solutions obtained by dynamic programming against those obtained with a neural network which preserves the Markov property of the solution. The method is extended to a similar multi species model to check its robustness in high dimension.
△ Less
Submitted 14 September, 2021;
originally announced September 2021.
-
Quantization-based approximation of reflected BSDEs with extended upper bounds for recursive quantization
Authors:
Rancy El Nmeir,
Gilles Pagès
Abstract:
We establish upper bounds for the $L^p$-quantization error, p in (1, 2+d), induced by the recursive Markovian quantization of a d-dimensional diffusion discretized via the Euler scheme. We introduce a hybrid recursive quantization scheme, easier to implement in the high-dimensional framework, and establish upper bounds to the corresponding $L^p$-quantization error. To take advantage of these exten…
▽ More
We establish upper bounds for the $L^p$-quantization error, p in (1, 2+d), induced by the recursive Markovian quantization of a d-dimensional diffusion discretized via the Euler scheme. We introduce a hybrid recursive quantization scheme, easier to implement in the high-dimensional framework, and establish upper bounds to the corresponding $L^p$-quantization error. To take advantage of these extensions, we propose a time discretization scheme and a recursive quantization-based discretization scheme associated to a reflected Backward Stochastic Differential Equation and estimate $L^p$-error bounds induced by the space approximation. We will explain how to numerically compute the solution of the reflected BSDE relying on the recursive quantization and compare it to other types of quantization.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Monotone convex order for the McKean-Vlasov processes
Authors:
Yating Liu,
Gilles Pagès
Abstract:
In this paper, we establish the monotone convex order between two $\mathbb{R}$-valued McKean-Vlasov processes $X=(X_t)_{t\in [0, T]}$ and $Y=(Y_t)_{t\in [0, T]}$ defined on a filtered probability space $(Ω, \mathcal{F}, (\mathcal{F}_{t})_{t\geq0}, \mathbb{P})$ by \begin{align} &dX_{t}=b(t, X_{t}, μ_{t})dt+σ(t, X_{t}, μ_{t})dB_{t}, \quad X_{0}\in L^{p}(\mathbb{P})\; \text{with}\; p\geq 2,\nonumber\…
▽ More
In this paper, we establish the monotone convex order between two $\mathbb{R}$-valued McKean-Vlasov processes $X=(X_t)_{t\in [0, T]}$ and $Y=(Y_t)_{t\in [0, T]}$ defined on a filtered probability space $(Ω, \mathcal{F}, (\mathcal{F}_{t})_{t\geq0}, \mathbb{P})$ by \begin{align} &dX_{t}=b(t, X_{t}, μ_{t})dt+σ(t, X_{t}, μ_{t})dB_{t}, \quad X_{0}\in L^{p}(\mathbb{P})\; \text{with}\; p\geq 2,\nonumber\\ &dY_{t}=β(t, Y_{t}, ν_{t})dt+θ(t, \,Y_{t}, ν_{t})\,dB_{t}, \,\quad Y_{0}\in L^{p}(\mathbb{P}), \nonumber \end{align} where $\forall\, t\in [0, T],\: μ_{t}=\mathbb{P}\circ X_{t}^{-1}, \:ν_{t}=\mathbb{P}\circ Y_{t}^{-1}. $ If we make the convexity and monotony assumption (only) on $b$ and $|σ|$ and if $b\leq β$ and $|σ|\leq |θ|$, then the monotone convex order for the initial random variable $X_0\preceq_{\,\text{mcv}} Y_0$ can be propagated to the whole path of processes $X$ and $Y$. That is, if we consider a non-decreasing convex functional $F$ defined on the path space with polynomial growth, we have $\mathbb{E}\, F(X)\leq \mathbb{E}\, F(Y)$; for a non-decreasing convex functional $G$ defined on the product space involving the path space and its marginal distribution space, we have $\mathbb{E}\, G(X, (μ_{t})_{t\in [0, T]})\leq \mathbb{E}\, G(Y, (ν_{t})_{t\in [0, T]})$ under appropriate conditions. The symmetric setting is also valid, that is, if $Y_0\preceq_{\,\text{mcv}} X_0$ and $|θ|\leq |σ|$, then $\mathbb{E}\, F(Y)\leq \mathbb{E}\, F(X)$ and $\mathbb{E}\, G(Y, (ν_{t})_{t\in [0, T]})\leq \mathbb{E}\, G(X, (μ_{t})_{t\in [0, T]})$. The proof is based on several forward and backward dynamic programming principle and the convergence of the truncated Euler scheme of the McKean-Vlasov equation.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Unadjusted Langevin algorithm with multiplicative noise: Total variation and Wasserstein bounds
Authors:
Gilles Pages,
Fabien Panloup
Abstract:
In this paper, we focus on non-asymptotic bounds related to the Euler scheme of an ergodic diffusion with a possibly multiplicative diffusion term (non-constant diffusion coefficient). More precisely, the objective of this paper is to control the distance of the standard Euler scheme with decreasing step ({usually called Unadjusted Langevin Algorithm in the Monte Carlo literature}) to the invaria…
▽ More
In this paper, we focus on non-asymptotic bounds related to the Euler scheme of an ergodic diffusion with a possibly multiplicative diffusion term (non-constant diffusion coefficient). More precisely, the objective of this paper is to control the distance of the standard Euler scheme with decreasing step ({usually called Unadjusted Langevin Algorithm in the Monte Carlo literature}) to the invariant distribution of such an ergodic diffusion. In an appropriate Lyapunov setting and under {uniform} ellipticity assumptions on the diffusion coefficient, we establish (or improve) such bounds for Total Variation and $L^1$-Wasserstein distances in both multiplicative and additive and frameworks. These bounds rely on weak error expansions using {Stochastic Analysis} adapted to decreasing step setting.
△ Less
Submitted 22 September, 2022; v1 submitted 28 December, 2020;
originally announced December 2020.
-
Quantization and martingale couplings
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
Quantization provides a very natural way to preserve the convex order when approximating two ordered probability measures by two finitely supported ones. Indeed, when the convex order dominating original probability measure is compactly supported, it is smaller than any of its dual quantizations while the dominated original measure is greater than any of its stationary (and therefore any of its op…
▽ More
Quantization provides a very natural way to preserve the convex order when approximating two ordered probability measures by two finitely supported ones. Indeed, when the convex order dominating original probability measure is compactly supported, it is smaller than any of its dual quantizations while the dominated original measure is greater than any of its stationary (and therefore any of its optimal) quadratic primal quantization. Moreover, the quantization errors then correspond to martingale couplings between each original probability measure and its quantization. This permits to prove that any martingale coupling between the original probability measures can be approximated by a martingale coupling between their quantizations in Wassertein distance with a rate given by the quantization errors but also in the much finer adapted Wassertein distance. As a consequence, while the stability of (Weak) Martingale Optimal Transport problems with respect to the marginal distributions has only been established in dimension $1$ so far, their value function computed numerically for the quantized marginals converges in any dimension to the value for the original probability measures as the numbers of quantization points go to $\infty$.
△ Less
Submitted 18 December, 2020;
originally announced December 2020.
-
Optimal dual quantizers of $1D$ $\log$-concave distributions: uniqueness and Lloyd like algorithm
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
We establish for dual quantization the counterpart of Kieffer's uniqueness result for compactly supported one dimensional probability distributions having a $\log$-concave density (also called strongly unimodal): for such distributions, $L^r$-optimal dual quantizers are unique at each level $N$, the optimal grid being the unique critical point of the quantization error. An example of non-strongly…
▽ More
We establish for dual quantization the counterpart of Kieffer's uniqueness result for compactly supported one dimensional probability distributions having a $\log$-concave density (also called strongly unimodal): for such distributions, $L^r$-optimal dual quantizers are unique at each level $N$, the optimal grid being the unique critical point of the quantization error. An example of non-strongly unimodal distribution for which uniqueness of critical points fails is exhibited.
In the quadratic $r=2$ case, we propose an algorithm to compute the unique optimal dual quantizer. It provides a counterpart of Lloyd's method~I algorithm in a Voronoi framework. Finally semi-closed forms of $L^r$-optimal dual quantizers are established for power distributions on compacts intervals and truncated exponential distributions.
△ Less
Submitted 21 October, 2020;
originally announced October 2020.
-
Functional convex order for the scaled McKean-Vlasov processes
Authors:
Yating Liu,
Gilles Pagès
Abstract:
We establish the functional convex order results for two scaled McKean-Vlasov processes $X=(X_{t})_{t\in[0, T]}$ and $Y=(Y_{t})_{t\in[0, T]}$ defined on a filtered probability space $(Ω, \mathcal{F}, (\mathcal{F}_{t})_{t\geq0}, \mathbb{P})$ by \[\begin{cases} dX_{t}= b(t, X_{t}, μ_{t})dt+σ(t, X_{t}, μ_{t})dB_{t}, \;\;X_{0}\in L^{p}(\mathbb{P}),\\ dY_{t}\,= b(t, \,Y_{t}\,,\, ν_{t})dt+θ(t, \,Y_{t}\,…
▽ More
We establish the functional convex order results for two scaled McKean-Vlasov processes $X=(X_{t})_{t\in[0, T]}$ and $Y=(Y_{t})_{t\in[0, T]}$ defined on a filtered probability space $(Ω, \mathcal{F}, (\mathcal{F}_{t})_{t\geq0}, \mathbb{P})$ by \[\begin{cases} dX_{t}= b(t, X_{t}, μ_{t})dt+σ(t, X_{t}, μ_{t})dB_{t}, \;\;X_{0}\in L^{p}(\mathbb{P}),\\ dY_{t}\,= b(t, \,Y_{t}\,,\, ν_{t})dt+θ(t, \,Y_{t}\,,\, ν_{t})dB_{t}, \;\;Y_{0}\in L^{p}(\mathbb{P}), \end{cases}\]
where $p\geq2$, for every $ t\in[0, T]$, $μ_t$, $ν_t$ denote the probability distribution of $X_t$, $Y_t$ respectively and the drift coefficient $b(t, x, μ)$ is affine in $x$ (scaled). If we make the convexity and monotony assumption (only) on $σ$ and if $σ\preceqθ$ with respect to the partial matrix order, the convex order for the initial random variable $X_0 \preceq_{\,cv} Y_0$ can be propagated to the whole path of process $X$ and $Y$. That is, if we consider a convex functional $F$ defined on the path space with polynomial growth, we have $\mathbb{E}F(X)\leq\mathbb{E}F(Y)$; for a convex functional $G$ defined on the product space involving the path space and its marginal distribution space, we have $\mathbb{E}\,G\big(X, (μ_t)_{t\in[0, T]}\big)\leq \mathbb{E}\,G\big(Y, (ν_t)_{t\in[0, T]}\big)$ under appropriate conditions. The symmetric setting is also valid, that is, if $θ\preceq σ$ and $Y_0 \leq X_0$ with respect to the convex order, then $\mathbb{E}\,F(Y) \leq \mathbb{E}\,F(X)$ and $\mathbb{E}\,G\big(Y, (ν_t)_{t\in[0, T]}\big)\leq \mathbb{E}\,G(X, (μ_t)_{t\in[0, T]})$. The proof is based on several forward and backward dynamic programming principles and the convergence of the Euler scheme of the McKean-Vlasov equation.
△ Less
Submitted 5 January, 2022; v1 submitted 6 May, 2020;
originally announced May 2020.
-
New approach to greedy vector quantization
Authors:
Rancy El Nmeir,
Harald Luschgy,
Gilles Pagès
Abstract:
We extend some rate of convergence results of greedy quantization sequences already investigated in arXiv:1409.0732 [math.PR]. We show, for a more general class of distributions satisfying a certain control, that the quantization error of these sequences have an $n^{-\frac1d}$ rate of convergence and that the distortion mismatch property is satisfied. We will give some non-asymptotic Pierce type e…
▽ More
We extend some rate of convergence results of greedy quantization sequences already investigated in arXiv:1409.0732 [math.PR]. We show, for a more general class of distributions satisfying a certain control, that the quantization error of these sequences have an $n^{-\frac1d}$ rate of convergence and that the distortion mismatch property is satisfied. We will give some non-asymptotic Pierce type estimates. The recursive character of greedy vector quantization allows some improvements to the algorithm of computation of these sequences and the implementation of a recursive formula to quantization-based numerical integration. Furthermore, we establish further properties of sub-optimality of greedy quantization sequences.
△ Less
Submitted 31 March, 2020;
originally announced March 2020.
-
Stationary Heston model: Calibration and Pricing of exotics using Product Recursive Quantization
Authors:
Vincent Lemaire,
Thibaut Montes,
Gilles Pagès
Abstract:
A major drawback of the Standard Heston model is that its implied volatility surface does not produce a steep enough smile when looking at short maturities. For that reason, we introduce the Stationary Heston model where we replace the deterministic initial condition of the volatility by its invariant measure and show, based on calibrated parameters, that this model produce a steeper smile for sho…
▽ More
A major drawback of the Standard Heston model is that its implied volatility surface does not produce a steep enough smile when looking at short maturities. For that reason, we introduce the Stationary Heston model where we replace the deterministic initial condition of the volatility by its invariant measure and show, based on calibrated parameters, that this model produce a steeper smile for short maturities than the Standard Heston model. We also present numerical solution based on Product Recursive Quantization for the evaluation of exotic options (Bermudan and Barrier options).
△ Less
Submitted 10 July, 2020; v1 submitted 9 January, 2020;
originally announced January 2020.
-
Quantization-based Bermudan option pricing in the $FX$ world
Authors:
Jean-Michel Fayolle,
Vincent Lemaire,
Thibaut Montes,
Gilles Pagès
Abstract:
This paper proposes two numerical solution based on Product Optimal Quantization for the pricing of Foreign Echange (FX) linked long term Bermudan options e.g. Bermudan Power Reverse Dual Currency options, where we take into account stochastic domestic and foreign interest rates on top of stochastic FX rate, hence we consider a 3-factor model. For these two numerical methods, we give an estimation…
▽ More
This paper proposes two numerical solution based on Product Optimal Quantization for the pricing of Foreign Echange (FX) linked long term Bermudan options e.g. Bermudan Power Reverse Dual Currency options, where we take into account stochastic domestic and foreign interest rates on top of stochastic FX rate, hence we consider a 3-factor model. For these two numerical methods, we give an estimation of the $L^2$-error induced by such approximations and we illustrate them with market-based examples that highlight the speed of such methods.
△ Less
Submitted 1 May, 2020; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Convex order, quantization and monotone approximations of ARCH models
Authors:
Benjamin Jourdain,
Gilles Pagès
Abstract:
We are interested in proposing approximations of a sequence of probability measures in the convex order by finitely supported probability measures still in the convex order. We propose to alternate transitions according to a martingale Markov kernel mapping a probability measure in the sequence to the next and dual quantization steps. In the case of ARCH models and in particular of the Euler schem…
▽ More
We are interested in proposing approximations of a sequence of probability measures in the convex order by finitely supported probability measures still in the convex order. We propose to alternate transitions according to a martingale Markov kernel mapping a probability measure in the sequence to the next and dual quantization steps. In the case of ARCH models and in particular of the Euler scheme of a driftless Brownian diffusion, the noise has to be truncated to enable the dual quantization step. We analyze the error between the original ARCH model and its approximation with truncated noise and exhibit conditions under which the latter is dominated by the former in the convex order at the level of sample-paths. Last, we analyse the error of the scheme combining the dual quantization steps with truncation of the noise according to primal quantization.
△ Less
Submitted 21 October, 2020; v1 submitted 2 October, 2019;
originally announced October 2019.
-
New Weak Error bounds and expansions for Optimal Quantization
Authors:
Vincent Lemaire,
Thibaut Montes,
Gilles Pagès
Abstract:
We propose new weak error bounds and expansion in dimension one for optimal quantization-based cubature formula for different classes of functions, such that piecewise affine functions, Lipschitz convex functions or differentiable function with piecewise-defined locally Lipschitz or $α$-Hölder derivatives. This new results rest on the local behaviors of optimal quantizers, the $L^r$-$L^s$ distribu…
▽ More
We propose new weak error bounds and expansion in dimension one for optimal quantization-based cubature formula for different classes of functions, such that piecewise affine functions, Lipschitz convex functions or differentiable function with piecewise-defined locally Lipschitz or $α$-Hölder derivatives. This new results rest on the local behaviors of optimal quantizers, the $L^r$-$L^s$ distribution mismatch problem and Zador's Theorem. This new expansion supports the definition of a Richardson-Romberg extrapolation yielding a better rate of convergence for the cubature formula. An extension of this expansion is then proposed in higher dimension for the first time. We then propose a novel variance reduction method for Monte Carlo estimators, based on one dimensional optimal quantizers.
△ Less
Submitted 1 May, 2020; v1 submitted 25 March, 2019;
originally announced March 2019.
-
Convergence rate of optimal quantization grids and application to empirical measure
Authors:
Yating Liu,
Gilles Pagès
Abstract:
We study the convergence rate of the optimal quantization for a probability measure sequence $(μ_{n})_{n\in\mathbb{N}^{*}}$ on $\mathbb{R}^{d}$ converging in the Wasserstein distance in two aspects: the first one is the convergence rate of optimal quantizer $x^{(n)}\in(\mathbb{R}^{d})^{K}$ of $μ_{n}$ at level $K$; the other one is the convergence rate of the distortion function valued at…
▽ More
We study the convergence rate of the optimal quantization for a probability measure sequence $(μ_{n})_{n\in\mathbb{N}^{*}}$ on $\mathbb{R}^{d}$ converging in the Wasserstein distance in two aspects: the first one is the convergence rate of optimal quantizer $x^{(n)}\in(\mathbb{R}^{d})^{K}$ of $μ_{n}$ at level $K$; the other one is the convergence rate of the distortion function valued at $x^{(n)}$, called the "performance" of $x^{(n)}$. Moreover, we also study the mean performance of the optimal quantization for the empirical measure of a distribution $μ$ with finite second moment but possibly unbounded support. As an application, we show that the mean performance for the empirical measure of the multidimensional normal distribution $\mathcal{N}(m, Σ)$ and of distributions with hyper-exponential tails behave like $\mathcal{O}(\frac{\log n}{\sqrt{n}})$. This extends the results from [BDL08] obtained for compactly supported distribution. We also derive an upper bound which is sharper in the quantization level $K$ but suboptimal in $n$ by applying results in [FG15].
△ Less
Submitted 19 February, 2020; v1 submitted 20 November, 2018;
originally announced November 2018.
-
A general weak and strong error analysis of the recursive quantization with an application to jump diffusions
Authors:
Gilles Pagès,
Abass Sagna
Abstract:
Observing that the recent developments of the recursive (product) quantization method induces a family of Markov chains which includes all standard discretization schemes of diffusions processes , we propose to compute a general error bound induced by the recursive quantization schemes using this generic markovian structure. Furthermore, we compute a marginal weak error for the recursive quantizat…
▽ More
Observing that the recent developments of the recursive (product) quantization method induces a family of Markov chains which includes all standard discretization schemes of diffusions processes , we propose to compute a general error bound induced by the recursive quantization schemes using this generic markovian structure. Furthermore, we compute a marginal weak error for the recursive quantization. We also extend the recursive quantization method to the Euler scheme associated to diffusion processes with jumps, which still have this markovian structure, and we say how to compute the recursive quantization and the associated weights and transition weights.
△ Less
Submitted 29 August, 2018;
originally announced August 2018.
-
Weak error for nested Multilevel Monte Carlo
Authors:
Daphné Giorgi,
Vincent Lemaire,
Gilles Pagès
Abstract:
This article discusses MLMC estimators with and without weights, applied to nested expectations of the form E [f (E [F (Y, Z)|Y ])]. More precisely, we are interested on the assumptions needed to comply with the MLMC framework, depending on whether the payoff function f is smooth or not. A new result to our knowledge is given when f is not smooth in the development of the weak error at an order hi…
▽ More
This article discusses MLMC estimators with and without weights, applied to nested expectations of the form E [f (E [F (Y, Z)|Y ])]. More precisely, we are interested on the assumptions needed to comply with the MLMC framework, depending on whether the payoff function f is smooth or not. A new result to our knowledge is given when f is not smooth in the development of the weak error at an order higher than 1, which is needed for a successful use of MLMC estimators with weights.
△ Less
Submitted 20 June, 2018;
originally announced June 2018.
-
Characterization of probability distribution convergence in Wasserstein distance by $L^{p}$-quantization error function
Authors:
Yating Liu,
Gilles Pagès
Abstract:
We establish conditions to characterize probability measures by their $L^{p}$-quantization error functions in both $\mathbb{R}^{d}$ and Hilbert settings. This characterization is two-fold: static (identity of two distributions) and dynamic (convergence for the $L^p$-Wasserstein distance). We first propose a criterion on the quantization level $N$, valid for any norm on $\mathbb{R}^{d}$ and any ord…
▽ More
We establish conditions to characterize probability measures by their $L^{p}$-quantization error functions in both $\mathbb{R}^{d}$ and Hilbert settings. This characterization is two-fold: static (identity of two distributions) and dynamic (convergence for the $L^p$-Wasserstein distance). We first propose a criterion on the quantization level $N$, valid for any norm on $\mathbb{R}^{d}$ and any order $p$ based on a geometrical approach involving the Voronoï diagram. Then, we prove that in the $L^2$-case on a (separable) Hilbert space, the condition on the level $N$ can be reduced to $N=2$, which is optimal. More quantization based characterization cases on dimension 1 and a discussion of the completeness of a distance defined by the quantization error function can be found in the end of this paper.
△ Less
Submitted 27 March, 2019; v1 submitted 18 January, 2018;
originally announced January 2018.
-
Discretization of the Ergodic Functional Central Limit Theorem
Authors:
Gilles Pagès,
Clément Rey
Abstract:
In this paper, we study the discretization of the ergodic Functional Central Limit Theorem (CLT) established by Bhattacharya (see \cite{Bhattacharya_1982}) which states the following: Given a stationary and ergodic Markov process $(X_t)_{t \geqslant 0}$ with unique invariant measure $ν$ and infinitesimal generator $A$, then, for every smooth enough function $f$,…
▽ More
In this paper, we study the discretization of the ergodic Functional Central Limit Theorem (CLT) established by Bhattacharya (see \cite{Bhattacharya_1982}) which states the following: Given a stationary and ergodic Markov process $(X_t)_{t \geqslant 0}$ with unique invariant measure $ν$ and infinitesimal generator $A$, then, for every smooth enough function $f$, $(n^{1/2} \frac{1}{n}\int_0^{nt} Af(X_s)ds)_{t \geqslant 0}$ converges in distribution towards the distribution of the process $(\sqrt{-2 \langle f, Af \rangle_ν} W_{t})_{t \geqslant 0}$ with $(W_{t})_{t \geqslant 0}$ a Wiener process. In particular, we consider the marginal distribution at fixed $t=1$, and we show that when $\int_0^{n} Af(X_s)ds$ is replaced by a well chosen discretization of the time integral with order $q$ ($e.g.$ Riemann discretization in the case $q=1$), then the CLT still holds but with rate $n^{q/(2q+1)}$ instead of $n^{1/2}$. Moreover, our results remain valid when $(X_t)_{t \geqslant 0}$ is replaced by a $q$-weak order approximation (not necessarily stationary). This paper presents both the discretization method of order $q$ for the time integral and the $q$-order ergodic CLT we derive from them. We finally propose applications concerning the first order CLT for the approximation of Markov Brownian diffusion stationary regimes with Euler scheme (where we recover existing results from the literature) and the second order CLT for the approximation of Brownian diffusion stationary regimes using Talay's scheme \cite{Talay_1990} of weak order two.
△ Less
Submitted 7 March, 2025; v1 submitted 16 January, 2018;
originally announced January 2018.
-
Recursive computation of the invariant distributions of Feller processes: Revisited examples and new applications
Authors:
Gilles Pagès,
Clément Rey
Abstract:
In this paper, we show that the abstract framework developed in Pages & Rey (2017) and inspired by Lamberton & Pages (2002) can be used to build invariant distributions for Brownian diffusion processes using the Milstein scheme and for diffusion processes with censored jump using the Euler scheme. Both studies rely on a weakly mean reverting setting for both cases. For the Milstein scheme we prove…
▽ More
In this paper, we show that the abstract framework developed in Pages & Rey (2017) and inspired by Lamberton & Pages (2002) can be used to build invariant distributions for Brownian diffusion processes using the Milstein scheme and for diffusion processes with censored jump using the Euler scheme. Both studies rely on a weakly mean reverting setting for both cases. For the Milstein scheme we prove the convergence for test functions with polynomial (Wasserstein convergence) and exponential growth. For the Euler scheme of diffusion processes with censored jump we prove the convergence for test functions with polynomial growth.
△ Less
Submitted 16 January, 2018; v1 submitted 11 December, 2017;
originally announced December 2017.
-
Recursive computation of the invariant distribution of Markov and Feller processes
Authors:
Gilles Pagès,
Clément Rey
Abstract:
This paper provides a general and abstract approach to approximate ergodic regimes of Markov and Feller processes. More precisely, we show that the recursive algorithm presented in Lamberton & Pages (2002) and based on simulation algorithms of stochastic schemes with decreasing step can be used to build invariant measures for general Markov and Feller processes. We also propose applications in thr…
▽ More
This paper provides a general and abstract approach to approximate ergodic regimes of Markov and Feller processes. More precisely, we show that the recursive algorithm presented in Lamberton & Pages (2002) and based on simulation algorithms of stochastic schemes with decreasing step can be used to build invariant measures for general Markov and Feller processes. We also propose applications in three different configurations: Approximation of Markov switching Brownian diffusion ergodic regimes using Euler scheme, approximation of Markov Brownian diffusion ergodic regimes with Milstein scheme and approximation of general diffusions with jump components ergodic regimes.
△ Less
Submitted 16 January, 2018; v1 submitted 13 March, 2017;
originally announced March 2017.
-
Limit theorems for weighted and regular Multilevel estimators
Authors:
Daphné Giorgi,
Vincent Lemaire,
Gilles Pagès
Abstract:
We aim at analyzing in terms of a.s. convergence and weak rate the performances of the Multilevel Monte Carlo estimator (MLMC) introduced in [Gil08] and of its weighted version, the Multilevel Richardson Romberg estimator (ML2R), introduced in [LP14]. These two estimators permit to compute a very accurate approximation of $I_0 = \mathbb{E}[Y_0]$ by a Monte Carlo type estimator when the (non-degene…
▽ More
We aim at analyzing in terms of a.s. convergence and weak rate the performances of the Multilevel Monte Carlo estimator (MLMC) introduced in [Gil08] and of its weighted version, the Multilevel Richardson Romberg estimator (ML2R), introduced in [LP14]. These two estimators permit to compute a very accurate approximation of $I_0 = \mathbb{E}[Y_0]$ by a Monte Carlo type estimator when the (non-degenerate) random variable $Y_0 \in L^2(\mathbb{P})$ cannot be simulated (exactly) at a reasonable computational cost whereas a family of simulatable approximations $(Y_h)_{h \in \mathcal{H}}$ is available. We will carry out these investigations in an abstract framework before applying our results, mainly a Strong Law of Large Numbers and a Central Limit Theorem, to some typical fields of applications: discretization schemes of diffusions and nested Monte Carlo.
△ Less
Submitted 16 November, 2016;
originally announced November 2016.
-
Weighted Multilevel Langevin Simulation of Invariant Measures
Authors:
Gilles Pagès,
Fabien Panloup
Abstract:
We investigate a weighted Multilevel Richardson-Romberg extrapolation for the ergodic approximation of invariant distributions of diffusions adapted from the one introduced in~[Lemaire-Pagès, 2013] for regular Monte Carlo simulation. In a first result, we prove under weak confluence assumptions on the diffusion, that for any integer $R\ge2$, the procedure allows us to attain a rate…
▽ More
We investigate a weighted Multilevel Richardson-Romberg extrapolation for the ergodic approximation of invariant distributions of diffusions adapted from the one introduced in~[Lemaire-Pagès, 2013] for regular Monte Carlo simulation. In a first result, we prove under weak confluence assumptions on the diffusion, that for any integer $R\ge2$, the procedure allows us to attain a rate $n^{\frac{R}{2R+1}}$ whereas the original algorithm convergence is at a weak rate $n^{1/3}$. Furthermore, this is achieved without any explosion of the asymptotic variance. In a second part, under stronger confluence assumptions and with the help of some second order expansions of the asymptotic error, we go deeper in the study by optimizing the choice of the parameters involved by the method. In particular, for a given $\varepsilon\textgreater{}0$, we exhibit some semi-explicit parameters for which the number of iterations of the Euler scheme required to attain a Mean-Squared Error lower than $\varepsilon^2$ is about $\varepsilon^{-2}\log(\varepsilon^{-1})$. Finally, we numerically this Multilevel Langevin estimator on several examples including the simple one-dimensional Ornstein-Uhlenbeck process but also on a high dimensional diffusion motivated by a statistical problem. These examples confirm the theoretical efficiency of the method.
△ Less
Submitted 4 July, 2016;
originally announced July 2016.
-
Non-Asymptotic Gaussian Estimates for the Recursive Approximation of the Invariant Measure of a Diffusion
Authors:
Igor Honoré,
Stephane Menozzi,
Gilles Pagès
Abstract:
We obtain non-asymptotic Gaussian concentration bounds for the difference between the invariant measure $ν$ of an ergodic Brownian diffusion process and the empirical distribution of an approximating scheme with decreasing time step along a suitable class of (smooth enough) test functions f such that f -- $ν$(f) is a coboundary of the infinitesimal generator. We show that these bounds can still be…
▽ More
We obtain non-asymptotic Gaussian concentration bounds for the difference between the invariant measure $ν$ of an ergodic Brownian diffusion process and the empirical distribution of an approximating scheme with decreasing time step along a suitable class of (smooth enough) test functions f such that f -- $ν$(f) is a coboundary of the infinitesimal generator. We show that these bounds can still be improved when the (squared) Fr{ö}benius norm of the diffusion coefficient lies in this class. We apply these bounds to design computable non-asymptotic confidence intervals for the approximating scheme. As a theoretical application, we finally derive non-asymptotic deviation bounds for the almost sure Central Limit Theorem.
△ Less
Submitted 25 May, 2018; v1 submitted 27 May, 2016;
originally announced May 2016.
-
Product Markovian quantization of an R^d -valued Euler scheme of a diffusion process with applications to finance
Authors:
Fiorin Lucio,
Gilles Pagès,
Abass Sagna
Abstract:
We introduce a new approach to quantize the Euler scheme of an $\mathbb{R}^d$-valued diffusion process. This method is based on a Markovian and componentwise product quantization and allows us, from a numerical point of view, to speak of {\em fast online quantization} in dimension greater than one since the product quantization of the Euler scheme of the diffusion process and its companion wei…
▽ More
We introduce a new approach to quantize the Euler scheme of an $\mathbb{R}^d$-valued diffusion process. This method is based on a Markovian and componentwise product quantization and allows us, from a numerical point of view, to speak of {\em fast online quantization} in dimension greater than one since the product quantization of the Euler scheme of the diffusion process and its companion weights and transition probabilities may be computed quite instantaneously. We show that the resulting quantization process is a Markov chain, then, we compute the associated companion weights and transition probabilities from (semi-) closed formulas. From the analytical point of view, we show that the induced quantization errors at the $k$-th discretization step $t_k$ is a cumulative of the marginal quantization error up to time $t_k$. Numerical experiments are performed for the pricing of a Basket call option, for the pricing of a European call option in a Heston model and for the approximation of the solution of backward stochastic differential equations to show the performances of the method.
△ Less
Submitted 24 March, 2017; v1 submitted 5 November, 2015;
originally announced November 2015.
-
Improved error bounds for quantization based numerical schemes for BSDE and nonlinear filtering
Authors:
Gilles Pagès
Abstract:
We take advantage of recent and new results on optimal quantization theory to improve the quadratic optimal quantization error bounds for backward stochastic differential equations (BSDE) and nonlinear filtering problems. For both problems, a first improvement relies on a Pythagoras like Theorem for quantized conditional expectation. While allowing for some locally Lipschitz functions conditiona…
▽ More
We take advantage of recent and new results on optimal quantization theory to improve the quadratic optimal quantization error bounds for backward stochastic differential equations (BSDE) and nonlinear filtering problems. For both problems, a first improvement relies on a Pythagoras like Theorem for quantized conditional expectation. While allowing for some locally Lipschitz functions conditional densities in nonlinear filtering, the analysis of the error brings into playing a new robustness result about optimal quantizers, the so-called distortion mismatch property: $L^r$-quadratic optimal quantizers of size $N$ behave in $L^s$ in term of mean error at the same rate $N^{-\frac 1d}$, $0<s< r+d$.
△ Less
Submitted 25 July, 2017; v1 submitted 5 October, 2015;
originally announced October 2015.
-
Greedy vector quantization
Authors:
Harald Luschgy,
Gilles Pagès
Abstract:
We investigate the greedy version of the $L^p$-optimal vector quantization problem for an $\mathbb{R}^d$-valued random vector $X\!\in L^p$. We show the existence of a sequence $(a_N)_{N\ge 1}$ such that $a_N$ minimizes $a\mapsto\big \|\min_{1\le i\le N-1}|X-a_i|\wedge |X-a|\big\|_{L^p}$ ($L^p$-mean quantization error at level $N$ induced by $(a_1,\ldots,a_{N-1},a)$). We show that this sequence pro…
▽ More
We investigate the greedy version of the $L^p$-optimal vector quantization problem for an $\mathbb{R}^d$-valued random vector $X\!\in L^p$. We show the existence of a sequence $(a_N)_{N\ge 1}$ such that $a_N$ minimizes $a\mapsto\big \|\min_{1\le i\le N-1}|X-a_i|\wedge |X-a|\big\|_{L^p}$ ($L^p$-mean quantization error at level $N$ induced by $(a_1,\ldots,a_{N-1},a)$). We show that this sequence produces $L^p$-rate optimal $N$-tuples $a^{(N)}=(a_1,\ldots,a_{_N})$ ($i.e.$ the $L^p$-mean quantization error at level $N$ induced by $a^{(N)}$ goes to $0$ at rate $N^{-\frac 1d}$). Greedy optimal sequences also satisfy, under natural additional assumptions, the distortion mismatch property: the $N$-tuples $a^{(N)}$ remain rate optimal with respect to the $L^q$-norms, $p\le q <p+d$. Finally, we propose optimization methods to compute greedy sequences, adapted from usual Lloyd's I and Competitive Learning Vector Quantization procedures, either in their deterministic (implementable when $d=1$) or stochastic versions.
△ Less
Submitted 21 August, 2015; v1 submitted 2 September, 2014;
originally announced September 2014.
-
Convex order for path-dependent derivatives: a dynamic programming approach
Authors:
Gilles Pagès
Abstract:
We investigate the (functional) convex order of for various continuous martingale processes, either with respect to their diffusions coefficients for Lévy-driven SDEs or their integrands for stochastic integrals. Main results are bordered by counterexamples. Various upper and lower bounds can be derived for path wise European option prices in local volatility models. In view of numerical applicati…
▽ More
We investigate the (functional) convex order of for various continuous martingale processes, either with respect to their diffusions coefficients for Lévy-driven SDEs or their integrands for stochastic integrals. Main results are bordered by counterexamples. Various upper and lower bounds can be derived for path wise European option prices in local volatility models. In view of numerical applications, we adopt a systematic (and symmetric) methodology: (a) propagate the convexity in a {\em simulatable} dominating/dominated discrete time model through a backward induction (or linear dynamical principle); (b) Apply functional weak convergence results to numerical schemes/time discretizations of the continuous time martingale satisfying (a) in order to transfer the convex order properties. Various bounds are derived for European options written on convex pathwise dependent payoffs. We retrieve and extend former results obtains by several authors since the seminal 1985 paper by Hajek . In a second part, we extend this approach to Optimal Stopping problems using a that the Snell envelope satisfies (a') a Backward Dynamical Programming Principle to propagate convexity in discrete time; (b') satisfies abstract convergence results under non-degeneracy assumption on filtrations. Applications to the comparison of American option prices on convex pathwise payoff processes are given obtained by a purely probabilistic arguments.
△ Less
Submitted 23 July, 2014;
originally announced July 2014.
-
Multilevel Richardson-Romberg extrapolation
Authors:
Vincent Lemaire,
Gilles Pagès
Abstract:
We propose and analyze a Multilevel Richardson-Romberg (MLRR) estimator which combines the higher order bias cancellation of the Multistep Richardson-Romberg method introduced in [Pa07] and the variance control resulting from the stratification introduced in the Multilevel Monte Carlo (MLMC) method (see [Hei01, Gi08]). Thus, in standard frameworks like discretization schemes of diffusion processes…
▽ More
We propose and analyze a Multilevel Richardson-Romberg (MLRR) estimator which combines the higher order bias cancellation of the Multistep Richardson-Romberg method introduced in [Pa07] and the variance control resulting from the stratification introduced in the Multilevel Monte Carlo (MLMC) method (see [Hei01, Gi08]). Thus, in standard frameworks like discretization schemes of diffusion processes, the root mean squared error (RMSE) $\varepsilon > 0$ can be achieved with our MLRR estimator with a global complexity of $\varepsilon^{-2} \log(1/\varepsilon)$ instead of $\varepsilon^{-2} (\log(1/\varepsilon))^2$ with the standard MLMC method, at least when the weak error $\mathbf{E}[Y_h]-\mathbf{E}[Y_0]$ of the biased implemented estimator $Y_h$ can be expanded at any order in $h$ and $\|Y_h - Y_0\|_2 = O(h^{\frac{1}{2}})$. The MLRR estimator is then halfway between a regular MLMC and a virtual unbiased Monte Carlo. When the strong error $\|Y_h - Y_0\|_2 = O(h^{\fracβ{2}})$, $β< 1$, the gain of MLRR over MLMC becomes even more striking. We carry out numerical simulations to compare these estimators in two settings: vanilla and path-dependent option pricing by Monte Carlo simulation and the less classical Nested Monte Carlo simulation.
△ Less
Submitted 4 July, 2016; v1 submitted 6 January, 2014;
originally announced January 2014.
-
Pointwise convergence of the Lloyd algorithm in higher dimension
Authors:
Gilles Pagès,
Jun Yu
Abstract:
We establish the pointwise convergence of the iterative Lloyd algorithm, also known as $k$-means algorithm, when the quadratic quantization error of the starting grid (with size $N\ge 2$) is lower than the minimal quantization error with respect to the input distribution is lower at level $N-1$. Such a protocol is known as the splitting method and allows for convergence even when the input distrib…
▽ More
We establish the pointwise convergence of the iterative Lloyd algorithm, also known as $k$-means algorithm, when the quadratic quantization error of the starting grid (with size $N\ge 2$) is lower than the minimal quantization error with respect to the input distribution is lower at level $N-1$. Such a protocol is known as the splitting method and allows for convergence even when the input distribution has an unbounded support. We also show under very light assumption that the resulting limiting grid still has full size $N$. These results are obtained without continuity assumption on the input distribution. A variant of the procedure taking advantage of the asymptotic of the optimal quantizer radius is proposed which always guarantees the boundedness of the iterated grids.
△ Less
Submitted 31 December, 2013;
originally announced January 2014.
-
Nonlinear Randomized Urn Models: a Stochastic Approximation Viewpoint
Authors:
Sophie Laruelle,
Gilles Pagès
Abstract:
This paper extends the link between stochastic approximation (SA) theory and randomized urn models developed in Laruelle, Pag{è}s (2013), and their applications to clinical trials introduced in Bai, HU (1999,2005) and Bai, Hu, Shen (2002). We no longer assume that the drawing rule is uniform among the balls of the urn (which contains d colors), but can be reinforced by a function f. This is a…
▽ More
This paper extends the link between stochastic approximation (SA) theory and randomized urn models developed in Laruelle, Pag{è}s (2013), and their applications to clinical trials introduced in Bai, HU (1999,2005) and Bai, Hu, Shen (2002). We no longer assume that the drawing rule is uniform among the balls of the urn (which contains d colors), but can be reinforced by a function f. This is a way to model risk aversion. Firstly, by considering that f is concave or convex and by reformulating the dynamics of the urn composition as an SA algorithm with remainder, we derive the a.s. convergence and the asymptotic normality (Central Limit Theorem, CLT) of the normalized procedure by calling upon the so-called ODE and SDE methods. An in-depth analysis of the case d=2 exhibits two different behaviors: A single equilibrium point when f is concave, and when f is convex, a transition phase from a single attracting equilibrium to a system with two attracting and one repulsive equilibrium points. The last setting is solved using results on non-convergence toward noisy and noiseless "traps" in order to deduce the a.s. convergence toward one of the attracting points. Secondly, the special case of a Polya urn (when the addition rule is the identity matrix) is analyzed, still using result from SA theory about "traps". Finally, these results are applied to a function with regular variation and to an optimal asset allocation in Finance.
△ Less
Submitted 15 May, 2018; v1 submitted 28 November, 2013;
originally announced November 2013.
-
Recursive marginal quantization of the Euler scheme of a diffusion process
Authors:
Gilles Pagès,
Abass Sagna
Abstract:
We propose a new approach to quantize the marginals of the discrete Euler diffusion process. The method is built recursively and involves the conditional distribution of the marginals of the discrete Euler process. Analytically, the method raises several questions like the analysis of the induced quadratic quantization error between the marginals of the Euler process and the proposed q…
▽ More
We propose a new approach to quantize the marginals of the discrete Euler diffusion process. The method is built recursively and involves the conditional distribution of the marginals of the discrete Euler process. Analytically, the method raises several questions like the analysis of the induced quadratic quantization error between the marginals of the Euler process and the proposed quantizations. We show in particular that at every discretization step $t\_k$ of the Euler scheme, this error is bounded by the cumulative quantization errors induced by the Euler operator, from times $t\_0=0$ to time $t\_k$. For numerics, we restrict our analysis to the one dimensional setting and show how to compute the optimal grids using a Newton-Raphson algorithm. We then propose a closed formula for the companion weights and the transition probabilities associated to the proposed quantizations. This allows us to quantize in particular diffusion processes in local volatility models by reducing dramatically the computational complexity of the search of optimal quantizers while increasing their computational precision with respect to the algorithms commonly proposed in this framework. Numerical tests are carried out for the Brownian motion and for the pricing of European options in a local volatility model. A comparison with the Monte Carlo simulations shows that the proposed method may sometimes be more efficient (w.r.t. both computational precision and time complexity) than the Monte Carlo method.
△ Less
Submitted 22 May, 2015; v1 submitted 9 April, 2013;
originally announced April 2013.
-
Invariant distribution of duplicated diffusions and application to Richardson-Romberg extrapolation
Authors:
Vincent Lemaire,
Gilles Pagès,
Fabien Panloup
Abstract:
With a view to numerical applications we address the following question: given an ergodic Brownian diffusion with a unique invariant distribution, what are the invariant distributions of the duplicated system consisting of two trajectories? We mainly focus on the interesting case where the two trajectories are driven by the same Brownian path. Under this assumption, we first show that uniqueness o…
▽ More
With a view to numerical applications we address the following question: given an ergodic Brownian diffusion with a unique invariant distribution, what are the invariant distributions of the duplicated system consisting of two trajectories? We mainly focus on the interesting case where the two trajectories are driven by the same Brownian path. Under this assumption, we first show that uniqueness of the invariant distribution (weak confluence) of the duplicated system is essentially always true in the one-dimensional case. In the multidimensional case, we begin by exhibiting explicit counter-examples. Then, we provide a series of weak confluence criterions (of integral type) and also of a.s. pathwise confluence, depending on the drift and diffusion coefficients through a non-infinitesimal Lyapunov exponent. As examples, we apply our criterions to some non-trivially confluent settings such as classes of gradient systems with non-convex potentials or diffusions where the confluence is generated by the diffusive component. We finally establish that the weak confluence property is connected with an optimal transport problem. As a main application, we apply our results to the optimization of the Richardson-Romberg extrapolation for the numerical approximation of the invariant measure of the initial ergodic Brownian diffusion.
△ Less
Submitted 8 July, 2014; v1 submitted 7 February, 2013;
originally announced February 2013.
-
Functional co-monotony of processes with applications to peacocks and barrier options
Authors:
Gilles Pagès
Abstract:
We show that several general classes of stochastic processes satisfy a functional co-monotony principle, including processes with independent increments, Brownian diffusions, Liouville processes. As a first application, we recover some recent results about peacock processes obtained by Hirsch et al. which were themselves motivated by a former work of Carr et al. about the sensitivity of Asian Call…
▽ More
We show that several general classes of stochastic processes satisfy a functional co-monotony principle, including processes with independent increments, Brownian diffusions, Liouville processes. As a first application, we recover some recent results about peacock processes obtained by Hirsch et al. which were themselves motivated by a former work of Carr et al. about the sensitivity of Asian Call options with respect to their volatility and residual maturity (seniority). We also derive semi-universal bounds for various barrier options.
△ Less
Submitted 12 November, 2012; v1 submitted 19 September, 2012;
originally announced September 2012.
-
Optimal posting price of limit orders: learning by trading
Authors:
Sophie Laruelle,
Charles-Albert Lehalle,
Gilles Pagès
Abstract:
Considering that a trader or a trading algorithm interacting with markets during continuous auctions can be modeled by an iterating procedure adjusting the price at which he posts orders at a given rhythm, this paper proposes a procedure minimizing his costs. We prove the a.s. convergence of the algorithm under assumptions on the cost function and give some practical criteria on model parameters t…
▽ More
Considering that a trader or a trading algorithm interacting with markets during continuous auctions can be modeled by an iterating procedure adjusting the price at which he posts orders at a given rhythm, this paper proposes a procedure minimizing his costs. We prove the a.s. convergence of the algorithm under assumptions on the cost function and give some practical criteria on model parameters to ensure that the conditions to use the algorithm are fulfilled (using notably the co-monotony principle). We illustrate our results with numerical experiments on both simulated data and using a financial market dataset.
△ Less
Submitted 11 September, 2012; v1 submitted 11 December, 2011;
originally announced December 2011.
-
GPGPUs in computational finance: Massive parallel computing for American style options
Authors:
Gilles Pagès,
Benedikt Wilbertz
Abstract:
The pricing of American style and multiple exercise options is a very challenging problem in mathematical finance. One usually employs a Least-Square Monte Carlo approach (Longstaff-Schwartz method) for the evaluation of conditional expectations which arise in the Backward Dynamic Programming principle for such optimal stopping or stochastic control problems in a Markovian framework. Unfortunately…
▽ More
The pricing of American style and multiple exercise options is a very challenging problem in mathematical finance. One usually employs a Least-Square Monte Carlo approach (Longstaff-Schwartz method) for the evaluation of conditional expectations which arise in the Backward Dynamic Programming principle for such optimal stopping or stochastic control problems in a Markovian framework. Unfortunately, these Least-Square Monte Carlo approaches are rather slow and allow, due to the dependency structure in the Backward Dynamic Programming principle, no parallel implementation; whether on the Monte Carlo levelnor on the time layer level of this problem. We therefore present in this paper a quantization method for the computation of the conditional expectations, that allows a straightforward parallelization on the Monte Carlo level. Moreover, we are able to develop for AR(1)-processes a further parallelization in the time domain, which makes use of faster memory structures and therefore maximizes parallel execution. Finally, we present numerical results for a CUDA implementation of this methods. It will turn out that such an implementation leads to an impressive speed-up compared to a serial CPU implementation.
△ Less
Submitted 17 January, 2011;
originally announced January 2011.
-
Randomized Urn Models revisited using Stochastic Approximation
Authors:
Sophie Laruelle,
Gilles Pagès
Abstract:
This paper presents the link between stochastic approximation and clinical trials based on randomized urn models investigated in Bai and Hu (1999,2005) and Bai, Hu and Shen (2002). We reformulate the dynamics of both the urn composition and the assigned treatments as standard stochastic approximation (SA) algorithms with remainder. Then, we derive the a.s. convergence and the asymptotic normality…
▽ More
This paper presents the link between stochastic approximation and clinical trials based on randomized urn models investigated in Bai and Hu (1999,2005) and Bai, Hu and Shen (2002). We reformulate the dynamics of both the urn composition and the assigned treatments as standard stochastic approximation (SA) algorithms with remainder. Then, we derive the a.s. convergence and the asymptotic normality (CLT) of the normalized procedure under less stringent assumptions by calling upon the ODE and SDE methods. As a second step, we investigate a more involved family of models, known as multi-arm clinical trials, where the urn updating depends on the past performances of the treatments. By increasing the dimension of the state vector, our SA approach provides this time a new asymptotic normality result.
△ Less
Submitted 18 January, 2017; v1 submitted 14 January, 2011;
originally announced January 2011.