Convergence of Langevin Monte Carlo in Chi-Square Divergence

Erdogdu, Murat A.; Hosseinzadeh, Rasa; Zhang, Matthew S.

Statistics > Machine Learning

arXiv:2007.11612v3 (stat)

[Submitted on 22 Jul 2020 (v1), revised 24 May 2021 (this version, v3), latest version 8 Jul 2021 (v4)]

Title:Convergence of Langevin Monte Carlo in Chi-Square Divergence

Authors:Murat A. Erdogdu, Rasa Hosseinzadeh, Matthew S. Zhang

View PDF

Abstract:We study sampling from a target distribution $\nu_* = e^{-f}$ using the unadjusted Langevin Monte Carlo (LMC) algorithm when the potential $f$ satisfies a strong dissipativity condition and it is first-order smooth with Lipschitz gradient. We prove that, initialized with a Gaussian that has sufficiently small variance, $\widetilde{\mathcal{O}}(\lambda d\epsilon^{-1})$ steps of the LMC algorithm are sufficient to reach $\epsilon$-neighborhood of the target in Chi-square divergence, where $\lambda$ is the log-Sobolev constant of $\nu_*$. Our results do not require warm-start to deal with exponential dimension dependency in Chi-square divergence at initialization. In particular, for strongly convex and first-order smooth potentials, we show that the LMC algorithm achieves the rate estimate $\widetilde{\mathcal{O}}(d\epsilon^{-1})$ which improves the previously known rates in this metric, under the same assumptions. Translating to other metrics, our result also recovers the best-known rate estimates in KL divergence, total variation and $2$-Wasserstein distance in the same setup. Finally, as we rely on the log-Sobolev inequality, our framework covers a wide range of non-convex potentials that are first-order smooth and that exhibit strong convexity outside of a compact region.

Comments:	v1: There was an error in the proof of Lemma 1. Authors thank Andre Wibisono for noticing this and letting us know. v2: Paper is updated with an opaque condition, in order not to mislead researchers. v3: Opaque condition in the previous version is proved under LSI and strong dissipativity
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR); Computation (stat.CO)
Cite as:	arXiv:2007.11612 [stat.ML]
	(or arXiv:2007.11612v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2007.11612

Submission history

From: Rasa Hosseinzadeh [view email]
[v1] Wed, 22 Jul 2020 18:18:28 UTC (45 KB)
[v2] Thu, 30 Jul 2020 16:23:22 UTC (12 KB)
[v3] Mon, 24 May 2021 17:57:14 UTC (43 KB)
[v4] Thu, 8 Jul 2021 06:45:09 UTC (48 KB)

Statistics > Machine Learning

Title:Convergence of Langevin Monte Carlo in Chi-Square Divergence

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Convergence of Langevin Monte Carlo in Chi-Square Divergence

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators