Convergence Rates for Non-Log-Concave Sampling and Log-Partition Estimation

Holzmüller, David; Bach, Francis

arXiv:2303.03237 (stat)

[Submitted on 6 Mar 2023 (v1), last revised 1 Aug 2023 (this version, v3)]

Abstract: Sampling from Gibbs distributions $p(x) \propto \exp(-V(x)/\varepsilon)$ and computing their log-partition function are fundamental tasks in statistics, machine learning, and statistical physics. However, while efficient algorithms are known for convex potentials $V$, the situation is much more difficult in the non-convex case, where algorithms necessarily suffer from the curse of dimensionality in the worst case. For optimization, which can be seen as a low-temperature limit of sampling, it is known that smooth functions $V$ allow faster convergence rates. Specifically, for $m$-times differentiable functions in $d$ dimensions, the optimal rate for algorithms with $n$ function evaluations is known to be $O(n^{-m/d})$, where the constant can potentially depend on $m, d$ and the function to be optimized. Hence, the curse of dimensionality can be alleviated for smooth functions at least in terms of the convergence rate. Recently, it has been shown that similarly fast rates can also be achieved with polynomial runtime $O(n^{3.5})$, where the exponent $3.5$ is independent of $m$ or $d$. Hence, it is natural to ask whether similar rates for sampling and log-partition computation are possible, and whether they can be realized in polynomial time with an exponent independent of $m$ and $d$. We show that the optimal rates for sampling and log-partition computation are sometimes equal and sometimes faster than for optimization. We then analyze various polynomial-time sampling algorithms, including an extension of a recent promising optimization approach, and find that they sometimes exhibit interesting behavior but no near-optimal rates. Our results also give further insights on the relation between sampling, log-partition, and optimization problems.
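To make the quantities in the abstract concrete, the following is a minimal sketch of a log-partition estimate for a non-convex potential. Everything specific here is an assumption for illustration and is not taken from the paper: the domain is taken to be the unit cube $[0,1]^d$, the potential `V` is a hypothetical multi-well function with minimum value $0$, and the estimator is plain uniform Monte Carlo, which does not exploit the smoothness of $V$ and is not one of the algorithms analyzed by the authors. The sketch estimates $\varepsilon \log \int_{[0,1]^d} \exp(-V(x)/\varepsilon)\,dx$ and illustrates the low-temperature link to optimization: as $\varepsilon \to 0$ this quantity approaches $-\min_x V(x)$.

```python
import numpy as np


def V(x):
    """Hypothetical non-convex potential on [0,1]^d (illustration only).

    Each coordinate contributes sin^2(2*pi*t), which has several wells,
    so V is non-convex and its minimum value over the cube is 0.
    """
    return np.sum(np.sin(2.0 * np.pi * x) ** 2, axis=1)


def log_partition_mc(V, d, eps, n, rng):
    """Uniform Monte Carlo estimate of eps * log of the integral of
    exp(-V(x)/eps) over the unit cube [0,1]^d, using n evaluations of V."""
    x = rng.random((n, d))      # n uniform points in the (assumed) domain
    a = -V(x) / eps             # log of the unnormalized density at each point
    a_max = a.max()
    # log-mean-exp, shifted by the maximum for numerical stability
    return eps * (a_max + np.log(np.mean(np.exp(a - a_max))))


rng = np.random.default_rng(0)
for eps in (1.0, 0.1, 0.01):
    est = log_partition_mc(V, d=2, eps=eps, n=200_000, rng=rng)
    print(f"eps={eps:5.2f}  eps*log Z is approximately {est:.4f}")
```

For this particular `V`, the estimates are non-positive and move toward $0 = -\min_x V(x)$ as `eps` shrinks, which is the sense in which optimization appears as a low-temperature limit of the log-partition problem.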

Comments:	Changes in v3: Minor corrections and improvements. Plots can be reproduced using the code at this https URL
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Computation (stat.CO)
Cite as:	arXiv:2303.03237 [stat.ML]
	(or arXiv:2303.03237v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2303.03237

Submission history

From: David Holzmüller
[v1] Mon, 6 Mar 2023 15:53:44 UTC (847 KB)
[v2] Thu, 6 Apr 2023 16:06:32 UTC (847 KB)
[v3] Tue, 1 Aug 2023 13:09:53 UTC (847 KB)
