Elucidating the solution space of extended reverse-time SDE for diffusion models

Cui, Qinpeng; Zhang, Xinyi; Bao, Qiqi; Liao, Qingmin

Computer Science > Machine Learning

arXiv:2309.06169 (cs)

[Submitted on 12 Sep 2023 (v1), last revised 27 Feb 2025 (this version, v3)]

Title:Elucidating the solution space of extended reverse-time SDE for diffusion models

Authors:Qinpeng Cui, Xinyi Zhang, Qiqi Bao, Qingmin Liao

View PDF

Abstract:Sampling from Diffusion Models can alternatively be seen as solving differential equations, where there is a challenge in balancing speed and image visual quality. ODE-based samplers offer rapid sampling time but reach a performance limit, whereas SDE-based samplers achieve superior quality, albeit with longer iterations. In this work, we formulate the sampling process as an Extended Reverse-Time SDE (ER SDE), unifying prior explorations into ODEs and SDEs. Theoretically, leveraging the semi-linear structure of ER SDE solutions, we offer exact solutions and approximate solutions for VP SDE and VE SDE, respectively. Based on the approximate solution space of the ER SDE, referred to as one-step prediction errors, we yield mathematical insights elucidating the rapid sampling capability of ODE solvers and the high-quality sampling ability of SDE solvers. Additionally, we unveil that VP SDE solvers stand on par with their VE SDE counterparts. Based on these findings, leveraging the dual advantages of ODE solvers and SDE solvers, we devise efficient high-quality samplers, namely ER-SDE-Solvers. Experimental results demonstrate that ER-SDE-Solvers achieve state-of-the-art performance across all stochastic samplers while maintaining efficiency of deterministic samplers. Specifically, on the ImageNet $128\times128$ dataset, ER-SDE-Solvers obtain 8.33 FID in only 20 function evaluations. Code is available at \href{this https URL}{this https URL}

Comments:	This paper has been accepted by WACV 2025 (Oral). The official version lacked proper attribution to the co-authors, and this version has been updated accordingly
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.06169 [cs.LG]
	(or arXiv:2309.06169v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.06169

Submission history

From: Qinpeng Cui [view email]
[v1] Tue, 12 Sep 2023 12:27:17 UTC (13,067 KB)
[v2] Tue, 26 Sep 2023 06:19:00 UTC (17,661 KB)
[v3] Thu, 27 Feb 2025 07:11:01 UTC (21,621 KB)

Computer Science > Machine Learning

Title:Elucidating the solution space of extended reverse-time SDE for diffusion models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Elucidating the solution space of extended reverse-time SDE for diffusion models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators