Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training

Bonnaire, Tony; Urfin, Raphaël; Biroli, Giulio; Mézard, Marc

Computer Science > Machine Learning

arXiv:2505.17638 (cs)

[Submitted on 23 May 2025]

Title:Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training

Authors:Tony Bonnaire, Raphaël Urfin, Giulio Biroli, Marc Mézard

View PDF

Abstract:Diffusion models have achieved remarkable success across a wide range of generative tasks. A key challenge is understanding the mechanisms that prevent their memorization of training data and allow generalization. In this work, we investigate the role of the training dynamics in the transition from generalization to memorization. Through extensive experiments and theoretical analysis, we identify two distinct timescales: an early time $\tau_\mathrm{gen}$ at which models begin to generate high-quality samples, and a later time $\tau_\mathrm{mem}$ beyond which memorization emerges. Crucially, we find that $\tau_\mathrm{mem}$ increases linearly with the training set size $n$, while $\tau_\mathrm{gen}$ remains constant. This creates a growing window of training times with $n$ where models generalize effectively, despite showing strong memorization if training continues beyond it. It is only when $n$ becomes larger than a model-dependent threshold that overfitting disappears at infinite training times. These findings reveal a form of implicit dynamical regularization in the training dynamics, which allow to avoid memorization even in highly overparameterized settings. Our results are supported by numerical experiments with standard U-Net architectures on realistic and synthetic datasets, and by a theoretical analysis using a tractable random features model studied in the high-dimensional limit.

Comments:	36 pages, 15 figures
Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
Cite as:	arXiv:2505.17638 [cs.LG]
	(or arXiv:2505.17638v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2505.17638

Submission history

From: Tony Bonnaire [view email]
[v1] Fri, 23 May 2025 08:58:47 UTC (820 KB)

Computer Science > Machine Learning

Title:Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators