Training Unbiased Diffusion Models From Biased Dataset

Kim, Yeongmin; Na, Byeonghu; Park, Minsang; Jang, JoonHo; Kim, Dongjun; Kang, Wanmo; Moon, Il-Chul

Computer Science > Machine Learning

arXiv:2403.01189 (cs)

[Submitted on 2 Mar 2024]

Title:Training Unbiased Diffusion Models From Biased Dataset

Authors:Yeongmin Kim, Byeonghu Na, Minsang Park, JoonHo Jang, Dongjun Kim, Wanmo Kang, Il-Chul Moon

View PDF

Abstract:With significant advancements in diffusion models, addressing the potential risks of dataset bias becomes increasingly important. Since generated outputs directly suffer from dataset bias, mitigating latent bias becomes a key factor in improving sample quality and proportion. This paper proposes time-dependent importance reweighting to mitigate the bias for the diffusion models. We demonstrate that the time-dependent density ratio becomes more precise than previous approaches, thereby minimizing error propagation in generative learning. While directly applying it to score-matching is intractable, we discover that using the time-dependent density ratio both for reweighting and score correction can lead to a tractable form of the objective function to regenerate the unbiased data density. Furthermore, we theoretically establish a connection with traditional score-matching, and we demonstrate its convergence to an unbiased distribution. The experimental evidence supports the usefulness of the proposed method, which outperforms baselines including time-independent importance reweighting on CIFAR-10, CIFAR-100, FFHQ, and CelebA with various bias settings. Our code is available at this https URL.

Comments:	International Conference on Learning Representations (ICLR 2024)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.01189 [cs.LG]
	(or arXiv:2403.01189v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.01189

Submission history

From: Yeongmin Kim [view email]
[v1] Sat, 2 Mar 2024 12:06:42 UTC (43,875 KB)

Computer Science > Machine Learning

Title:Training Unbiased Diffusion Models From Biased Dataset

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Training Unbiased Diffusion Models From Biased Dataset

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators