DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Zawar, Rushikesh; Dewan, Shaurya; Saxena, Prakanshul; Chang, Yingshan; Luo, Andrew; Bisk, Yonatan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.05191 (cs)

[Submitted on 7 Jun 2024 (v1), last revised 14 Nov 2024 (this version, v4)]

Title:DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Authors:Rushikesh Zawar, Shaurya Dewan, Prakanshul Saxena, Yingshan Chang, Andrew Luo, Yonatan Bisk

View PDF HTML (experimental)

Abstract:Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models have achieved remarkable success, the underlying mechanisms driving their performance are not yet fully accounted for, with many unanswered questions surrounding what they learn, how they represent visual-semantic relationships, and why they sometimes fail to generalize. Our work presents Diffusion Partial Information Decomposition (DiffusionPID), a novel technique that applies information-theoretic principles to decompose the input text prompt into its elementary components, enabling a detailed examination of how individual tokens and their interactions shape the generated image. We introduce a formal approach to analyze the uniqueness, redundancy, and synergy terms by applying PID to the denoising model at both the image and pixel level. This approach enables us to characterize how individual tokens and their interactions affect the model output. We first present a fine-grained analysis of characteristics utilized by the model to uniquely localize specific concepts, we then apply our approach in bias analysis and show it can recover gender and ethnicity biases. Finally, we use our method to visually characterize word ambiguity and similarity from the model's perspective and illustrate the efficacy of our method for prompt intervention. Our results show that PID is a potent tool for evaluating and diagnosing text-to-image diffusion models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.05191 [cs.CV]
	(or arXiv:2406.05191v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.05191
Journal reference:	Thirty-Eighth Annual Conference on Neural Information Processing Systems (2024)

Submission history

From: Rushikesh Zawar [view email]
[v1] Fri, 7 Jun 2024 18:17:17 UTC (23,203 KB)
[v2] Wed, 12 Jun 2024 21:28:13 UTC (23,203 KB)
[v3] Fri, 4 Oct 2024 17:58:13 UTC (27,164 KB)
[v4] Thu, 14 Nov 2024 21:26:33 UTC (27,164 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators