Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context

Deshpande, Rucha; Özbey, Muzaffer; Li, Hua; Anastasio, Mark A.; Brooks, Frank J.

arXiv:2309.10817 (eess)

[Submitted on 19 Sep 2023]

Abstract: Diffusion models have emerged as a popular family of deep generative models (DGMs). In the literature, it has been claimed that one class of diffusion models, denoising diffusion probabilistic models (DDPMs), demonstrates superior image synthesis performance compared to generative adversarial networks (GANs). To date, these claims have been evaluated using either ensemble-based methods designed for natural images or conventional measures of image quality such as structural similarity. However, there remains an important need to understand the extent to which DDPMs can reliably learn medical imaging domain-relevant information, which is referred to as 'spatial context' in this work. To address this, a systematic assessment of the ability of DDPMs to learn spatial context relevant to medical imaging applications is reported for the first time. A key aspect of the studies is the use of stochastic context models (SCMs) to produce training data. In this way, the ability of the DDPMs to reliably reproduce spatial context can be quantitatively assessed by use of post-hoc image analyses. Error rates in DDPM-generated ensembles are reported and compared to those corresponding to a modern GAN. The studies reveal new and important insights regarding the capacity of DDPMs to learn spatial context. Notably, the results demonstrate that DDPMs hold significant capacity for generating contextually correct images that are 'interpolated' between training samples, which may benefit data-augmentation tasks in ways that GANs cannot.

Comments:	This paper is under consideration at IEEE TMI
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2309.10817 [eess.IV]
	(or arXiv:2309.10817v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2309.10817

Submission history

From: Rucha Deshpande
[v1] Tue, 19 Sep 2023 17:58:35 UTC (18,507 KB)
