Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

Liu, Yanbin; Dwivedi, Girish; Boussaid, Farid; Sanfilippo, Frank; Yamada, Makoto; Bennamoun, Mohammed

doi:10.1016/j.cmpb.2023.107685

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2208.03934 (eess)

[Submitted on 8 Aug 2022 (v1), last revised 5 Dec 2023 (this version, v3)]

Title:Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

Authors:Yanbin Liu, Girish Dwivedi, Farid Boussaid, Frank Sanfilippo, Makoto Yamada, Mohammed Bennamoun

View PDF HTML (experimental)

Abstract:The generation of three-dimensional (3D) medical images has great application potential since it takes into account the 3D anatomical structure. Two problems prevent effective training of a 3D medical generative model: (1) 3D medical images are expensive to acquire and annotate, resulting in an insufficient number of training images, and (2) a large number of parameters are involved in 3D convolution.
Methods: We propose a novel GAN model called 3D Split&Shuffle-GAN. To address the 3D data scarcity issue, we first pre-train a two-dimensional (2D) GAN model using abundant image slices and inflate the 2D convolution weights to improve the initialization of the 3D GAN. Novel 3D network architectures are proposed for both the generator and discriminator of the GAN model to significantly reduce the number of parameters while maintaining the quality of image generation. Several weight inflation strategies and parameter-efficient 3D architectures are investigated.
Results: Experiments on both heart (Stanford AIMI Coronary Calcium) and brain (Alzheimer's Disease Neuroimaging Initiative) datasets show that our method leads to improved 3D image generation quality (14.7 improvements on Fréchet inception distance) with significantly fewer parameters (only 48.5% of the baseline method).
Conclusions: We built a parameter-efficient 3D medical image generation model. Due to the efficiency and effectiveness, it has the potential to generate high-quality 3D brain and heart images for real use cases.

Comments:	Published at Computer Methods and Programs in Biomedicine (CMPB) 2023
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
ACM classes:	I.4; J.3
Cite as:	arXiv:2208.03934 [eess.IV]
	(or arXiv:2208.03934v3 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2208.03934
Journal reference:	Computer Methods and Programs in Biomedicine (2023): 107685
Related DOI:	https://doi.org/10.1016/j.cmpb.2023.107685

Submission history

From: Yanbin Liu [view email]
[v1] Mon, 8 Aug 2022 06:31:00 UTC (1,201 KB)
[v2] Wed, 17 Aug 2022 09:28:35 UTC (3,693 KB)
[v3] Tue, 5 Dec 2023 23:59:59 UTC (1,468 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators