Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images

Zhang, Yundi; Chen, Chen; Shit, Suprosanna; Starck, Sophie; Rueckert, Daniel; Pan, Jiazhen

doi:10.1007/978-3-031-72378-0_34

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2406.00329 (eess)

[Submitted on 1 Jun 2024 (v1), last revised 6 Jun 2024 (this version, v2)]

Title:Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images

Authors:Yundi Zhang, Chen Chen, Suprosanna Shit, Sophie Starck, Daniel Rueckert, Jiazhen Pan

View PDF HTML (experimental)

Abstract:Cardiac Magnetic Resonance (CMR) imaging serves as the gold-standard for evaluating cardiac morphology and function. Typically, a multi-view CMR stack, covering short-axis (SA) and 2/3/4-chamber long-axis (LA) views, is acquired for a thorough cardiac assessment. However, efficiently streamlining the complex, high-dimensional 3D+T CMR data and distilling compact, coherent representation remains a challenge. In this work, we introduce a whole-heart self-supervised learning framework that utilizes masked imaging modeling to automatically uncover the correlations between spatial and temporal patches throughout the cardiac stacks. This process facilitates the generation of meaningful and well-clustered heart representations without relying on the traditionally required, and often costly, labeled data. The learned heart representation can be directly used for various downstream tasks. Furthermore, our method demonstrates remarkable robustness, ensuring consistent representations even when certain CMR planes are missing/flawed. We train our model on 14,000 unlabeled CMR data from UK BioBank and evaluate it on 1,000 annotated data. The proposed method demonstrates superior performance to baselines in tasks that demand comprehensive 3D+T cardiac information, e.g. cardiac phenotype (ejection fraction and ventricle volume) prediction and multi-plane/multi-frame CMR segmentation, highlighting its effectiveness in extracting comprehensive cardiac features that are both anatomically and pathologically relevant.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2406.00329 [eess.IV]
	(or arXiv:2406.00329v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2406.00329
Related DOI:	https://doi.org/10.1007/978-3-031-72378-0_34

Submission history

From: Yundi Zhang [view email]
[v1] Sat, 1 Jun 2024 07:08:45 UTC (3,637 KB)
[v2] Thu, 6 Jun 2024 15:27:12 UTC (3,637 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators