Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation

Gorade, Vandan; Mittal, Sparsh; Jha, Debesh; Singhal, Rekha; Bagci, Ulas

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2401.10373 (eess)

[Submitted on 18 Jan 2024 (v1), last revised 8 Aug 2024 (this version, v2)]

Title:Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation

Authors:Vandan Gorade, Sparsh Mittal, Debesh Jha, Rekha Singhal, Ulas Bagci

View PDF HTML (experimental)

Abstract:Deep learning has demonstrated remarkable achievements in medical image segmentation. However, prevailing deep learning models struggle with poor generalization due to (i) intra-class variations, where the same class appears differently in different samples, and (ii) inter-class independence, resulting in difficulties capturing intricate relationships between distinct objects, leading to higher false negative cases. This paper presents a novel approach that synergies spatial and spectral representations to enhance domain-generalized medical image segmentation. We introduce the innovative Spectral Correlation Coefficient objective to improve the model's capacity to capture middle-order features and contextual long-range dependencies. This objective complements traditional spatial objectives by incorporating valuable spectral information. Extensive experiments reveal that optimizing this objective with existing architectures like UNet and TransUNet significantly enhances generalization, interpretability, and noise robustness, producing more confident predictions. For instance, in cardiac segmentation, we observe a 0.81 pp and 1.63 pp (pp = percentage point) improvement in DSC over UNet and TransUNet, respectively. Our interpretability study demonstrates that, in most tasks, objectives optimized with UNet outperform even TransUNet by introducing global contextual information alongside local details. These findings underscore the versatility and effectiveness of our proposed method across diverse imaging modalities and medical domains.

Comments:	Early Accepted at ICPR-2024 for Oral Presentation
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2401.10373 [eess.IV]
	(or arXiv:2401.10373v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2401.10373

Submission history

From: Vandan Gorade [view email]
[v1] Thu, 18 Jan 2024 20:43:43 UTC (19,131 KB)
[v2] Thu, 8 Aug 2024 07:06:40 UTC (32,891 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators