Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

Heidari, Moein; Kolahi, Sina Ghorbani; Karimijafarbigloo, Sanaz; Azad, Bobby; Bozorgpour, Afshin; Hatami, Soheila; Azad, Reza; Diba, Ali; Bagci, Ulas; Merhof, Dorit; Hacihaliloglu, Ilker

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2406.03430 (eess)

[Submitted on 5 Jun 2024]

Title:Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

Authors:Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu

View PDF HTML (experimental)

Abstract:Sequence modeling plays a vital role across various domains, with recurrent neural networks being historically the predominant method of performing these tasks. However, the emergence of transformers has altered this paradigm due to their superior performance. Built upon these advances, transformers have conjoined CNNs as two leading foundational models for learning visual representations. However, transformers are hindered by the $\mathcal{O}(N^2)$ complexity of their attention mechanisms, while CNNs lack global receptive fields and dynamic weight allocation. State Space Models (SSMs), specifically the \textit{\textbf{Mamba}} model with selection mechanisms and hardware-aware architecture, have garnered immense interest lately in sequential modeling and visual representation learning, challenging the dominance of transformers by providing infinite context lengths and offering substantial efficiency maintaining linear complexity in the input sequence. Capitalizing on the advances in computer vision, medical imaging has heralded a new epoch with Mamba models. Intending to help researchers navigate the surge, this survey seeks to offer an encyclopedic review of Mamba models in medical imaging. Specifically, we start with a comprehensive theoretical review forming the basis of SSMs, including Mamba architecture and its alternatives for sequence modeling paradigms in this context. Next, we offer a structured classification of Mamba models in the medical field and introduce a diverse categorization scheme based on their application, imaging modalities, and targeted organs. Finally, we summarize key challenges, discuss different future research directions of the SSMs in the medical domain, and propose several directions to fulfill the demands of this field. In addition, we have compiled the studies discussed in this paper along with their open-source implementations on our GitHub repository.

Comments:	This is the first version of our survey, and the paper is currently under review
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.03430 [eess.IV]
	(or arXiv:2406.03430v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2406.03430

Submission history

From: Moein Heidari [view email]
[v1] Wed, 5 Jun 2024 16:29:03 UTC (6,040 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators