MedSAM2: Segment Anything in 3D Medical Images and Videos

Ma, Jun; Yang, Zongxin; Kim, Sumin; Chen, Bihui; Baharoon, Mohammed; Fallahpour, Adibvafa; Asakereh, Reza; Lyu, Hongwei; Wang, Bo

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2504.03600 (eess)

[Submitted on 4 Apr 2025]

Title:MedSAM2: Segment Anything in 3D Medical Images and Videos

Authors:Jun Ma, Zongxin Yang, Sumin Kim, Bihui Chen, Mohammed Baharoon, Adibvafa Fallahpour, Reza Asakereh, Hongwei Lyu, Bo Wang

View PDF HTML (experimental)

Abstract:Medical image and video segmentation is a critical task for precision medicine, which has witnessed considerable progress in developing task or modality-specific and generalist models for 2D images. However, there have been limited studies on building general-purpose models for 3D images and videos with comprehensive user studies. Here, we present MedSAM2, a promptable segmentation foundation model for 3D image and video segmentation. The model is developed by fine-tuning the Segment Anything Model 2 on a large medical dataset with over 455,000 3D image-mask pairs and 76,000 frames, outperforming previous models across a wide range of organs, lesions, and imaging modalities. Furthermore, we implement a human-in-the-loop pipeline to facilitate the creation of large-scale datasets resulting in, to the best of our knowledge, the most extensive user study to date, involving the annotation of 5,000 CT lesions, 3,984 liver MRI lesions, and 251,550 echocardiogram video frames, demonstrating that MedSAM2 can reduce manual costs by more than 85%. MedSAM2 is also integrated into widely used platforms with user-friendly interfaces for local and cloud deployment, making it a practical tool for supporting efficient, scalable, and high-quality segmentation in both research and healthcare environments.

Comments:	this https URL
Subjects:	Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.03600 [eess.IV]
	(or arXiv:2504.03600v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2504.03600

Submission history

From: Jun Ma [view email]
[v1] Fri, 4 Apr 2025 17:13:37 UTC (8,084 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:MedSAM2: Segment Anything in 3D Medical Images and Videos

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:MedSAM2: Segment Anything in 3D Medical Images and Videos

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators