Assessing the generalization performance of SAM for ureteroscopy scene understanding

Villagrana, Martin; Lopez-Tiro, Francisco; Larose, Clement; Ochoa-Ruiz, Gilberto; Daul, Christian

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2505.17210 (eess)

[Submitted on 22 May 2025]

Title:Assessing the generalization performance of SAM for ureteroscopy scene understanding

Authors:Martin Villagrana, Francisco Lopez-Tiro, Clement Larose, Gilberto Ochoa-Ruiz, Christian Daul

View PDF HTML (experimental)

Abstract:The segmentation of kidney stones is regarded as a critical preliminary step to enable the identification of urinary stone types through machine- or deep-learning-based approaches. In urology, manual segmentation is considered tedious and impractical due to the typically large scale of image databases and the continuous generation of new data. In this study, the potential of the Segment Anything Model (SAM) -- a state-of-the-art deep learning framework -- is investigated for the automation of kidney stone segmentation. The performance of SAM is evaluated in comparison to traditional models, including U-Net, Residual U-Net, and Attention U-Net, which, despite their efficiency, frequently exhibit limitations in generalizing to unseen datasets. The findings highlight SAM's superior adaptability and efficiency. While SAM achieves comparable performance to U-Net on in-distribution data (Accuracy: 97.68 + 3.04; Dice: 97.78 + 2.47; IoU: 95.76 + 4.18), it demonstrates significantly enhanced generalization capabilities on out-of-distribution data, surpassing all U-Net variants by margins of up to 23 percent.

Comments:	15 pages, 4 figures, 2 tables, conference, MIUA25
Subjects:	Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2505.17210 [eess.IV]
	(or arXiv:2505.17210v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2505.17210

Submission history

From: Francisco Javier Lopez-Tiro [view email]
[v1] Thu, 22 May 2025 18:35:37 UTC (3,835 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Assessing the generalization performance of SAM for ureteroscopy scene understanding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Assessing the generalization performance of SAM for ureteroscopy scene understanding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators