The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound

VanBerlo, Blake; Wong, Alexander; Hoey, Jesse; Arntfield, Robert

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2504.07904 (eess)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 10 Apr 2025 (v1), last revised 10 Jun 2025 (this version, v2)]

Title:The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound

Authors:Blake VanBerlo, Alexander Wong, Jesse Hoey, Robert Arntfield

View PDF HTML (experimental)

Abstract:Data augmentation is a central component of joint embedding self-supervised learning (SSL). Approaches that work for natural images may not always be effective in medical imaging tasks. This study systematically investigated the impact of data augmentation and preprocessing strategies in SSL for lung ultrasound. Three data augmentation pipelines were assessed: (1) a baseline pipeline commonly used across imaging domains, (2) a novel semantic-preserving pipeline designed for ultrasound, and (3) a distilled set of the most effective transformations from both pipelines. Pretrained models were evaluated on multiple classification tasks: B-line detection, pleural effusion detection, and COVID-19 classification. Experiments revealed that semantics-preserving data augmentation resulted in the greatest performance for COVID-19 classification - a diagnostic task requiring global image context. Cropping-based methods yielded the greatest performance on the B-line and pleural effusion object classification tasks, which require strong local pattern recognition. Lastly, semantics-preserving ultrasound image preprocessing resulted in increased downstream performance for multiple tasks. Guidance regarding data augmentation and preprocessing strategies was synthesized for practitioners working with SSL in ultrasound.

Comments:	17 pages, 12 figures, 18 tables, Submitted to Medical Image Analysis
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.2.10; I.4.9; J.3
Cite as:	arXiv:2504.07904 [eess.IV]
	(or arXiv:2504.07904v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2504.07904

Submission history

From: Blake VanBerlo [view email]
[v1] Thu, 10 Apr 2025 16:26:47 UTC (34,344 KB)
[v2] Tue, 10 Jun 2025 20:25:07 UTC (22,299 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators