StepAL: Step-aware Active Learning for Cataract Surgical Videos

Shah, Nisarg A.; Safaei, Bardia; Sikder, Shameema; Vedula, S. Swaroop; Patel, Vishal M.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2507.22059 (cs)

[Submitted on 29 Jul 2025]

Title:StepAL: Step-aware Active Learning for Cataract Surgical Videos

Authors:Nisarg A. Shah, Bardia Safaei, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

View PDF HTML (experimental)

Abstract:Active learning (AL) can reduce annotation costs in surgical video analysis while maintaining model performance. However, traditional AL methods, developed for images or short video clips, are suboptimal for surgical step recognition due to inter-step dependencies within long, untrimmed surgical videos. These methods typically select individual frames or clips for labeling, which is ineffective for surgical videos where annotators require the context of the entire video for annotation. To address this, we propose StepAL, an active learning framework designed for full video selection in surgical step recognition. StepAL integrates a step-aware feature representation, which leverages pseudo-labels to capture the distribution of predicted steps within each video, with an entropy-weighted clustering strategy. This combination prioritizes videos that are both uncertain and exhibit diverse step compositions for annotation. Experiments on two cataract surgery datasets (Cataract-1k and Cataract-101) demonstrate that StepAL consistently outperforms existing active learning approaches, achieving higher accuracy in step recognition with fewer labeled videos. StepAL offers an effective approach for efficient surgical video analysis, reducing the annotation burden in developing computer-assisted surgical systems.

Comments:	Accepted to MICCAI 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2507.22059 [cs.CV]
	(or arXiv:2507.22059v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2507.22059

Submission history

From: Nisarg Shah [view email]
[v1] Tue, 29 Jul 2025 17:59:14 UTC (299 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:StepAL: Step-aware Active Learning for Cataract Surgical Videos

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:StepAL: Step-aware Active Learning for Cataract Surgical Videos

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators