Dynamic Scene Graph Representation for Surgical Video

Holm, Felix; Ghazaei, Ghazal; Czempiel, Tobias; Özsoy, Ege; Saur, Stefan; Navab, Nassir

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.14538 (cs)

[Submitted on 25 Sep 2023 (v1), last revised 24 Oct 2023 (this version, v2)]

Title:Dynamic Scene Graph Representation for Surgical Video

Authors:Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab

View PDF

Abstract:Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time. Despite containing crucial workflow information and being commonly recorded in many procedures, usage of surgical videos for automated surgical workflow understanding is still limited.
In this work, we exploit scene graphs as a more holistic, semantically meaningful and human-readable way to represent surgical videos while encoding all anatomical structures, tools, and their interactions. To properly evaluate the impact of our solutions, we create a scene graph dataset from semantic segmentations from the CaDIS and CATARACTS datasets. We demonstrate that scene graphs can be leveraged through the use of graph convolutional networks (GCNs) to tackle surgical downstream tasks such as surgical workflow recognition with competitive performance. Moreover, we demonstrate the benefits of surgical scene graphs regarding the explainability and robustness of model decisions, which are crucial in the clinical setting.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.14538 [cs.CV]
	(or arXiv:2309.14538v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.14538

Submission history

From: Ghazal Ghazaei [view email]
[v1] Mon, 25 Sep 2023 21:28:14 UTC (5,555 KB)
[v2] Tue, 24 Oct 2023 10:24:00 UTC (5,555 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Scene Graph Representation for Surgical Video

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Scene Graph Representation for Surgical Video

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators