Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos

Mondal, Shanka Subhra; Sathish, Rachana; Sheet, Debdoot

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:1905.08315 (eess)

[Submitted on 20 May 2019 (v1), last revised 25 May 2019 (this version, v2)]

Title:Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos

Authors:Shanka Subhra Mondal, Rachana Sathish, Debdoot Sheet

View PDF

Abstract:Surgical workflow analysis is of importance for understanding onset and persistence of surgical phases and individual tool usage across surgery and in each phase. It is beneficial for clinical quality control and to hospital administrators for understanding surgery planning. Video acquired during surgery typically can be leveraged for this task. Currently, a combination of convolutional neural network (CNN) and recurrent neural networks (RNN) are popularly used for video analysis in general, not only being restricted to surgical videos. In this paper, we propose a multi-task learning framework using CNN followed by a bi-directional long short term memory (Bi-LSTM) to learn to encapsulate both forward and backward temporal dependencies. Further, the joint distribution indicating set of tools associated with a phase is used as an additional loss during learning to correct for their co-occurrence in any predictions. Experimental evaluation is performed using the Cholec80 dataset. We report a mean average precision (mAP) score of 0.99 and 0.86 for tool and phase identification respectively which are higher compared to prior-art in the field.

Comments:	15 pages, 8 figures, 5th MedImage Workshop of 11th Indian Conference on Computer Vision, Graphics and Image Processing, Hyderabad, India, 2018
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1905.08315 [eess.IV]
	(or arXiv:1905.08315v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.1905.08315

Submission history

From: Shanka Subhra Mondal [view email]
[v1] Mon, 20 May 2019 19:42:40 UTC (1,604 KB)
[v2] Sat, 25 May 2019 16:38:08 UTC (1,604 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators