Frontiers in Intelligent Colonoscopy

Ji, Ge-Peng; Liu, Jingyi; Xu, Peng; Barnes, Nick; Khan, Fahad Shahbaz; Khan, Salman; Fan, Deng-Ping

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2410.17241 (eess)

[Submitted on 22 Oct 2024 (v1), last revised 1 Feb 2025 (this version, v2)]

Title:Frontiers in Intelligent Colonoscopy

Authors:Ge-Peng Ji, Jingyi Liu, Peng Xu, Nick Barnes, Fahad Shahbaz Khan, Salman Khan, Deng-Ping Fan

View PDF HTML (experimental)

Abstract:Colonoscopy is currently one of the most sensitive screening methods for colorectal cancer. This study investigates the frontiers of intelligent colonoscopy techniques and their prospective implications for multimodal medical applications. With this goal, we begin by assessing the current data-centric and model-centric landscapes through four tasks for colonoscopic scene perception, including classification, detection, segmentation, and vision-language understanding. This assessment enables us to identify domain-specific challenges and reveals that multimodal research in colonoscopy remains open for further exploration. To embrace the coming multimodal era, we establish three foundational initiatives: a large-scale multimodal instruction tuning dataset ColonINST, a colonoscopy-designed multimodal language model ColonGPT, and a multimodal benchmark. To facilitate ongoing monitoring of this rapidly evolving field, we provide a public website for the latest updates: this https URL.

Comments:	[Work in progress] A comprehensive survey of intelligent colonoscopy in the multimodal era. [Updated Version V2] New training strategy for colonoscopy-specific multimodal language model
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.17241 [eess.IV]
	(or arXiv:2410.17241v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2410.17241

Submission history

From: Ge-Peng Ji [view email]
[v1] Tue, 22 Oct 2024 17:57:12 UTC (5,752 KB)
[v2] Sat, 1 Feb 2025 05:00:55 UTC (13,956 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Frontiers in Intelligent Colonoscopy

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Frontiers in Intelligent Colonoscopy

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators