Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Wu, Yongjian; Zhou, Yang; Saiyin, Jiya; Wei, Bingzheng; Lai, Maode; Shou, Jianzhong; Fan, Yubo; Xu, Yan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.17659 (cs)

[Submitted on 30 Jun 2023]

Title:Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Authors:Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

View PDF

Abstract:Large-scale visual-language pre-trained models (VLPM) have proven their excellent performance in downstream object detection for natural scenes. However, zero-shot nuclei detection on H\&E images via VLPMs remains underexplored. The large gap between medical images and the web-originated text-image pairs used for pre-training makes it a challenging task. In this paper, we attempt to explore the potential of the object-level VLPM, Grounded Language-Image Pre-training (GLIP) model, for zero-shot nuclei detection. Concretely, an automatic prompts design pipeline is devised based on the association binding trait of VLPM and the image-to-text VLPM BLIP, avoiding empirical manual prompts engineering. We further establish a self-training framework, using the automatically designed prompts to generate the preliminary results as pseudo labels from GLIP and refine the predicted boxes in an iterative manner. Our method achieves a remarkable performance for label-free nuclei detection, surpassing other comparison methods. Foremost, our work demonstrates that the VLPM pre-trained on natural image-text pairs exhibits astonishing potential for downstream tasks in the medical field as well. Code will be released at this https URL.

Comments:	This article has been accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.17659 [cs.CV]
	(or arXiv:2306.17659v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.17659

Submission history

From: Yang Zhou [view email]
[v1] Fri, 30 Jun 2023 13:44:13 UTC (15,190 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators