Med-URWKV: Pure RWKV With ImageNet Pre-training For Medical Image Segmentation

Zhou, Zhenhuan

arXiv:2506.10858 (eess)

[Submitted on 12 Jun 2025]

Abstract: Medical image segmentation is a fundamental technology in computer-aided diagnosis and treatment. Previous methods fall broadly into three categories: convolutional neural network (CNN) based, Transformer based, and hybrid architectures that combine both. Each has its own limitations, such as the restricted receptive fields of CNNs or the computational overhead caused by the quadratic complexity of Transformers. Recently, the Receptance Weighted Key Value (RWKV) model has emerged as a promising alternative for various vision tasks, offering strong long-range modeling capabilities with linear computational complexity. Some studies have also adapted RWKV to medical image segmentation, achieving competitive performance. However, most of these studies focus on modifying the Vision-RWKV (VRWKV) mechanism and train models from scratch, without exploring the potential advantages of pre-trained VRWKV models for medical image segmentation. In this paper, we propose Med-URWKV, a pure RWKV-based architecture built upon the U-Net framework that incorporates ImageNet-based pre-training to further explore the potential of RWKV in medical image segmentation. To the best of our knowledge, Med-URWKV is the first pure RWKV segmentation model in the medical field that can directly reuse a large-scale pre-trained VRWKV encoder. Experimental results on seven datasets demonstrate that Med-URWKV achieves comparable or even superior segmentation performance compared to other carefully optimized RWKV models trained from scratch, which validates the effectiveness of a pre-trained VRWKV encoder in enhancing model performance. The code will be released.
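
The contrast the abstract draws between quadratic and linear complexity can be made concrete with a simple token count. The sketch below is illustrative only and is not taken from the paper: the patch size of 16 and the image sizes are assumptions, the function names are hypothetical, and the counts ignore channel width and constant factors.

```python
def num_tokens(height: int, width: int, patch: int) -> int:
    """Number of non-overlapping patch tokens for an image."""
    if height % patch or width % patch:
        raise ValueError("image size must be divisible by patch size")
    return (height // patch) * (width // patch)


def attention_pairs(n_tokens: int) -> int:
    """Token-to-token interactions in full self-attention: grows as N^2."""
    return n_tokens * n_tokens


def linear_steps(n_tokens: int) -> int:
    """Per-token updates in a linear-complexity mixer: grows as N."""
    return n_tokens


# Assumed settings: square images, patch size 16 (not from the paper).
for side in (224, 448, 896):
    n = num_tokens(side, side, patch=16)
    print(side, n, attention_pairs(n), linear_steps(n))
```

Under these assumptions, doubling the image side multiplies the token count by 4, so the quadratic term grows by 16 while the linear term grows by 4. This gap is why linear-complexity models are attractive for high-resolution medical images.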

Comments:	Preprint draft, 5 pages. This paper will be updated with a formal version in the future. Copyright: College of Computer Science, Nankai University. All rights reserved.
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2506.10858 [eess.IV]
	(or arXiv:2506.10858v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2506.10858

Submission history

From: Zhenhuan Zhou
[v1] Thu, 12 Jun 2025 16:19:18 UTC (263 KB)
