FETNet: Feature Erasing and Transferring Network for Scene Text Removal

Lyu, Guangtao; Liu, Kun; Zhu, Anna; Uchida, Seiichi; Iwana, Brian Kenji

doi:10.1016/j.patcog.2023.109531

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.09593 (cs)

[Submitted on 16 Jun 2023]

Title:FETNet: Feature Erasing and Transferring Network for Scene Text Removal

Authors:Guangtao Lyu, Kun Liu, Anna Zhu, Seiichi Uchida, Brian Kenji Iwana

View PDF

Abstract:The scene text removal (STR) task aims to remove text regions and recover the background smoothly in images for private information protection. Most existing STR methods adopt encoder-decoder-based CNNs, with direct copies of the features in the skip connections. However, the encoded features contain both text texture and structure information. The insufficient utilization of text features hampers the performance of background reconstruction in text removal regions. To tackle these problems, we propose a novel Feature Erasing and Transferring (FET) mechanism to reconfigure the encoded features for STR in this paper. In FET, a Feature Erasing Module (FEM) is designed to erase text features. An attention module is responsible for generating the feature similarity guidance. The Feature Transferring Module (FTM) is introduced to transfer the corresponding features in different layers based on the attention guidance. With this mechanism, a one-stage, end-to-end trainable network called FETNet is constructed for scene text removal. In addition, to facilitate research on both scene text removal and segmentation tasks, we introduce a novel dataset, Flickr-ST, with multi-category annotations. A sufficient number of experiments and ablation studies are conducted on the public datasets and Flickr-ST. Our proposed method achieves state-of-the-art performance using most metrics, with remarkably higher quality scene text removal results. The source code of our work is available at: \href{this https URL}{this https URL.

Comments:	Accepted by Pattern Recognition 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.09593 [cs.CV]
	(or arXiv:2306.09593v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.09593
Journal reference:	Pattern Recognition 2023
Related DOI:	https://doi.org/10.1016/j.patcog.2023.109531

Submission history

From: Guangtao Lyu [view email]
[v1] Fri, 16 Jun 2023 02:38:30 UTC (11,779 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:FETNet: Feature Erasing and Transferring Network for Scene Text Removal

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FETNet: Feature Erasing and Transferring Network for Scene Text Removal

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators