Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

Diao, Xiaolei; Shi, Daqian; Li, Jian; Shi, Lida; Yue, Mingzhe; Qi, Ruihua; Li, Chuntao; Xu, Hao

doi:10.1145/3581783.3612201

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.00655 (cs)

[Submitted on 1 Aug 2023]

Title:Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

Authors:Xiaolei Diao, Daqian Shi, Jian Li, Lida Shi, Mingzhe Yue, Ruihua Qi, Chuntao Li, Hao Xu

View PDF

Abstract:Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR scenario with unbalanced data distribution. However, there is a lack of benchmarks for evaluating such zero-shot methods that apply a divide-and-conquer recognition strategy by decomposing characters into radicals. Meanwhile, radical recognition, as another important OCR task, also lacks radical-level annotation for model training. In this paper, we construct an ancient Chinese character image dataset that contains both radical-level and character-level annotations to satisfy the requirements of the above-mentioned methods, namely, ACCID, where radical-level annotations include radical categories, radical locations, and structural relations. To increase the adaptability of ACCID, we propose a splicing-based synthetic character algorithm to augment the training samples and apply an image denoising method to improve the image quality. By introducing character decomposition and recombination, we propose a baseline method for zero-shot OCR. The experimental results demonstrate the validity of ACCID and the baseline model quantitatively and qualitatively.

Comments:	Accepted by ACM MM 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.00655 [cs.CV]
	(or arXiv:2308.00655v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.00655
Related DOI:	https://doi.org/10.1145/3581783.3612201

Submission history

From: Xiaolei Diao [view email]
[v1] Tue, 1 Aug 2023 16:41:30 UTC (7,277 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators