Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition

Kang, Lei; Rusiñol, Marçal; Fornés, Alicia; Riba, Pau; Villegas, Mauricio

doi:10.1109/WACV45572.2020.9093392

Computer Science > Computer Vision and Pattern Recognition

arXiv:1909.08473 (cs)

[Submitted on 18 Sep 2019 (v1), last revised 26 May 2020 (this version, v2)]

Title:Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition

Authors:Lei Kang, Marçal Rusiñol, Alicia Fornés, Pau Riba, Mauricio Villegas

View PDF

Abstract:Handwritten Text Recognition (HTR) is still a challenging problem because it must deal with two important difficulties: the variability among writing styles, and the scarcity of labelled data. To alleviate such problems, synthetic data generation and data augmentation are typically used to train HTR systems. However, training with such data produces encouraging but still inaccurate transcriptions in real words. In this paper, we propose an unsupervised writer adaptation approach that is able to automatically adjust a generic handwritten word recognizer, fully trained with synthetic fonts, towards a new incoming writer. We have experimentally validated our proposal using five different datasets, covering several challenges (i) the document source: modern and historic samples, which may involve paper degradation problems; (ii) different handwriting styles: single and multiple writer collections; and (iii) language, which involves different character combinations. Across these challenging collections, we show that our system is able to maintain its performance, thus, it provides a practical and generic approach to deal with new document collections without requiring any expensive and tedious manual annotation step.

Comments:	Accepted to WACV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1909.08473 [cs.CV]
	(or arXiv:1909.08473v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1909.08473
Related DOI:	https://doi.org/10.1109/WACV45572.2020.9093392

Submission history

From: Lei Kang [view email]
[v1] Wed, 18 Sep 2019 14:32:41 UTC (3,344 KB)
[v2] Tue, 26 May 2020 21:15:08 UTC (3,344 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators