Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models

Meoded, Erez

Computer Science > Computer Vision and Pattern Recognition

arXiv:2508.11499 (cs)

[Submitted on 15 Aug 2025]

Title:Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models

Authors:Erez Meoded

View PDF HTML (experimental)

Abstract:Historical handwritten text recognition (HTR) is essential for unlocking the cultural and scholarly value of archival documents, yet digitization is often hindered by scarce transcriptions, linguistic variation, and highly diverse handwriting styles. In this study, we apply TrOCR, a state-of-the-art transformer-based HTR model, to 16th-century Latin manuscripts authored by Rudolf Gwalther. We investigate targeted image preprocessing and a broad suite of data augmentation techniques, introducing four novel augmentation methods designed specifically for historical handwriting characteristics. We also evaluate ensemble learning approaches to leverage the complementary strengths of augmentation-trained models. On the Gwalther dataset, our best single-model augmentation (Elastic) achieves a Character Error Rate (CER) of 1.86, while a top-5 voting ensemble achieves a CER of 1.60 - representing a 50% relative improvement over the best reported TrOCR_BASE result and a 42% improvement over the previous state of the art. These results highlight the impact of domain-specific augmentations and ensemble strategies in advancing HTR performance for historical manuscripts.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL); Machine Learning (cs.LG)
Cite as:	arXiv:2508.11499 [cs.CV]
	(or arXiv:2508.11499v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2508.11499

Submission history

From: Erez Meoded [view email]
[v1] Fri, 15 Aug 2025 14:20:58 UTC (3,682 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators