A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis

Studer, Linda; Alberti, Michele; Pondenkandath, Vinaychandran; Goktepe, Pinar; Kolonko, Thomas; Fischer, Andreas; Liwicki, Marcus; Ingold, Rolf

arXiv:1905.09113 (cs)

[Submitted on 22 May 2019]

Abstract: Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, which are often challenging for machine learning due to a lack of human-annotated learning samples. With the advent of deep neural networks, a promising way to cope with the lack of training data is to pre-train models on images from a different domain and then fine-tune them on historical documents. In the current research, a typical example of such cross-domain transfer learning is the use of neural networks that have been pre-trained on the ImageNet database for object recognition. It remains a mostly open question whether or not this pre-training helps to analyse historical documents, which have fundamentally different image properties when compared with ImageNet. In this paper, we present a comprehensive empirical survey on the effect of ImageNet pre-training for diverse historical document analysis tasks, including character recognition, style classification, manuscript dating, semantic segmentation, and content-based retrieval. While we obtain mixed results for semantic segmentation at pixel-level, we observe a clear trend across different network architectures that ImageNet pre-training has a positive effect on classification as well as content-based retrieval.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1905.09113 [cs.CV]
	(or arXiv:1905.09113v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.09113

Submission history

From: Linda Studer
[v1] Wed, 22 May 2019 13:07:00 UTC (8,362 KB)
