An Image Dataset of Text Patches in Everyday Scenes

Ibrahim, Ahmed; Abbott, A. Lynn; Hussein, Mohamed E.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1610.06494 (cs)

[Submitted on 20 Oct 2016]

Title:An Image Dataset of Text Patches in Everyday Scenes

Authors:Ahmed Ibrahim, A. Lynn Abbott, Mohamed E. Hussein

View PDF

Abstract:This paper describes a dataset containing small images of text from everyday scenes. The purpose of the dataset is to support the development of new automated systems that can detect and analyze text. Although much research has been devoted to text detection and recognition in scanned documents, relatively little attention has been given to text detection in other types of images, such as photographs that are posted on social-media sites. This new dataset, known as COCO-Text-Patch, contains approximately 354,000 small images that are each labeled as "text" or "non-text". This dataset particularly addresses the problem of text verification, which is an essential stage in the end-to-end text detection and recognition pipeline. In order to evaluate the utility of this dataset, it has been used to train two deep convolution neural networks to distinguish text from non-text. One network is inspired by the GoogLeNet architecture, and the second one is based on CaffeNet. Accuracy levels of 90.2% and 90.9% were obtained using the two networks, respectively. All of the images, source code, and deep-learning trained models described in this paper will be publicly available

Comments:	Accepted in the 12th International Symposium on Visual Computing (ISVC'16)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1610.06494 [cs.CV]
	(or arXiv:1610.06494v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1610.06494

Submission history

From: Ahmed Ibrahim [view email]
[v1] Thu, 20 Oct 2016 16:38:42 UTC (2,238 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2016-10

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ahmed S. Ibrahim
A. Lynn Abbott
Mohamed E. Hussein

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:An Image Dataset of Text Patches in Everyday Scenes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:An Image Dataset of Text Patches in Everyday Scenes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators