
Showing 1–10 of 10 results for author: Leifert, G

  1. arXiv:1908.09584  [pdf, other]

    cs.CV

    End-To-End Measure for Text Recognition

    Authors: Gundram Leifert, Roger Labahn, Tobias Grüning, Svenja Leifert

    Abstract: Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-kno…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: to appear in the proceedings of ICDAR 2019

  2. arXiv:1807.06270  [pdf, other]

    cs.CV cs.CL

    Bench-Marking Information Extraction in Semi-Structured Historical Handwritten Records

    Authors: Animesh Prasad, Hervé Déjean, Jean-Luc Meunier, Max Weidemann, Johannes Michael, Gundram Leifert

    Abstract: In this report, we present our findings from benchmarking experiments for information extraction on the historical handwritten marriage records (Esposalles) from the IEHHR - ICDAR 2017 robust reading competition. The information extraction is modeled as semantic labeling of the sequence across 2 sets of labels. This can be achieved by sequentially or jointly applying handwritten text recognition (HTR) and na…

    Submitted 17 July, 2018; originally announced July 2018.

  3. arXiv:1804.09943  [pdf, other]

    cs.IR

    System Description of CITlab's Recognition & Retrieval Engine for ICDAR2017 Competition on Information Extraction in Historical Handwritten Records

    Authors: Tobias Strauß, Max Weidemann, Johannes Michael, Gundram Leifert, Tobias Grüning, Roger Labahn

    Abstract: We present a recognition and retrieval system for the ICDAR2017 Competition on Information Extraction in Historical Handwritten Records which successfully infers person names and other data from marriage records. The system extracts information from the line images with high accuracy and outperforms the baseline. The optical model is based on neural networks. To infer the desired information, re…

    Submitted 26 April, 2018; originally announced April 2018.

    MSC Class: 68T10

  4. A Two-Stage Method for Text Line Detection in Historical Documents

    Authors: Tobias Grüning, Gundram Leifert, Tobias Strauß, Johannes Michael, Roger Labahn

    Abstract: This work presents a two-stage text line detection method for historical documents. Each detected text line is represented by its baseline. In the first stage, a deep neural network called ARU-Net labels each pixel as belonging to one of three classes: baseline, separator or other. The separator class marks the beginning and end of each text line. The ARU-Net is trainable from scratch with manageably few m…

    Submitted 11 July, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: to be published in IJDAR

    Journal ref: International Journal on Document Analysis and Recognition (IJDAR), (2019), 1-18

  5. arXiv:1605.08412  [pdf, ps, other]

    cs.CV cs.AI cs.NE

    CITlab ARGUS for historical handwritten documents

    Authors: Gundram Leifert, Tobias Strauß, Tobias Grüning, Roger Labahn

    Abstract: We describe CITlab's recognition system for the HTRtS competition attached to the 13th International Conference on Document Analysis and Recognition, ICDAR 2015. The task comprises the recognition of historical handwritten documents. The core algorithms of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The software module…

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: Description of CITlab's System for the HTRtS 2015 Task: Handwritten Text Recognition on the tranScriptorium Dataset

    MSC Class: 68T10; 68T05

  6. Regular expressions for decoding of neural network outputs

    Authors: Tobias Strauß, Gundram Leifert, Tobias Grüning, Roger Labahn

    Abstract: This article proposes a convenient tool for decoding the output of neural networks trained by Connectionist Temporal Classification (CTC) for handwritten text recognition. We use regular expressions to describe the complex structures expected in the writing. The corresponding finite automata are employed to build a decoder. We analyze theoretically which calculations are relevant and which can be…

    Submitted 22 February, 2016; v1 submitted 15 September, 2015; originally announced September 2015.

    Comments: 21 pages, 8 (+2) figures, 2 tables

    Report number: NN3600

    MSC Class: 49L20; 90C39; 82C32

  7. arXiv:1412.6061  [pdf]

    cs.CV cs.NE

    CITlab ARGUS for Arabic Handwriting

    Authors: Gundram Leifert, Roger Labahn, Tobias Strauß

    Abstract: In recent years, multidimensional recurrent neural networks (MDRNN) have turned out to perform very well on offline handwriting recognition tasks such as the OpenHaRT 2013 evaluation DIR. With suitable writing preprocessing and dictionary lookup, our ARGUS software completed this task with an error rate of 26.27% in its primary setup.

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: http://www.nist.gov/itl/iad/mig/upload/OpenHaRT2013_SysDesc_CITLAB.pdf

    MSC Class: 68T10; 68T05

  8. arXiv:1412.6012  [pdf, ps, other]

    cs.CV cs.NE

    CITlab ARGUS for historical data tables

    Authors: Gundram Leifert, Tobias Grüning, Tobias Strauß, Roger Labahn

    Abstract: We describe CITlab's recognition system for the ANWRESH-2014 competition attached to the 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014. The task comprises word recognition from segmented historical documents. The core components of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The soft…

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1412.3949

    MSC Class: 68T05; 68T10

  9. arXiv:1412.3949  [pdf, ps, other]

    cs.CV cs.NE

    CITlab ARGUS for historical handwritten documents

    Authors: Tobias Strauß, Tobias Grüning, Gundram Leifert, Roger Labahn

    Abstract: We describe CITlab's recognition system for the HTRtS competition attached to the 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014. The task comprises the recognition of historical handwritten documents. The core algorithms of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The software mod…

    Submitted 12 December, 2014; originally announced December 2014.

    MSC Class: 68T05; 68T10

  10. arXiv:1412.2620  [pdf, other]

    cs.AI cs.NE

    Cells in Multidimensional Recurrent Neural Networks

    Authors: G. Leifert, T. Strauß, T. Grüning, R. Labahn

    Abstract: The transcription of handwritten text in images is a machine learning task, and one way to solve it is to use multi-dimensional recurrent neural networks (MDRNN) with connectionist temporal classification (CTC). The RNNs can contain special units, the long short-term memory (LSTM) cells. They are able to learn long-term dependencies, but they become unstable when the dimension is chosen greate…

    Submitted 16 February, 2016; v1 submitted 8 December, 2014; originally announced December 2014.

    MSC Class: 68T10; 68T05

    Journal ref: Journal of Machine Learning Research 17 (2016) 1-37