Showing 1–12 of 12 results for author: Labahn, R

  1. Optimizing small BERTs trained for German NER

    Authors: Jochen Zöllner, Konrad Sperfeld, Christoph Wick, Roger Labahn

    Abstract: Currently, the most widespread neural network architecture for training language models is the so called BERT which led to improvements in various Natural Language Processing (NLP) tasks. In general, the larger the number of parameters in a BERT model, the better the results obtained in these NLP tasks. Unfortunately, the memory consumption and the training duration drastically increases with the…

    Submitted 1 November, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Journal ref: MDPI Information 2021, vol. 12 nr. 11, article-nr. 443

  2. arXiv:1908.09584  [pdf, other]

    cs.CV

    End-To-End Measure for Text Recognition

    Authors: Gundram Leifert, Roger Labahn, Tobias Grüning, Svenja Leifert

    Abstract: Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-kno…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: to appear in proceeding at ICDAR 2019

  3. arXiv:1903.07377  [pdf, other]

    cs.CV cs.LG

    Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition

    Authors: Johannes Michael, Roger Labahn, Tobias Grüning, Jochen Zöllner

    Abstract: Encoder-decoder models have become an effective approach for sequence learning tasks like machine translation, image captioning and speech recognition, but have yet to show competitive results for handwritten text recognition. To this end, we propose an attention-based sequence-to-sequence model. It combines a convolutional neural network as a generic feature extractor with a recurrent neural netw…

    Submitted 15 July, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: 8 pages, 1 figure, 8 tables

  4. arXiv:1804.09943  [pdf, other]

    cs.IR

    System Description of CITlab's Recognition & Retrieval Engine for ICDAR2017 Competition on Information Extraction in Historical Handwritten Records

    Authors: Tobias Strauß, Max Weidemann, Johannes Michael, Gundram Leifert, Tobias Grüning, Roger Labahn

    Abstract: We present a recognition and retrieval system for the ICDAR2017 Competition on Information Extraction in Historical Handwritten Records which successfully infers person names and other data from marriage records. The system extracts information from the line images with a high accuracy and outperforms the baseline. The optical model is based on Neural Networks. To infer the desired information, re…

    Submitted 26 April, 2018; originally announced April 2018.

    MSC Class: 68T10

  5. A Two-Stage Method for Text Line Detection in Historical Documents

    Authors: Tobias Grüning, Gundram Leifert, Tobias Strauß, Johannes Michael, Roger Labahn

    Abstract: This work presents a two-stage text line detection method for historical documents. Each detected text line is represented by its baseline. In a first stage, a deep neural network called ARU-Net labels pixels to belong to one of the three classes: baseline, separator or other. The separator class marks beginning and end of each text line. The ARU-Net is trainable from scratch with manageably few m…

    Submitted 11 July, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: to be published in IJDAR

    Journal ref: International Journal on Document Analysis and Recognition (IJDAR), (2019), 1-18

  6. arXiv:1705.03311  [pdf, other]

    cs.CV

    READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents

    Authors: Tobias Grüning, Roger Labahn, Markus Diem, Florian Kleber, Stefan Fiel

    Abstract: Text line detection is crucial for any application associated with Automatic Text Recognition or Keyword Spotting. Modern algorithms perform good on well-established datasets since they either comprise clean data or simple/homogeneous page layouts. We have collected and annotated 2036 archival document images from different locations and time periods. The dataset contains varying page layouts and…

    Submitted 11 December, 2017; v1 submitted 9 May, 2017; originally announced May 2017.

    Comments: Submitted to DAS2018

  7. arXiv:1605.08412  [pdf, ps, other]

    cs.CV cs.AI cs.NE

    CITlab ARGUS for historical handwritten documents

    Authors: Gundram Leifert, Tobias Strauß, Tobias Grüning, Roger Labahn

    Abstract: We describe CITlab's recognition system for the HTRtS competition attached to the 13. International Conference on Document Analysis and Recognition, ICDAR 2015. The task comprises the recognition of historical handwritten documents. The core algorithms of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The software module…

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: Description of CITlab's System for the HTRtS 2015 Task : Handwritten Text Recognition on the tranScriptorium Dataset

    MSC Class: 68T10; 68T05

  8. Regular expressions for decoding of neural network outputs

    Authors: Tobias Strauß, Gundram Leifert, Tobias Grüning, Roger Labahn

    Abstract: This article proposes a convenient tool for decoding the output of neural networks trained by Connectionist Temporal Classification (CTC) for handwritten text recognition. We use regular expressions to describe the complex structures expected in the writing. The corresponding finite automata are employed to build a decoder. We analyze theoretically which calculations are relevant and which can be…

    Submitted 22 February, 2016; v1 submitted 15 September, 2015; originally announced September 2015.

    Comments: 21 pages, 8 (+2) figures, 2 tables

    Report number: NN3600

    MSC Class: 49L20; 90C39; 82C32

  9. arXiv:1412.6061  [pdf]

    cs.CV cs.NE

    CITlab ARGUS for Arabic Handwriting

    Authors: Gundram Leifert, Roger Labahn, Tobias Strauß

    Abstract: In the recent years it turned out that multidimensional recurrent neural networks (MDRNN) perform very well for offline handwriting recognition tasks like the OpenHaRT 2013 evaluation DIR. With suitable writing preprocessing and dictionary lookup, our ARGUS software completed this task with an error rate of 26.27% in its primary setup.

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: http://www.nist.gov/itl/iad/mig/upload/OpenHaRT2013_SysDesc_CITLAB.pdf

    MSC Class: 68T10; 68T05

  10. arXiv:1412.6012  [pdf, ps, other]

    cs.CV cs.NE

    CITlab ARGUS for historical data tables

    Authors: Gundram Leifert, Tobias Grüning, Tobias Strauß, Roger Labahn

    Abstract: We describe CITlab's recognition system for the ANWRESH-2014 competition attached to the 14. International Conference on Frontiers in Handwriting Recognition, ICFHR 2014. The task comprises word recognition from segmented historical documents. The core components of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The soft…

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1412.3949

    MSC Class: 68T05; 68T10

  11. arXiv:1412.3949  [pdf, ps, other]

    cs.CV cs.NE

    CITlab ARGUS for historical handwritten documents

    Authors: Tobias Strauß, Tobias Grüning, Gundram Leifert, Roger Labahn

    Abstract: We describe CITlab's recognition system for the HTRtS competition attached to the 14. International Conference on Frontiers in Handwriting Recognition, ICFHR 2014. The task comprises the recognition of historical handwritten documents. The core algorithms of our system are based on multi-dimensional recurrent neural networks (MDRNN) and connectionist temporal classification (CTC). The software mod…

    Submitted 12 December, 2014; originally announced December 2014.

    MSC Class: 68T05; 68T10

  12. arXiv:1412.2620  [pdf, other]

    cs.AI cs.NE

    Cells in Multidimensional Recurrent Neural Networks

    Authors: G. Leifert, T. Strauß, T. Grüning, R. Labahn

    Abstract: The transcription of handwritten text on images is one task in machine learning and one solution to solve it is using multi-dimensional recurrent neural networks (MDRNN) with connectionist temporal classification (CTC). The RNNs can contain special units, the long short-term memory (LSTM) cells. They are able to learn long term dependencies but they get unstable when the dimension is chosen greate…

    Submitted 16 February, 2016; v1 submitted 8 December, 2014; originally announced December 2014.

    MSC Class: 68T10; 68T05

    Journal ref: Journal of Machine Learning Research 17 (2016) 1-37