Skip to main content

Showing 1–6 of 6 results for author: Hast, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.09633  [pdf, other

    cs.CV cs.LG

    Paired Image to Image Translation for Strikethrough Removal From Handwritten Words

    Authors: Raphaela Heil, Ekta Vats, Anders Hast

    Abstract: Transcribing struck-through, handwritten words, for example for the purpose of genetic criticism, can pose a challenge to both humans and machines, due to the obstructive properties of the superimposed strokes. This paper investigates the use of paired image to image translation approaches to remove strikethrough strokes from handwritten words. Four different neural network architectures are exami… ▽ More

    Submitted 1 April, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: accepted at DAS2022

  2. TexT - Text Extractor Tool for Handwritten Document Transcription and Annotation

    Authors: Anders Hast, Per Cullhed, Ekta Vats

    Abstract: This paper presents a framework for semi-automatic transcription of large-scale historical handwritten documents and proposes a simple user-friendly text extractor tool, TexT for transcription. The proposed approach provides a quick and easy transcription of text using computer assisted interactive technique. The algorithm finds multiple occurrences of the marked text on-the-fly using a word spott… ▽ More

    Submitted 22 November, 2017; originally announced January 2018.

    Journal ref: Digital Libraries and Multimedia Archives. IRCDL 2018. Communications in Computer and Information Science, vol 806. Springer, Cham

  3. Learning Surrogate Models of Document Image Quality Metrics for Automated Document Image Processing

    Authors: Prashant Singh, Ekta Vats, Anders Hast

    Abstract: Computation of document image quality metrics often depends upon the availability of a ground truth image corresponding to the document. This limits the applicability of quality metrics in applications such as hyperparameter optimization of image processing algorithms that operate on-the-fly on unseen documents. This work proposes the use of surrogate models to learn the behavior of a given docume… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

  4. Radial Line Fourier Descriptor for Historical Handwritten Text Representation

    Authors: Anders Hast, Ekta Vats

    Abstract: Automatic recognition of historical handwritten manuscripts is a daunting task due to paper degradation over time. Recognition-free retrieval or word spotting is popularly used for information retrieval and digitization of the historical handwritten documents. However, the performance of word spotting algorithms depends heavily on feature detection and representation methods. Although there exist… ▽ More

    Submitted 20 March, 2018; v1 submitted 6 September, 2017; originally announced September 2017.

    Comments: under review

  5. Automatic Document Image Binarization using Bayesian Optimization

    Authors: Ekta Vats, Anders Hast, Prashant Singh

    Abstract: Document image binarization is often a challenging task due to various forms of degradation. Although there exist several binarization techniques in literature, the binarized image is typically sensitive to control parameter settings of the employed technique. This paper presents an automatic document image binarization algorithm to segment the text from heavily degraded document images. The propo… ▽ More

    Submitted 21 October, 2017; v1 submitted 6 September, 2017; originally announced September 2017.

    Journal ref: 4th International Workshop on Historical Document Imaging and Processing (HIP2017). ACM, New York, NY, USA, 89-94

  6. arXiv:1709.01775  [pdf, other

    cs.IR cs.DL cs.HC

    On-the-fly Historical Handwritten Text Annotation

    Authors: Ekta Vats, Anders Hast

    Abstract: The performance of information retrieval algorithms depends upon the availability of ground truth labels annotated by experts. This is an important prerequisite, and difficulties arise when the annotated ground truth labels are incorrect or incomplete due to high levels of degradation. To address this problem, this paper presents a simple method to perform on-the-fly annotation of degraded histori… ▽ More

    Submitted 6 September, 2017; originally announced September 2017.

    Journal ref: 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Volume 8, IEEE, Kyoto, Japan, 2017, pp. 10-14