Showing 1–1 of 1 results for author: Kulshrestha, D

  1. arXiv:2406.17591  [pdf, other]

    cs.CV

    DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation

    Authors: Ahmad Mohammadshirazi, Ali Nosrati Firoozsalari, Mengxi Zhou, Dheeraj Kulshrestha, Rajiv Ramnath

    Abstract: Automating the annotation of scanned documents is challenging, requiring a balance between computational efficiency and accuracy. DocParseNet addresses this by combining deep learning and multi-modal learning to process both text and visual data. This model goes beyond traditional OCR and semantic segmentation, capturing the interplay between text and images to preserve contextual nuances in compl…

    Submitted 21 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.