Skip to main content

Showing 1–1 of 1 results for author: Amador, B M

Searching in archive cs. Search in all archives.
.
  1. ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

    Authors: Ayush Kumar Shah, Bryan Manrique Amador, Abhisek Dey, Ming Creekmore, Blake Ocampo, Scott Denmark, Richard Zanibbi

    Abstract: Most molecular diagram parsers recover chemical structure from raster images (e.g., PNGs). However, many PDFs include commands giving explicit locations and shapes for characters, lines, and polygons. We present a new parser that uses these born-digital PDF primitives as input. The parsing model is fast and accurate, and does not require GPUs, Optical Character Recognition (OCR), or vectorization.… ▽ More

    Submitted 26 February, 2025; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 20 pages without references, 12 figures, 4 Tables, submitted to International Conference on Document Analysis and Recognition (ICDAR) - Journal Track

    Journal ref: IJDAR, vol. 27, no. 3, pp. 395-414, Sep. 2024