SEMv2: Table Separation Line Detection Based on Instance Segmentation

Zhang, Zhenrong; Hu, Pengfei; Ma, Jiefeng; Du, Jun; Zhang, Jianshu; Zhu, Huihui; Yin, Baocai; Yin, Bing; Liu, Cong

arXiv:2303.04384 (cs)

[Submitted on 8 Mar 2023 (v1), last revised 12 Jan 2024 (this version, v2)]

Abstract: Table structure recognition is an indispensable element for enabling machines to comprehend tables. Its primary purpose is to identify the internal structure of a table. Nevertheless, because tables vary widely in structure and style, it is highly challenging to parse tabular data into a structured format that machines can comprehend. In this work, we adhere to the principle of split-and-merge based methods and propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge). Unlike previous works on the "split" stage, we aim to address the instance-level discrimination problem for table separation lines and introduce a table separation line detection strategy based on conditional convolution. Specifically, we design the "split" stage in a top-down manner that first detects each table separation line instance and then dynamically predicts a table separation line mask for each instance. The final table separation line shape can be accurately obtained by processing the table separation line mask in a row-wise/column-wise manner. To comprehensively evaluate SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB, which encompasses tables of multiple styles in various scenarios such as photos and scanned documents. Extensive experiments on publicly available datasets (e.g., SciTSR, PubTabNet and iFLYTAB) demonstrate the efficacy of our proposed approach. The code and iFLYTAB dataset are available at this https URL.
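
The "split" stage described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the function names `dynamic_mask` and `row_line_from_mask` are hypothetical, the instance-specific filter is assumed to be a single 1x1 kernel (in a conditional-convolution model that kernel would be predicted by the network for each detected instance, whereas here it is hard-coded), and taking the per-column argmax is only one plausible way to process a row separator mask column by column.

```python
import numpy as np


def dynamic_mask(features, kernel, bias=0.0):
    """Apply one instance-specific 1x1 kernel to a shared (C, H, W) feature map.

    Assumption: a single 1x1 filter per separation line instance.
    """
    logits = np.einsum("c,chw->hw", kernel, features) + bias
    return 1.0 / (1.0 + np.exp(-logits))  # sigmoid gives per-pixel probabilities


def row_line_from_mask(mask):
    """For a row separator, pick the most likely y position in every column.

    Assumption: argmax along the height axis stands in for the
    row-wise/column-wise processing mentioned in the abstract.
    """
    return mask.argmax(axis=0)


# Toy shared feature map: 2 channels, height 5, width 4.
features = np.zeros((2, 5, 4))
features[0, 2, :] = 4.0   # channel 0 responds along row 2
features[0, 3, 3] = 6.0   # the line bends down by one pixel in the last column

# Hard-coded stand-in for the kernel a network would predict for this instance.
kernel = np.array([1.0, 0.0])

mask = dynamic_mask(features, kernel, bias=-2.0)
ys = row_line_from_mask(mask)
print(ys.tolist())  # [2, 2, 2, 3]
```

The returned list holds one y coordinate per image column, so a separation line that is bent or skewed (as in photographed tables) is represented as a polyline rather than a single horizontal cut. A column separator would be handled symmetrically with `mask.argmax(axis=1)`.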

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.04384 [cs.CV]
	(or arXiv:2303.04384v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.04384

Submission history

From: Zhenrong Zhang
[v1] Wed, 8 Mar 2023 05:15:01 UTC (20,463 KB)
[v2] Fri, 12 Jan 2024 07:00:30 UTC (36,078 KB)
