Document Understanding Dataset and Evaluation (DUDE)

Van Landeghem, Jordy; Tito, Rubén; Borchmann, Łukasz; Pietruszka, Michał; Józiak, Paweł; Powalski, Rafał; Jurkiewicz, Dawid; Coustaty, Mickaël; Ackaert, Bertrand; Valveny, Ernest; Blaschko, Matthew; Moens, Sien; Stanisławek, Tomasz


arXiv:2305.08455 (cs)

[Submitted on 15 May 2023 (v1), last revised 11 Sep 2023 (this version, v3)]

Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document layouts based on multi-industry, multi-domain, and multi-page VRDs of various origins and dates. Moreover, we are pushing the boundaries of current methods by creating multi-task and multi-domain evaluation setups that more accurately simulate real-world situations where powerful generalization and adaptation under low-resource settings are desired. DUDE aims to set a new standard as a more practical, long-standing benchmark for the community, and we hope that it will lead to future extensions and contributions that address real-world challenges. Finally, our work illustrates the importance of finding more efficient ways to model language, images, and layout in DocAI.

Comments:	Accepted at ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2305.08455 [cs.CV]
	(or arXiv:2305.08455v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.08455

Submission history

From: Jordy Van Landeghem
[v1] Mon, 15 May 2023 08:54:32 UTC (14,620 KB)
[v2] Tue, 30 May 2023 10:06:57 UTC (14,620 KB)
[v3] Mon, 11 Sep 2023 10:36:41 UTC (14,620 KB)
