REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction

Sharif, Omar; Gatto, Joseph; Basak, Madhusudan; Preum, Sarah M.

Computer Science > Computation and Language

arXiv:2502.16838 (cs)

[Submitted on 24 Feb 2025]

Title:REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction

Authors:Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah M. Preum

View PDF HTML (experimental)

Abstract:Event argument extraction identifies arguments for predefined event roles in text. Traditional evaluations rely on exact match (EM), requiring predicted arguments to match annotated spans exactly. However, this approach fails for generative models like large language models (LLMs), which produce diverse yet semantically accurate responses. EM underestimates performance by disregarding valid variations, implicit arguments (unstated but inferable), and scattered arguments (distributed across a document). To bridge this gap, we introduce Reliable Evaluation framework for Generative event argument extraction (REGen), a framework that better aligns with human judgment. Across six datasets, REGen improves performance by an average of 23.93 F1 points over EM. Human validation further confirms REGen's effectiveness, achieving 87.67% alignment with human assessments of argument correctness.

Comments:	20 pages, 9 figures, 13 tables
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.16838 [cs.CL]
	(or arXiv:2502.16838v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.16838

Submission history

From: Omar Sharif [view email]
[v1] Mon, 24 Feb 2025 04:49:49 UTC (12,290 KB)

Computer Science > Computation and Language

Title:REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators