Consistent Document-Level Relation Extraction via Counterfactuals

Modarressi, Ali; Köksal, Abdullatif; Schütze, Hinrich

Computer Science > Computation and Language

arXiv:2407.06699 (cs)

[Submitted on 9 Jul 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:Consistent Document-Level Relation Extraction via Counterfactuals

Authors:Ali Modarressi, Abdullatif Köksal, Hinrich Schütze

View PDF HTML (experimental)

Abstract:Many datasets have been developed to train and evaluate document-level relation extraction (RE) models. Most of these are constructed using real-world data. It has been shown that RE models trained on real-world data suffer from factual biases. To evaluate and address this issue, we present CovEReD, a counterfactual data generation approach for document-level relation extraction datasets using entity replacement. We first demonstrate that models trained on factual data exhibit inconsistent behavior: while they accurately extract triples from factual data, they fail to extract the same triples after counterfactual modification. This inconsistency suggests that models trained on factual data rely on spurious signals such as specific entities and external knowledge $\unicode{x2013}$ rather than on the input context $\unicode{x2013}$ to extract triples. We show that by generating document-level counterfactual data with CovEReD and training models on them, consistency is maintained with minimal impact on RE performance. We release our CovEReD pipeline as well as Re-DocRED-CF, a dataset of counterfactual RE documents, to assist in evaluating and addressing inconsistency in document-level RE.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.06699 [cs.CL]
	(or arXiv:2407.06699v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.06699

Submission history

From: Ali Modarressi [view email]
[v1] Tue, 9 Jul 2024 09:21:55 UTC (200 KB)
[v2] Tue, 15 Oct 2024 13:37:35 UTC (191 KB)

Computer Science > Computation and Language

Title:Consistent Document-Level Relation Extraction via Counterfactuals

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Consistent Document-Level Relation Extraction via Counterfactuals

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators