Realistic Evaluation Principles for Cross-document Coreference Resolution

Cattan, Arie; Eirew, Alon; Stanovsky, Gabriel; Joshi, Mandar; Dagan, Ido

arXiv:2106.04192 (cs)

[Submitted on 8 Jun 2021]

Abstract: We point out that common evaluation practices for cross-document coreference resolution have been unrealistically permissive in their assumed settings, yielding inflated results. We propose addressing this issue via two evaluation methodology principles. First, as in other tasks, models should be evaluated on predicted mentions rather than on gold mentions. Doing this raises a subtle issue regarding singleton coreference clusters, which we address by decoupling the evaluation of mention detection from that of coreference linking. Second, we argue that models should not exploit the synthetic topic structure of the standard ECB+ dataset, forcing models to confront the lexical ambiguity challenge, as intended by the dataset creators. We demonstrate empirically the drastic impact of our more realistic evaluation principles on a competitive model, yielding a score which is 33 F1 lower compared to evaluating by prior lenient practices.

Comments:	*SEM 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2106.04192 [cs.CL]
	(or arXiv:2106.04192v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2106.04192

Submission history

From: Arie Cattan
[v1] Tue, 8 Jun 2021 09:05:21 UTC (5,232 KB)
