FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary

Blevins, Terra; Joshi, Mandar; Zettlemoyer, Luke

Computer Science > Computation and Language

arXiv:2102.07983 (cs)

[Submitted on 16 Feb 2021]

Title:FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary

Authors:Terra Blevins, Mandar Joshi, Luke Zettlemoyer

View PDF

Abstract:Current models for Word Sense Disambiguation (WSD) struggle to disambiguate rare senses, despite reaching human performance on global WSD metrics. This stems from a lack of data for both modeling and evaluating rare senses in existing WSD datasets. In this paper, we introduce FEWS (Few-shot Examples of Word Senses), a new low-shot WSD dataset automatically extracted from example sentences in Wiktionary. FEWS has high sense coverage across different natural language domains and provides: (1) a large training set that covers many more senses than previous datasets and (2) a comprehensive evaluation set containing few- and zero-shot examples of a wide variety of senses. We establish baselines on FEWS with knowledge-based and neural WSD approaches and present transfer learning experiments demonstrating that models additionally trained with FEWS better capture rare senses in existing WSD datasets. Finally, we find humans outperform the best baseline models on FEWS, indicating that FEWS will support significant future work on low-shot WSD.

Comments:	EACL 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2102.07983 [cs.CL]
	(or arXiv:2102.07983v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2102.07983

Submission history

From: Terra Blevins [view email]
[v1] Tue, 16 Feb 2021 07:13:34 UTC (8,516 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-02

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Terra Blevins
Mandar Joshi
Luke Zettlemoyer

export BibTeX citation

Computer Science > Computation and Language

Title:FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators