KILT: a Benchmark for Knowledge Intensive Language Tasks

Petroni, Fabio; Piktus, Aleksandra; Fan, Angela; Lewis, Patrick; Yazdani, Majid; De Cao, Nicola; Thorne, James; Jernite, Yacine; Karpukhin, Vladimir; Maillard, Jean; Plachouras, Vassilis; Rocktäschel, Tim; Riedel, Sebastian

Computer Science > Computation and Language

arXiv:2009.02252 (cs)

[Submitted on 4 Sep 2020 (v1), last revised 27 May 2021 (this version, v4)]

Title:KILT: a Benchmark for Knowledge Intensive Language Tasks

Authors:Fabio Petroni, Aleksandra Piktus, Angela Fan, Patrick Lewis, Majid Yazdani, Nicola De Cao, James Thorne, Yacine Jernite, Vladimir Karpukhin, Jean Maillard, Vassilis Plachouras, Tim Rocktäschel, Sebastian Riedel

View PDF

Abstract:Challenging problems such as open-domain question answering, fact checking, slot filling and entity linking require access to large, external knowledge sources. While some models do well on individual tasks, developing general models is difficult as each task might require computationally expensive indexing of custom knowledge sources, in addition to dedicated infrastructure. To catalyze research on models that condition on specific information in large textual resources, we present a benchmark for knowledge-intensive language tasks (KILT). All tasks in KILT are grounded in the same snapshot of Wikipedia, reducing engineering turnaround through the re-use of components, as well as accelerating research into task-agnostic memory architectures. We test both task-specific and general baselines, evaluating downstream performance in addition to the ability of the models to provide provenance. We find that a shared dense vector index coupled with a seq2seq model is a strong baseline, outperforming more tailor-made approaches for fact checking, open-domain question answering and dialogue, and yielding competitive results on entity linking and slot filling, by generating disambiguated text. KILT data and code are available at this https URL.

Comments:	accepted at NAACL 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2009.02252 [cs.CL]
	(or arXiv:2009.02252v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2009.02252

Submission history

From: Fabio Petroni [view email]
[v1] Fri, 4 Sep 2020 15:32:19 UTC (7,991 KB)
[v2] Fri, 9 Apr 2021 08:59:41 UTC (9,887 KB)
[v3] Mon, 12 Apr 2021 09:27:43 UTC (9,887 KB)
[v4] Thu, 27 May 2021 15:20:59 UTC (9,886 KB)

Computer Science > Computation and Language

Title:KILT: a Benchmark for Knowledge Intensive Language Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:KILT: a Benchmark for Knowledge Intensive Language Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators