NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Soares, Livio Baldini; Gillick, Daniel; Cole, Jeremy R.; Kwiatkowski, Tom

Computer Science > Computation and Language

arXiv:2305.14499 (cs)

[Submitted on 23 May 2023 (v1), last revised 23 Oct 2023 (this version, v2)]

Title:NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Authors:Livio Baldini Soares, Daniel Gillick, Jeremy R. Cole, Tom Kwiatkowski

View PDF

Abstract:Neural document rerankers are extremely effective in terms of accuracy. However, the best models require dedicated hardware for serving, which is costly and often not feasible. To avoid this serving-time requirement, we present a method of capturing up to 86% of the gains of a Transformer cross-attention model with a lexicalized scoring function that only requires 10-6% of the Transformer's FLOPs per document and can be served using commodity CPUs. When combined with a BM25 retriever, this approach matches the quality of a state-of-the art dual encoder retriever, that still requires an accelerator for query encoding. We introduce NAIL (Non-Autoregressive Indexing with Language models) as a model architecture that is compatible with recent encoder-decoder and decoder-only large language models, such as T5, GPT-3 and PaLM. This model architecture can leverage existing pre-trained checkpoints and can be fine-tuned for efficiently constructing document representations that do not require neural processing of queries.

Comments:	To appear at EMNLP 2023
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as:	arXiv:2305.14499 [cs.CL]
	(or arXiv:2305.14499v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.14499

Submission history

From: Livio Baldini Soares [view email]
[v1] Tue, 23 May 2023 20:09:52 UTC (7,184 KB)
[v2] Mon, 23 Oct 2023 14:46:34 UTC (292 KB)

Computer Science > Computation and Language

Title:NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators