Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

Zamani, Hamed; Bendersky, Michael

arXiv:2405.02816 (cs)

[Submitted on 5 May 2024]

Abstract: This paper introduces Stochastic RAG, a novel approach for end-to-end optimization of retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of marginalization and document independence made in most prior work. Stochastic RAG casts the retrieval process in RAG as stochastic sampling without replacement. Through this formulation, we employ straight-through Gumbel-top-k, which provides a differentiable approximation of sampling without replacement and enables effective end-to-end optimization for RAG. We conduct extensive experiments on seven diverse datasets covering a wide range of tasks: open-domain question answering, fact verification, slot-filling for relation extraction, and dialogue systems. By applying this optimization method to a recent and effective RAG model, we advance state-of-the-art results on six out of seven datasets.
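
As an illustration of the sampling step named in the abstract, the following is a minimal NumPy sketch of Gumbel-top-k sampling without replacement. It is not the authors' implementation: the function name `gumbel_top_k`, the example scores, and the use of NumPy are assumptions made for illustration. The straight-through gradient estimator is omitted, since it requires an automatic-differentiation framework.

```python
import numpy as np


def gumbel_top_k(scores, k, rng):
    """Sample k distinct indices without replacement (hypothetical helper).

    Adding independent Gumbel(0, 1) noise to each score and keeping the
    k largest perturbed scores yields an ordered sample without
    replacement from the distribution softmax(scores).
    """
    scores = np.asarray(scores, dtype=float)
    # One independent Gumbel(0, 1) draw per candidate document.
    noise = rng.gumbel(loc=0.0, scale=1.0, size=scores.shape)
    perturbed = scores + noise
    # Sort in descending order of perturbed score and keep the first k.
    return np.argsort(-perturbed)[:k]


# Illustrative retrieval scores for five candidate documents (made up).
rng = np.random.default_rng(0)
retrieval_scores = np.array([2.0, 1.0, 0.5, -1.0, 0.0])
sampled = gumbel_top_k(retrieval_scores, k=3, rng=rng)
print(sampled)  # three distinct document indices
```

Because the noise is redrawn on every call, repeated calls return different document subsets, with higher-scoring documents selected more often.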

Comments:	To appear in the proceedings of SIGIR 2024
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2405.02816 [cs.CL]
	(or arXiv:2405.02816v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.02816

Submission history

From: Hamed Zamani
[v1] Sun, 5 May 2024 05:42:33 UTC (175 KB)
