PrefRAG: Preference-Driven Multi-Source Retrieval Augmented Generation

Zhao, Qingfei; Wang, Ruobing; Cen, Yukuo; Zha, Daren; Tan, Shicheng; Tang, Jie

Computer Science > Computation and Language

arXiv:2411.00689 (cs)

[Submitted on 1 Nov 2024 (v1), last revised 7 Apr 2025 (this version, v2)]

Title:PrefRAG: Preference-Driven Multi-Source Retrieval Augmented Generation

Authors:Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Jie Tang

View PDF HTML (experimental)

Abstract:Retrieval-Augmented Generation (RAG) has emerged as a reliable external knowledge augmentation technique to mitigate hallucination issues and parameterized knowledge limitations in Large Language Models (LLMs). Existing adaptive RAG (ARAG) systems excel at in-depth exploration within a single source but struggle to effectively and controllably explore different retrieval sources, as they fail to foresee their internal knowledge features. We develop a novel multi-source ARAG system, PrefRAG, which enhances RAG by enabling in-depth and controllable exploration of diverse retrieval sources through preference-driven adaptive retrieval and self-reflection. PrefRAG first fully explores controllable local sources in adaptive retrieval and supplements with the web when appropriate, ultimately selecting the optimal source for knowledge observation. Subsequently, PrefRAG feeds answer quality feedback into the retrieval process, optimizing it from the generation perspective to produce higher-quality responses. Extensive experiments confirm its superiority, high retrieval efficiency, and knowledge controllability. PrefRAG outperforms Vanilla RAG and the leading MS-ARAG by up to 25.6% and 13.9% respectively. Additionally, PrefRAG trained with DPO achieves higher performance. The code and data are available at this https URL.

Comments:	33 pages, 5 figures, 28 tables
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2411.00689 [cs.CL]
	(or arXiv:2411.00689v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.00689

Submission history

From: Qingfei Zhao [view email]
[v1] Fri, 1 Nov 2024 15:50:58 UTC (309 KB)
[v2] Mon, 7 Apr 2025 16:38:59 UTC (719 KB)

Computer Science > Computation and Language

Title:PrefRAG: Preference-Driven Multi-Source Retrieval Augmented Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:PrefRAG: Preference-Driven Multi-Source Retrieval Augmented Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators