Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs

Wang, Ziliang; An, Kang; Zheng, Xuhui; Qian, Faqiang; Zhang, Weikun; Ouyang, Cijun; Cai, Jialu; Wang, Yuhang; Wu, Yichao

arXiv:2510.00861 (cs)

[Submitted on 1 Oct 2025]

Abstract: While search-augmented large language models (LLMs) exhibit impressive capabilities, their reliability in complex multi-hop reasoning remains limited. This limitation arises from three fundamental challenges: decomposition errors, where tasks are incorrectly broken down; retrieval missing, where key evidence fails to be retrieved; and reasoning errors, where flawed logic propagates through the reasoning chain. A single failure in any of these stages can derail the final answer. We propose Erasable Reinforcement Learning (ERL), a novel framework that transforms fragile reasoning into a robust process. ERL explicitly identifies faulty steps, erases them, and regenerates reasoning in place, preventing defective logic from propagating through the reasoning chain. This targeted correction mechanism turns brittle reasoning into a more resilient process. Models trained with ERL, termed ESearch, achieve substantial improvements on HotpotQA, MuSiQue, 2Wiki, and Bamboogle, with the 3B model achieving +8.48% EM and +11.56% F1, and the 7B model achieving +5.38% EM and +7.22% F1 over previous state-of-the-art (SOTA) results. These findings suggest that erasable reinforcement learning provides a powerful paradigm shift for robust multi-step reasoning in LLMs.
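The abstract describes ERL as identifying faulty steps, erasing them, and regenerating reasoning in place. The following is a minimal Python sketch of that control flow only, not the authors' implementation. The abstract does not say how faulty steps are detected, how regeneration is done, or how the reinforcement learning signal is defined, so `erase_and_regenerate`, `find_faulty`, `regenerate`, and `max_rounds` are all hypothetical names introduced for illustration. The sketch also assumes that only the flagged step is replaced and that later steps are left untouched until the detector flags them.

```python
from typing import Callable, List, Optional

Step = str


def erase_and_regenerate(
    steps: List[Step],
    find_faulty: Callable[[List[Step]], Optional[int]],
    regenerate: Callable[[List[Step], int], Step],
    max_rounds: int = 3,
) -> List[Step]:
    """Erase the first faulty step and regenerate it in place, up to max_rounds times.

    Hypothetical helper: the detector and regenerator are supplied by the
    caller because the abstract does not specify them.
    """
    chain = list(steps)  # work on a copy so the caller's list is not mutated
    for _ in range(max_rounds):
        idx = find_faulty(chain)
        if idx is None:
            break  # no faulty step found, the chain is accepted as is
        # "Erase" the faulty step by overwriting it with a new step that is
        # generated from the steps before it (assumption of this sketch).
        chain[idx] = regenerate(chain[:idx], idx)
    return chain


# Toy stand-ins for a real detector and a real model call (assumptions).
def toy_find_faulty(chain: List[Step]) -> Optional[int]:
    for i, step in enumerate(chain):
        if "[BAD]" in step:
            return i
    return None


def toy_regenerate(prefix: List[Step], idx: int) -> Step:
    return f"regenerated step {idx} (given {len(prefix)} earlier steps)"


chain = ["decompose question", "[BAD] search: wrong entity", "answer"]
fixed = erase_and_regenerate(chain, toy_find_faulty, toy_regenerate)
```

In this toy run only the middle step is replaced, and the original `chain` list is left unchanged. A real system would substitute a learned or rule-based detector for `toy_find_faulty` and a model call for `toy_regenerate`.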

Comments:	10 pages, 4 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2510.00861 [cs.CL]
	(or arXiv:2510.00861v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2510.00861

Submission history

From: Yuhang Wang
[v1] Wed, 1 Oct 2025 13:10:36 UTC (2,984 KB)
