TaoSR1: The Thinking Model for E-commerce Relevance Search

Dong, Chenhe; Yao, Shaowei; Jiao, Pengkun; Yang, Jianhui; Jin, Yiming; Huang, Zerui; Zhou, Xiaojiang; Ou, Dan; Tang, Haihong; Zheng, Bo

Computer Science > Information Retrieval

arXiv:2508.12365 (cs)

[Submitted on 17 Aug 2025 (v1), last revised 27 Oct 2025 (this version, v2)]

Title:TaoSR1: The Thinking Model for E-commerce Relevance Search

Authors:Chenhe Dong, Shaowei Yao, Pengkun Jiao, Jianhui Yang, Yiming Jin, Zerui Huang, Xiaojiang Zhou, Dan Ou, Haihong Tang, Bo Zheng

View PDF HTML (experimental)

Abstract:Query-product relevance prediction is a core task in e-commerce search. BERT-based models excel at semantic matching but lack complex reasoning capabilities. While Large Language Models (LLMs) are explored, most still use discriminative fine-tuning or distill to smaller models for deployment. We propose a framework to directly deploy LLMs for this task, addressing key challenges: Chain-of-Thought (CoT) error accumulation, discriminative hallucination, and deployment feasibility. Our framework, TaoSR1, involves three stages: (1) Supervised Fine-Tuning (SFT) with CoT to instill reasoning; (2) Offline sampling with a pass@N strategy and Direct Preference Optimization (DPO) to improve generation quality; and (3) Difficulty-based dynamic sampling with Group Relative Policy Optimization (GRPO) to mitigate discriminative hallucination. Additionally, post-CoT processing and a cumulative probability-based partitioning method enable efficient online deployment. TaoSR1 significantly outperforms baselines on offline datasets and achieves substantial gains in online side-by-side human evaluations, introducing a novel paradigm for applying CoT reasoning to relevance classification.

Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2508.12365 [cs.IR]
	(or arXiv:2508.12365v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2508.12365

Submission history

From: Shaowei Yao [view email]
[v1] Sun, 17 Aug 2025 13:48:48 UTC (536 KB)
[v2] Mon, 27 Oct 2025 13:03:18 UTC (537 KB)

Computer Science > Information Retrieval

Title:TaoSR1: The Thinking Model for E-commerce Relevance Search

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:TaoSR1: The Thinking Model for E-commerce Relevance Search

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators