Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning

Stoisser, Josefa Lia; Martell, Marc Boubnovski; Fauqueur, Julien

Computer Science > Computation and Language

arXiv:2505.00016 (cs)

[Submitted on 23 Apr 2025 (v1), last revised 2 May 2025 (this version, v2)]

Title:Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning

Authors:Josefa Lia Stoisser, Marc Boubnovski Martell, Julien Fauqueur

View PDF HTML (experimental)

Abstract:This work reframes the Text-to-SQL task as a pathway for teaching large language models (LLMs) to reason over and manipulate tabular data--moving beyond the traditional focus on query generation. We propose a two-stage framework that leverages SQL supervision to develop transferable table reasoning capabilities. First, we synthesize detailed chain-of-thought (CoT) traces from real-world SQL queries, providing step-by-step, clause-level supervision that teaches the model how to traverse, filter, and aggregate table fields. Second, we introduce a Group Relative Policy Optimization (GRPO) reinforcement learning objective that connects SQL execution accuracy to generalizable reasoning by encouraging steps that extend beyond task-specific syntax and transfer across datasets. Empirically, our approach improves performance on standard Text-to-SQL benchmarks and achieves substantial gains on reasoning-intensive datasets such as BIRD and CRT-QA, demonstrating enhanced generalization and interpretability. Specifically, the distilled-quantized LLaMA model achieved a relative 33.9\% increase in accuracy when trained on Text-to-SQL tasks, while Qwen achieved a relative 14.5\% increase. These results suggest that SQL can serve not only as a target formalism but also as an effective scaffold for learning robust, transferable reasoning over structured data.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2505.00016 [cs.CL]
	(or arXiv:2505.00016v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.00016

Submission history

From: Marc Boubnovski Martell [view email]
[v1] Wed, 23 Apr 2025 19:02:04 UTC (1,295 KB)
[v2] Fri, 2 May 2025 11:34:00 UTC (1,295 KB)

Computer Science > Computation and Language

Title:Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators