Auto-ARGUE: LLM-Based Report Generation Evaluation

Walden, William; Mason, Marc; Weller, Orion; Dietz, Laura; Recknor, Hannah; Li, Bryan; Liu, Gabrielle Kaili-May; Hou, Yu; Mayfield, James; Yang, Eugene

arXiv:2509.26184 (cs)

[Submitted on 30 Sep 2025 (v1), last revised 1 Oct 2025 (this version, v2)]

Abstract: Generation of long-form, citation-backed reports is a primary use case for retrieval augmented generation (RAG) systems. While open-source evaluation tools exist for various RAG tasks, ones tailored to report generation are lacking. Accordingly, we introduce Auto-ARGUE, a robust LLM-based implementation of the recent ARGUE framework for report generation evaluation. We present analysis of Auto-ARGUE on the report generation pilot task from the TREC 2024 NeuCLIR track, showing good system-level correlations with human judgments. We further release a web app for visualization of Auto-ARGUE outputs.

Comments:	ECIR 2025 demo format
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2509.26184 [cs.IR]
	(or arXiv:2509.26184v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2509.26184

Submission history

From: William Walden
[v1] Tue, 30 Sep 2025 12:41:11 UTC (4,091 KB)
[v2] Wed, 1 Oct 2025 13:05:17 UTC (4,129 KB)