DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

Li, Jingyao; Sun, Hao; Qiao, Zile; Jiang, Yong; Xie, Pengjun; Huang, Fei; Xu, Hong; Jia, Jiaya

arXiv:2506.21343 (cs)

[Submitted on 26 Jun 2025]


Abstract: Traditional benchmarks for large language models (LLMs) typically rely on static evaluations through storytelling or opinion expression, which fail to capture the dynamic requirements of real-time information processing in contemporary applications. To address this limitation, we present DynamicBench, a benchmark designed to evaluate the proficiency of LLMs in storing and processing up-to-the-minute data. DynamicBench utilizes a dual-path retrieval pipeline that integrates web searches with local report databases. It requires domain-specific knowledge, so that report generation within specialized fields can be assessed for accuracy. By evaluating models in scenarios that either provide or withhold external documents, DynamicBench measures their capability to process recent information independently or to leverage contextual enhancements. Additionally, we introduce an advanced report generation system adept at managing dynamic information synthesis. Our experimental results confirm the efficacy of our approach, with our method achieving state-of-the-art performance, surpassing GPT-4o in document-free and document-assisted scenarios by 7.0% and 5.8%, respectively. The code and data will be made publicly available.
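
The abstract does not specify how the dual-path retrieval pipeline merges its two sources or how the two evaluation scenarios are prompted. The following is only a minimal sketch of one plausible reading, not the authors' implementation: the names `Document`, `dual_path_retrieve`, and `build_prompt`, the round-robin merge, and the prompt wording are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Document:
    source: str  # "web" or "local" (hypothetical labels)
    text: str


# Hypothetical retriever type: takes a query, returns ranked documents.
Retriever = Callable[[str], List[Document]]


def dual_path_retrieve(
    query: str, web_search: Retriever, local_search: Retriever, k: int = 4
) -> List[Document]:
    """Interleave results from both paths, skip duplicate texts, keep the top k."""
    web, local = web_search(query), local_search(query)
    merged: List[Document] = []
    seen = set()
    for i in range(max(len(web), len(local))):
        for results in (web, local):
            if i < len(results) and results[i].text not in seen:
                seen.add(results[i].text)
                merged.append(results[i])
    return merged[:k]


def build_prompt(query: str, documents: Optional[List[Document]] = None) -> str:
    """Document-free prompt when no documents are given, document-assisted otherwise."""
    if not documents:
        return f"Write a report on: {query}"
    context = "\n".join(f"[{d.source}] {d.text}" for d in documents)
    return f"Reference documents:\n{context}\n\nWrite a report on: {query}"
```

In this sketch, calling `build_prompt(query)` corresponds to the scenario that withholds external documents, while `build_prompt(query, dual_path_retrieve(query, web, local))` corresponds to the scenario that provides them; `web` and `local` stand in for whatever search backends are actually used.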

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2506.21343 [cs.LG]
	(or arXiv:2506.21343v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.21343

Submission history

From: Jingyao Li
[v1] Thu, 26 Jun 2025 14:53:44 UTC (1,700 KB)
