Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Pletenev, Sergey; Marina, Maria; Ivanov, Nikolay; Galimzianova, Daria; Krayko, Nikita; Salnikov, Mikhail; Konovalov, Vasily; Panchenko, Alexander; Moskvoretskii, Viktor

Computer Science > Computation and Language

arXiv:2505.21115 (cs)

[Submitted on 27 May 2025]

Title:Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Authors:Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

View PDF

Abstract:Large Language Models (LLMs) often hallucinate in question answering (QA) tasks. A key yet underexplored factor contributing to this is the temporality of questions -- whether they are evergreen (answers remain stable over time) or mutable (answers change). In this work, we introduce EverGreenQA, the first multilingual QA dataset with evergreen labels, supporting both evaluation and training. Using EverGreenQA, we benchmark 12 modern LLMs to assess whether they encode question temporality explicitly (via verbalized judgments) or implicitly (via uncertainty signals). We also train EG-E5, a lightweight multilingual classifier that achieves SoTA performance on this task. Finally, we demonstrate the practical utility of evergreen classification across three applications: improving self-knowledge estimation, filtering QA datasets, and explaining GPT-4o retrieval behavior.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2505.21115 [cs.CL]
	(or arXiv:2505.21115v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.21115

Submission history

From: Viktor Moskvoretskii [view email]
[v1] Tue, 27 May 2025 12:35:13 UTC (217 KB)

Computer Science > Computation and Language

Title:Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators