Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Servedio, Giovanni; De Bellis, Alessandro; Di Palma, Dario; Anelli, Vito Walter; Di Noia, Tommaso

arXiv:2505.16520 (cs)

[Submitted on 22 May 2025 (v1), last revised 30 May 2025 (this version, v3)]

Abstract: Factual hallucinations are a major challenge for Large Language Models (LLMs). They undermine reliability and user trust by generating inaccurate or fabricated content. Recent studies suggest that when generating false statements, the internal states of LLMs encode information about truthfulness. However, these studies often rely on synthetic datasets that lack realism, which limits generalization when evaluating the factual accuracy of text generated by the model itself. In this paper, we challenge the findings of previous work by investigating truthfulness encoding capabilities, leading to the generation of a more realistic and challenging dataset. Specifically, we extend previous work by introducing: (1) a strategy for sampling plausible true-false factoid sentences from tabular data and (2) a procedure for generating realistic, LLM-dependent true-false datasets from Question Answering collections. Our analysis of two open-source LLMs reveals that while the findings from previous studies are partially validated, generalization to LLM-generated datasets remains challenging. This study lays the groundwork for future research on factuality in LLMs and offers practical guidelines for more effective evaluation.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2505.16520 [cs.CL]
	(or arXiv:2505.16520v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.16520

Submission history

From: Dario Di Palma
[v1] Thu, 22 May 2025 11:00:53 UTC (58 KB)
[v2] Mon, 26 May 2025 07:30:08 UTC (58 KB)
[v3] Fri, 30 May 2025 10:53:48 UTC (75 KB)
