Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Zhao, Raoyuan; Köksal, Abdullatif; Modarressi, Ali; Hedderich, Michael A.; Schütze, Hinrich

Computer Science > Computation and Language

arXiv:2505.21701 (cs)

[Submitted on 27 May 2025 (v1), last revised 30 May 2025 (this version, v2)]

Title:Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Authors:Raoyuan Zhao, Abdullatif Köksal, Ali Modarressi, Michael A. Hedderich, Hinrich Schütze

View PDF HTML (experimental)

Abstract:The reliability of large language models (LLMs) is greatly compromised by their tendency to hallucinate, underscoring the need for precise identification of knowledge gaps within LLMs. Various methods for probing such gaps exist, ranging from calibration-based to prompting-based methods. To evaluate these probing methods, in this paper, we propose a new process based on using input variations and quantitative metrics. Through this, we expose two dimensions of inconsistency in knowledge gap probing. (1) Intra-method inconsistency: Minimal non-semantic perturbations in prompts lead to considerable variance in detected knowledge gaps within the same probing method; e.g., the simple variation of shuffling answer options can decrease agreement to around 40%. (2) Cross-method inconsistency: Probing methods contradict each other on whether a model knows the answer. Methods are highly inconsistent -- with decision consistency across methods being as low as 7% -- even though the model, dataset, and prompt are all the same. These findings challenge existing probing methods and highlight the urgent need for perturbation-robust probing frameworks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2505.21701 [cs.CL]
	(or arXiv:2505.21701v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.21701

Submission history

From: Raoyuan Zhao [view email]
[v1] Tue, 27 May 2025 19:39:49 UTC (693 KB)
[v2] Fri, 30 May 2025 14:39:34 UTC (693 KB)

Computer Science > Computation and Language

Title:Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators