FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs

Sawczyn, Albert; Binkowski, Jakub; Janiak, Denis; Gabrys, Bogdan; Kajdanowicz, Tomasz

arXiv:2503.17229 (cs)

[Submitted on 21 Mar 2025 (v1), last revised 30 May 2025 (this version, v2)]

Abstract: Large Language Models (LLMs) frequently generate hallucinated content, posing significant challenges for applications where factuality is crucial. While existing hallucination detection methods typically operate at the sentence level or passage level, we propose FactSelfCheck, a novel black-box sampling-based method that enables fine-grained fact-level detection. Our approach represents text as knowledge graphs consisting of facts in the form of triples. Through analyzing factual consistency across multiple LLM responses, we compute fine-grained hallucination scores without requiring external resources or training data. Our evaluation demonstrates that FactSelfCheck performs competitively with leading sentence-level sampling-based methods while providing more detailed insights. Most notably, our fact-level approach significantly improves hallucination correction, achieving a 35.5% increase in factual content compared to the baseline, while sentence-level SelfCheckGPT yields only a 10.6% improvement. The granular nature of our detection enables more precise identification and correction of hallucinated content. Additionally, we contribute a new dataset for evaluating sampling-based methods, FavaMultiSamples.
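
To make the idea of fact-level, sampling-based scoring concrete, the following is a minimal sketch and not the authors' implementation. It assumes facts have already been extracted as (subject, relation, object) triples, and it uses exact triple matching as a stand-in for the consistency check; the function name `fact_hallucination_scores` and the toy data are hypothetical. Each fact is scored by the fraction of sampled responses that do not contain it.

```python
from typing import Dict, Iterable, List, Set, Tuple

# A fact is a (subject, relation, object) triple; this alias is illustrative.
Triple = Tuple[str, str, str]


def fact_hallucination_scores(
    response_facts: Iterable[Triple],
    sample_graphs: List[Set[Triple]],
) -> Dict[Triple, float]:
    """Return, for each fact, the fraction of samples that do not contain it.

    ASSUMPTION: exact triple matching stands in for the consistency check.
    A score near 1.0 means few samples support the fact.
    """
    if not sample_graphs:
        raise ValueError("at least one sampled response is required")
    scores: Dict[Triple, float] = {}
    for fact in response_facts:
        # Count the sampled knowledge graphs that contain this exact triple.
        supporting = sum(1 for graph in sample_graphs if fact in graph)
        scores[fact] = 1.0 - supporting / len(sample_graphs)
    return scores


# Toy data (hypothetical): facts from the main response and four samples.
response = [
    ("Paris", "capital_of", "France"),
    ("Paris", "population", "10 million"),
]
samples = [
    {("Paris", "capital_of", "France")},
    {("Paris", "capital_of", "France"), ("Paris", "population", "2 million")},
    {("Paris", "capital_of", "France")},
    set(),
]
scores = fact_hallucination_scores(response, samples)
```

In this toy input the capital fact appears in three of four samples and scores 0.25, while the population fact appears in none and scores 1.0. A practical system would need a softer notion of agreement than exact matching, since the same fact can be phrased as different triples.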

Comments:	Preprint
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2503.17229 [cs.LG]
	(or arXiv:2503.17229v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.17229

Submission history

From: Albert Sawczyn
[v1] Fri, 21 Mar 2025 15:32:24 UTC (377 KB)
[v2] Fri, 30 May 2025 14:59:56 UTC (453 KB)