Hallucination Detection on a Budget: Efficient Bayesian Estimation of Semantic Entropy

Ciosek, Kamil; Felicioni, Nicolò; Ghiassian, Sina


arXiv:2504.03579 (cs)

[Submitted on 4 Apr 2025]


Abstract: Detecting whether an LLM hallucinates is an important research challenge. One promising approach is to estimate the semantic entropy (Farquhar et al., 2024) of the distribution of generated sequences. We propose a new algorithm for this estimation, with two main advantages. First, by taking a Bayesian approach, we obtain substantially better semantic entropy estimates for a given budget of samples from the LLM. Second, we can tune the number of samples adaptively so that 'harder' contexts receive more samples. We demonstrate empirically that our approach systematically beats the baselines, requiring only 59% of the samples used by Farquhar et al. (2024) to achieve the same quality of hallucination detection as measured by AUROC. Moreover, quite counterintuitively, our estimator is useful even with just one sample from the LLM.
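
To make the quantity in the abstract concrete, here is a minimal Python sketch of two generic entropy estimators over meaning clusters. It is not the algorithm proposed in the paper, whose details the abstract does not give. It assumes the sampled answers have already been grouped by meaning (the hypothetical `meaning_ids` list holds one cluster label per sample) and, for the Bayesian variant, that the number of possible meanings `num_meanings` is known and fixed with a uniform Dirichlet prior. Both assumptions are purely illustrative, as are the function names.

```python
import math
from collections import Counter


def plugin_semantic_entropy(meaning_ids):
    """Entropy (in nats) of the empirical distribution over meaning clusters."""
    counts = Counter(meaning_ids)
    n = len(meaning_ids)
    return -sum((c / n) * math.log(c / n) for c in counts.values())


def harmonic(n):
    """n-th harmonic number H_n = 1 + 1/2 + ... + 1/n, with H_0 = 0."""
    return sum(1.0 / k for k in range(1, n + 1))


def dirichlet_mean_entropy(meaning_ids, num_meanings):
    """Posterior mean entropy (in nats) under a uniform Dirichlet(1, ..., 1) prior.

    Assumption for illustration only: the number of meanings is known.
    For integer Dirichlet parameters a_i with sum A, the standard result
    E[H] = psi(A + 1) - sum_i (a_i / A) * psi(a_i + 1) reduces to
    H_A - sum_i (a_i / A) * H_{a_i}, because the Euler-Mascheroni
    constant cancels.
    """
    counts = Counter(meaning_ids)
    if len(counts) > num_meanings:
        raise ValueError("observed more meanings than num_meanings")
    # Posterior parameters: observed count + 1 for seen meanings, 1 for unseen ones.
    alphas = [c + 1 for c in counts.values()] + [1] * (num_meanings - len(counts))
    total = sum(alphas)
    return harmonic(total) - sum((a / total) * harmonic(a) for a in alphas)


if __name__ == "__main__":
    samples = ["paris", "paris", "lyon", "paris"]  # hypothetical cluster labels
    print(plugin_semantic_entropy(samples))
    print(dirichlet_mean_entropy(samples, num_meanings=3))
```

With a single sample the plug-in estimate is always zero, whereas the posterior mean is not, which shows in a toy setting how a prior changes the estimate when samples are scarce.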

Comments:	22 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2504.03579 [cs.LG]
	(or arXiv:2504.03579v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.03579

Submission history

From: Sina Ghiassian
[v1] Fri, 4 Apr 2025 16:30:44 UTC (2,155 KB)
