Skip to main content

Showing 1–1 of 1 results for author: Rivera, M U

.
  1. arXiv:2505.17345  [pdf, ps, other

    cs.CL

    Language models should be subject to repeatable, open, domain-contextualized hallucination benchmarking

    Authors: Justin D. Norman, Michael U. Rivera, D. Alex Hughes

    Abstract: Plausible, but inaccurate, tokens in model-generated text are widely believed to be pervasive and problematic for the responsible adoption of language models. Despite this concern, there is little scientific work that attempts to measure the prevalence of language model hallucination in a comprehensive way. In this paper, we argue that language models should be evaluated using repeatable, open, an… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 9 pages