Skip to main content

Showing 1–1 of 1 results for author: Reganova, E

.
  1. arXiv:2411.14465  [pdf, other

    cs.CL cs.LG

    Testing Uncertainty of Large Language Models for Physics Knowledge and Reasoning

    Authors: Elizaveta Reganova, Peter Steinbach

    Abstract: Large Language Models (LLMs) have gained significant popularity in recent years for their ability to answer questions in various fields. However, these models have a tendency to "hallucinate" their responses, making it challenging to evaluate their performance. A major challenge is determining how to assess the certainty of a model's predictions and how it correlates with accuracy. In this work, w… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.