Showing 1–3 of 3 results for author: Berenbeim, A M

  1. arXiv:2505.03788  [pdf, other]

    cs.CL cs.AI cs.CV

    Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding

    Authors: Trilok Padhi, Ramneet Kaur, Adam D. Cobb, Manoj Acharya, Anirban Roy, Colin Samplawski, Brian Matejek, Alexander M. Berenbeim, Nathaniel D. Bastian, Susmit Jha

    Abstract: We introduce a novel approach for calibrating uncertainty quantification (UQ) tailored for multi-modal large language models (LLMs). Existing state-of-the-art UQ methods rely on consistency among multiple responses generated by the LLM on an input query under diverse settings. However, these approaches often report higher confidence in scenarios where the LLM is consistently incorrect. This leads…

    Submitted 30 April, 2025; originally announced May 2025.

  2. arXiv:2411.02381  [pdf, other]

    cs.AI

    Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI

    Authors: Ramneet Kaur, Colin Samplawski, Adam D. Cobb, Anirban Roy, Brian Matejek, Manoj Acharya, Daniel Elenius, Alexander M. Berenbeim, John A. Pavlik, Nathaniel D. Bastian, Susmit Jha

    Abstract: In this paper, we present a dynamic semantic clustering approach inspired by the Chinese Restaurant Process, aimed at addressing uncertainty in the inference of Large Language Models (LLMs). We quantify uncertainty of an LLM on a given query by calculating entropy of the generated semantic clusters. Further, we propose leveraging the (negative) likelihood of these clusters as the (non)conformity s…

    Submitted 4 November, 2024; originally announced November 2024.

  3. arXiv:2303.14568  [pdf, ps, other]

    stat.ML cs.AI cs.LG math.DG math.PR

    Measuring Classification Decision Certainty and Doubt

    Authors: Alexander M. Berenbeim, Iain J. Cruickshank, Susmit Jha, Robert H. Thomson, Nathaniel D. Bastian

    Abstract: Quantitative characterizations and estimations of uncertainty are of fundamental importance in optimization and decision-making processes. Herein, we propose intuitive scores, which we call certainty and doubt, that can be used in both a Bayesian and frequentist framework to assess and compare the quality and uncertainty of predictions in (multi-)classification decision machine learning problems.

    Submitted 27 March, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: 4 pages