Skip to main content

Showing 1–1 of 1 results for author: Alim, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.09338  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Know What You Don't Know: Uncertainty Calibration of Process Reward Models

    Authors: Young-Jin Park, Kristjan Greenewald, Kaveh Alim, Hao Wang, Navid Azizan

    Abstract: Process reward models (PRMs) play a central role in guiding inference-time scaling algorithms for large language models (LLMs). However, we observe that even state-of-the-art PRMs can be poorly calibrated and often overestimate success probabilities. To address this, we present a calibration approach, performed via quantile regression, that adjusts PRM outputs to better align with true success pro… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.