Skip to main content

Showing 1–1 of 1 results for author: Brynda, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.00069  [pdf, ps, other

    cs.CL cs.AI

    Evaluating the Sensitivity of LLMs to Prior Context

    Authors: Robert Hankache, Kingsley Nketia Acheampong, Liang Song, Marek Brynda, Raad Khraishi, Greig A. Cowan

    Abstract: As large language models (LLMs) are increasingly deployed in multi-turn dialogue and other sustained interactive scenarios, it is essential to understand how extended context affects their performance. Popular benchmarks, focusing primarily on single-turn question answering (QA) tasks, fail to capture the effects of multi-turn exchanges. To address this gap, we introduce a novel set of benchmarks… ▽ More

    Submitted 29 May, 2025; originally announced June 2025.