Skip to main content

Showing 1–3 of 3 results for author: Staruch, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13148  [pdf, ps, other

    cs.CL cs.AI

    Adapting LLMs for Minimal-edit Grammatical Error Correction

    Authors: Ryszard Staruch, Filip Graliński, Daniel Dzienisiewicz

    Abstract: Decoder-only large language models have shown superior performance in the fluency-edit English Grammatical Error Correction, but their adaptation for minimal-edit English GEC is still underexplored. To improve their effectiveness in the minimal-edit approach, we explore the error rate adaptation topic and propose a novel training schedule method. Our experiments set a new state-of-the-art result f… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Accepted at BEA-2025

  2. arXiv:2501.02266  [pdf, other

    cs.CL cs.AI

    LLMzSzŁ: a comprehensive LLM benchmark for Polish

    Authors: Krzysztof Jassem, Michał Ciesiółka, Filip Graliński, Piotr Jabłoński, Jakub Pokrywka, Marek Kubis, Monika Jabłońska, Ryszard Staruch

    Abstract: This article introduces the first comprehensive benchmark for the Polish language at this scale: LLMzSzŁ (LLMs Behind the School Desk). It is based on a coherent collection of Polish national exams, including both academic and professional tests extracted from the archives of the Polish Central Examination Board. It covers 4 types of exams, coming from 154 domains. Altogether, it consists of almos… ▽ More

    Submitted 4 January, 2025; originally announced January 2025.

  3. arXiv:2409.03046  [pdf, other

    cs.CL

    Oddballness: universal anomaly detection with language models

    Authors: Filip Graliński, Ryszard Staruch, Krzysztof Jurkiewicz

    Abstract: We present a new method to detect anomalies in texts (in general: in sequences of any data), using language models, in a totally unsupervised manner. The method considers probabilities (likelihoods) generated by a language model, but instead of focusing on low-likelihood tokens, it considers a new metric introduced in this paper: oddballness. Oddballness measures how ``strange'' a given token is a… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.