Showing 1–10 of 10 results for author: Bryant, C

  1. arXiv:2505.24726

    cs.CL

    Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

    Authors: Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmytro Mozolevskyi, Muayad Ali, Waseem AlShikh

    Abstract: We explore a method for improving the performance of large language models through self-reflection and reinforcement learning. By incentivizing the model to generate better self-reflections when it answers incorrectly, we demonstrate that a model's ability to solve complex, verifiable tasks can be enhanced even when generating synthetic data is infeasible and only binary feedback is available. Our…

    Submitted 30 May, 2025; originally announced May 2025.

  2. arXiv:2408.14906

    cs.CL cs.IR

    Writing in the Margins: Better Inference Pattern for Long Context Retrieval

    Authors: Melisa Russak, Umar Jamil, Christopher Bryant, Kiran Kamble, Axel Magnuson, Mateusz Russak, Waseem AlShikh

    Abstract: In this paper, we introduce Writing in the Margins (WiM), a new inference pattern for Large Language Models designed to optimize the handling of long input sequences in retrieval-oriented tasks. This approach leverages the chunked prefill of the key-value cache to perform segment-wise inference, which enables efficient processing of extensive contexts along with the generation and classification o…

    Submitted 27 August, 2024; originally announced August 2024.

  3. arXiv:2404.12489

    cs.CL

    Grammatical Error Correction for Code-Switched Sentences by Learners of English

    Authors: Kelvin Wey Han Chan, Christopher Bryant, Li Nguyen, Andrew Caines, Zheng Yuan

    Abstract: Code-switching (CSW) is a common phenomenon among multilingual speakers where multiple languages are used in a single discourse or utterance. Mixed language utterances may still contain grammatical errors however, yet most existing Grammar Error Correction (GEC) systems have been trained on monolingual data and not developed with CSW in mind. In this work, we conduct the first exploration into the…

    Submitted 6 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics

  4. Prompting open-source and commercial language models for grammatical error correction of English learner text

    Authors: Christopher Davis, Andrew Caines, Øistein Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery

    Abstract: Thanks to recent advances in generative AI, we are able to prompt large language models (LLMs) to produce texts which are fluent and grammatical. In addition, it has been shown that we can elicit attempts at grammatical error correction (GEC) from LLMs when prompted with ungrammatical input sentences. We evaluate how well LLMs can perform at GEC by measuring their performance on established benchm…

    Submitted 6 April, 2025; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages with appendices; accepted to ACL Findings 2024

  5. arXiv:2307.08393

    cs.CL cs.LG

    On the application of Large Language Models for language teaching and assessment technology

    Authors: Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Oeistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery

    Abstract: The recent release of very large language models such as PaLM and GPT-4 has made an unprecedented impact in the popular media and public consciousness, giving rise to a mixture of excitement and fear as to their capabilities and potential uses, and shining a light on natural language processing research which had not previously received so much attention. The developments offer great promise for e…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted at the AIED2023 workshop: Empowering Education with LLMs - the Next-Gen Interface and Content Generation

  6. arXiv:2302.05913

    cs.CL

    An Extended Sequence Tagging Vocabulary for Grammatical Error Correction

    Authors: Stuart Mesham, Christopher Bryant, Marek Rei, Zheng Yuan

    Abstract: We extend a current sequence-tagging approach to Grammatical Error Correction (GEC) by introducing specialised tags for spelling correction and morphological inflection using the SymSpell and LemmInflect algorithms. Our approach improves generalisation: the proposed new tagset allows a smaller number of tags to correct a larger range of errors. Our results show a performance improvement both overa…

    Submitted 12 February, 2023; originally announced February 2023.

  7. Grammatical Error Correction: A Survey of the State of the Art

    Authors: Christopher Bryant, Zheng Yuan, Muhammad Reza Qorib, Hannan Cao, Hwee Tou Ng, Ted Briscoe

    Abstract: Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors respectively. The field has seen significant progress in the last decade, m…

    Submitted 29 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Journal ref: Computational Linguistics (2023) 49 (3): 643-701

  8. arXiv:2210.16228

    cs.CL

    Probing for targeted syntactic knowledge through grammatical error detection

    Authors: Christopher Davis, Christopher Bryant, Andrew Caines, Marek Rei, Paula Buttery

    Abstract: Targeted studies testing knowledge of subject-verb agreement (SVA) indicate that pre-trained language models encode syntactic information. We assert that if models robustly encode subject-verb agreement, they should be able to identify when agreement is correct and when it is incorrect. To that end, we propose grammatical error detection as a diagnostic probe to evaluate token-level contextual rep…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: CoNLL 2022

  9. arXiv:2110.03727

    cs.CL cs.AI cs.LG cs.NE

    Contextual Sentence Classification: Detecting Sustainability Initiatives in Company Reports

    Authors: Dan Hirlea, Christopher Bryant, Maurizio Zollo, Marek Rei

    Abstract: We introduce the novel task of detecting sustainability initiatives in company reports. Given a full report, the aim is to automatically identify mentions of practical activities that a company has performed in order to tackle specific societal issues. New methods for identifying continuous sentence spans need to be developed for capturing the multi-sentence structure of individual sustainability…

    Submitted 3 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 10 pages, 2 figures, 7 tables

  10. arXiv:1903.10625

    cs.CL

    Neural Grammatical Error Correction with Finite State Transducers

    Authors: Felix Stahlberg, Christopher Bryant, Bill Byrne

    Abstract: Grammatical error correction (GEC) is one of the areas in natural language processing in which purely neural models have not yet superseded more traditional symbolic models. Hybrid systems combining phrase-based statistical machine translation (SMT) and neural sequence models are currently among the most effective approaches to GEC. However, both SMT and neural sequence-to-sequence models require…

    Submitted 5 April, 2019; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: NAACL 2019