Skip to main content

Showing 1–50 of 52 results for author: Rei, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.20209  [pdf, other

    cs.CL

    How to Improve the Robustness of Closed-Source Models on NLI

    Authors: Joe Stacey, Lisa Alazraki, Aran Ubhi, Beyza Ermis, Aaron Mueller, Marek Rei

    Abstract: Closed-source Large Language Models (LLMs) have become increasingly popular, with impressive performance across a wide range of natural language tasks. These models can be fine-tuned to further improve performance, but this often results in the models learning from dataset-specific heuristics that reduce their robustness on out-of-distribution (OOD) data. Existing methods to improve robustness eit… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    ACM Class: I.2.7

  2. arXiv:2505.15795  [pdf, ps, other

    cs.CL

    Reverse Engineering Human Preferences with Reinforcement Learning

    Authors: Lisa Alazraki, Tan Yi-Chern, Jon Ander Campos, Maximilian Mozes, Marek Rei, Max Bartolo

    Abstract: The capabilities of Large Language Models (LLMs) are routinely evaluated by other LLMs trained to predict human preferences. This framework--known as LLM-as-a-judge--is highly scalable and relatively low cost. However, it is also vulnerable to malicious exploitation, as LLM responses can be tuned to overfit the preferences of the judge. Previous work shows that the answers generated by a candidate… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2504.02733  [pdf, other

    cs.CL

    Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

    Authors: Aryan Agrawal, Lisa Alazraki, Shahin Honarvar, Marek Rei

    Abstract: Large Language Models (LLMs) are highly vulnerable to input perturbations, as even a small prompt change may result in a substantially different output. Existing methods to enhance LLM robustness are primarily focused on perturbed data samples, whereas improving resiliency to perturbations of task-level instructions has remained relatively underexplored. In this work, we focus on character- and wo… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Building Trust Workshop, ICLR 2025

  4. arXiv:2502.08550  [pdf, other

    cs.CL cs.AI

    No Need for Explanations: LLMs can implicitly learn from mistakes in-context

    Authors: Lisa Alazraki, Maximilian Mozes, Jon Ander Campos, Tan Yi-Chern, Marek Rei, Max Bartolo

    Abstract: Showing incorrect answers to Large Language Models (LLMs) is a popular strategy to improve their performance in reasoning-intensive tasks. It is widely assumed that, in order to be helpful, the incorrect answers must be accompanied by comprehensive rationales, explicitly detailing where the mistakes are and how to correct them. However, in this work we present a counterintuitive finding: we observ… ▽ More

    Submitted 21 May, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  5. KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports

    Authors: Hajung Kim, Chanhwi Kim, Jiwoong Sohn, Tim Beck, Marek Rei, Sunkyu Kim, T Ian Simpson, Joram M Posma, Antoine Lain, Mujeen Sung, Jaewoo Kang

    Abstract: The objective of BioCreative8 Track 3 is to extract phenotypic key medical findings embedded within EHR texts and subsequently normalize these findings to their Human Phenotype Ontology (HPO) terms. However, the presence of diverse surface forms in phenotypic findings makes it challenging to accurately normalize them to the correct HPO terms. To address this challenge, we explored various models f… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: This article is part of the Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the era of Generative Models

  6. arXiv:2411.04535  [pdf, other

    cs.CL cs.AI

    Meta-Reasoning Improves Tool Use in Large Language Models

    Authors: Lisa Alazraki, Marek Rei

    Abstract: External tools help large language models succeed at tasks where they would otherwise typically fail. In existing frameworks, choosing tools at test time relies on naive greedy decoding, regardless of whether the model has been fine-tuned on tool-annotated data or prompted with in-context examples. In contrast, we find that gathering and choosing among a suitable set of candidate tools has greater… ▽ More

    Submitted 8 February, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: NAACL 2025 Findings

  7. arXiv:2410.02810  [pdf, other

    cs.AI cs.CL cs.LG

    StateAct: Enhancing LLM Base Agents via Self-prompting and State-tracking

    Authors: Nikolai Rozanov, Marek Rei

    Abstract: Large language models (LLMs) are increasingly used as autonomous agents, tackling tasks from robotics to web navigation. Their performance depends on the underlying base agent. Existing methods, however, struggle with long-context reasoning and goal adherence. We introduce StateAct, a novel and efficient base agent that enhances decision-making through (1) self-prompting, which reinforces task goa… ▽ More

    Submitted 8 April, 2025; v1 submitted 21 September, 2024; originally announced October 2024.

    Comments: 9 pages, 5 pages appendix, 7 figures, 5 tables

  8. arXiv:2407.00248  [pdf, other

    cs.CL

    DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising

    Authors: Zhenhao Li, Huichi Zhou, Marek Rei, Lucia Specia

    Abstract: Pretrained language models have significantly advanced performance across various natural language processing tasks. However, adversarial attacks continue to pose a critical challenge to systems built using these models, as they can be exploited with carefully crafted adversarial texts. Inspired by the ability of diffusion models to predict and reduce noise in computer vision, we propose a novel a… ▽ More

    Submitted 16 May, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2025

  9. arXiv:2406.17975  [pdf, ps, other

    cs.CL cs.CR cs.LG

    SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It)

    Authors: Matthieu Meeus, Igor Shilov, Shubham Jain, Manuel Faysse, Marek Rei, Yves-Alexandre de Montjoye

    Abstract: Whether LLMs memorize their training data and what this means, from measuring privacy leakage to detecting copyright violations, has become a rapidly growing area of research. In the last few months, more than 10 new methods have been proposed to perform Membership Inference Attacks (MIAs) against LLMs. Contrary to traditional MIAs which rely on fixed-but randomized-records or models, these method… ▽ More

    Submitted 7 March, 2025; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: IEEE Conference on Secure and Trustworthy Machine Learning (SaTML 2025)

  10. arXiv:2405.11622  [pdf, other

    cs.CL cs.LG

    Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records

    Authors: Mireia Hernandez Caralt, Clarence Boon Liang Ng, Marek Rei

    Abstract: Electronic Health Records (EHR) serve as a valuable source of patient information, offering insights into medical histories, treatments, and outcomes. Previous research has developed systems for detecting applicable ICD codes that should be assigned while writing a given EHR document, mainly focusing on discharge summaries written at the end of a hospital stay. In this work, we investigate the pot… ▽ More

    Submitted 5 July, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; J.3

  11. Prompting open-source and commercial language models for grammatical error correction of English learner text

    Authors: Christopher Davis, Andrew Caines, Øistein Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery

    Abstract: Thanks to recent advances in generative AI, we are able to prompt large language models (LLMs) to produce texts which are fluent and grammatical. In addition, it has been shown that we can elicit attempts at grammatical error correction (GEC) from LLMs when prompted with ungrammatical input sentences. We evaluate how well LLMs can perform at GEC by measuring their performance on established benchm… ▽ More

    Submitted 6 April, 2025; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages with appendices; accepted to ACL Findings 2024

  12. arXiv:2310.15007  [pdf, other

    cs.CL cs.CR cs.LG

    Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models

    Authors: Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye

    Abstract: With large language models (LLMs) poised to become embedded in our daily lives, questions are starting to be raised about the data they learned from. These questions range from potential bias or misinformation LLMs could retain from their training data to questions of copyright and fair use of human-generated text. However, while these questions emerge, developers of the recent state-of-the-art LL… ▽ More

    Submitted 15 July, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at 33rd USENIX Security Symposium (USENIX Security 2024)

  13. arXiv:2307.08393  [pdf, other

    cs.CL cs.LG

    On the application of Large Language Models for language teaching and assessment technology

    Authors: Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Oeistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery

    Abstract: The recent release of very large language models such as PaLM and GPT-4 has made an unprecedented impact in the popular media and public consciousness, giving rise to a mixture of excitement and fear as to their capabilities and potential uses, and shining a light on natural language processing research which had not previously received so much attention. The developments offer great promise for e… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted at the AIED2023 workshop: Empowering Education with LLMs - the Next-Gen Interface and Content Generation

  14. arXiv:2305.13214  [pdf, other

    cs.CL

    Atomic Inference for NLI with Generated Facts as Atoms

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei

    Abstract: With recent advances, neural models can achieve human-level performance on various natural language tasks. However, there are no guarantees that any explanations from these models are faithful, i.e. that they reflect the inner workings of the model. Atomic inference overcomes this issue, providing interpretable and faithful model decisions. This approach involves making predictions for different c… ▽ More

    Submitted 1 October, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2024

    ACM Class: I.2.7

  15. arXiv:2305.13067  [pdf, other

    cs.CL cs.LG

    Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation

    Authors: Joe Stacey, Marek Rei

    Abstract: Knowledge distillation optimises a smaller student model to behave similarly to a larger teacher model, retaining some of the performance benefits. While this method can improve results on in-distribution examples, it does not necessarily generalise to out-of-distribution (OOD) settings. We investigate two complementary methods for improving the robustness of the resulting student models on OOD do… ▽ More

    Submitted 24 July, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL Findings 2024

    ACM Class: I.2.7

  16. arXiv:2303.07991  [pdf, other

    cs.CL cs.LG

    Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers

    Authors: Kamil Bujel, Andrew Caines, Helen Yannakoudakis, Marek Rei

    Abstract: Long-sequence transformers are designed to improve the representation of longer texts by language models and their performance on downstream document-level tasks. However, not much is understood about the quality of token-level predictions in long-form models. We investigate the performance of such architectures in the context of document classification with unsupervised rationale extraction. We f… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  17. arXiv:2302.12666  [pdf, other

    cs.LG cs.AI cs.CL

    Modelling Temporal Document Sequences for Clinical ICD Coding

    Authors: Clarence Boon Liang Ng, Diogo Santos, Marek Rei

    Abstract: Past studies on the ICD coding problem focus on predicting clinical codes primarily based on the discharge summary. This covers only a small fraction of the notes generated during each hospital stay and leaves potential for improving performance by analysing all the available clinical notes. We propose a hierarchical transformer architecture that uses text across the entire sequence of clinical no… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  18. arXiv:2302.05913  [pdf, other

    cs.CL

    An Extended Sequence Tagging Vocabulary for Grammatical Error Correction

    Authors: Stuart Mesham, Christopher Bryant, Marek Rei, Zheng Yuan

    Abstract: We extend a current sequence-tagging approach to Grammatical Error Correction (GEC) by introducing specialised tags for spelling correction and morphological inflection using the SymSpell and LemmInflect algorithms. Our approach improves generalisation: the proposed new tagset allows a smaller number of tags to correct a larger range of errors. Our results show a performance improvement both overa… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  19. arXiv:2210.16228  [pdf, other

    cs.CL

    Probing for targeted syntactic knowledge through grammatical error detection

    Authors: Christopher Davis, Christopher Bryant, Andrew Caines, Marek Rei, Paula Buttery

    Abstract: Targeted studies testing knowledge of subject-verb agreement (SVA) indicate that pre-trained language models encode syntactic information. We assert that if models robustly encode subject-verb agreement, they should be able to identify when agreement is correct and when it is incorrect. To that end, we propose grammatical error detection as a diagnostic probe to evaluate token-level contextual rep… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: CoNLL 2022

  20. arXiv:2205.11432  [pdf, other

    cs.CL cs.LG

    Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Marek Rei

    Abstract: Current Natural Language Inference (NLI) models achieve impressive results, sometimes outperforming humans when evaluating on in-distribution test sets. However, as these models are known to learn from annotation artefacts and dataset biases, it is unclear to what extent the models are learning the task of NLI instead of learning from shallow heuristics in their training data. We address this issu… ▽ More

    Submitted 21 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

  21. arXiv:2110.08329  [pdf, other

    cs.CL cs.AI cs.LG

    Control Prefixes for Parameter-Efficient Text Generation

    Authors: Jordan Clive, Kris Cao, Marek Rei

    Abstract: Prefix-tuning is a powerful lightweight technique for adapting a large pre-trained language model to a downstream application. However, it uses the same dataset-level tuned prompt for all examples in the dataset. We extend this idea and propose a dynamic method, Control Prefixes, which allows for the inclusion of conditional input-dependent information, combining the benefits of prompt tuning and… ▽ More

    Submitted 10 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  22. arXiv:2110.08226  [pdf, other

    cs.LG cs.CL cs.CV

    Guiding Visual Question Generation

    Authors: Nihir Vedd, Zixu Wang, Marek Rei, Yishu Miao, Lucia Specia

    Abstract: In traditional Visual Question Generation (VQG), most images have multiple concepts (e.g. objects and categories) for which a question could be generated, but models are trained to mimic an arbitrary choice of concept as given in their training data. This makes training difficult and also poses issues for evaluation -- multiple valid questions exist for most images but only one or a few are captur… ▽ More

    Submitted 26 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: 14 pages including references and Appendix. 3 figures and 4 tables

  23. arXiv:2110.03727  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Contextual Sentence Classification: Detecting Sustainability Initiatives in Company Reports

    Authors: Dan Hirlea, Christopher Bryant, Maurizio Zollo, Marek Rei

    Abstract: We introduce the novel task of detecting sustainability initiatives in company reports. Given a full report, the aim is to automatically identify mentions of practical activities that a company has performed in order to tackle specific societal issues. New methods for identifying continuous sentence spans need to be developed for capturing the multi-sentence structure of individual sustainability… ▽ More

    Submitted 3 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 10 pages, 2 figures, 7 tables

  24. arXiv:2105.00828  [pdf, other

    cs.CL cs.LG

    Memorisation versus Generalisation in Pre-trained Language Models

    Authors: Michael Tänzer, Sebastian Ruder, Marek Rei

    Abstract: State-of-the-art pre-trained language models have been shown to memorise facts and perform well with limited amounts of training data. To gain a better understanding of how these models learn, we study their generalisation and memorisation capabilities in noisy and low-resource scenarios. We find that the training of these models is almost unaffected by label noise and that it is possible to reach… ▽ More

    Submitted 14 March, 2022; v1 submitted 16 April, 2021; originally announced May 2021.

    Comments: 15 pages, 25 figures. To be published in ACL2022

  25. arXiv:2104.08142  [pdf, other

    cs.CL cs.LG

    Supervising Model Attention with Human Explanations for Robust Natural Language Inference

    Authors: Joe Stacey, Yonatan Belinkov, Marek Rei

    Abstract: Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well they generalise to other unseen datasets. Existing de-biasing approaches focus on preventing the models from learning these biases, which can result in restrictive models and lower performance. We instead investigate teaching the model how a human would approach the N… ▽ More

    Submitted 1 May, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at AAAI 2022

  26. arXiv:2104.03928  [pdf, other

    cs.CL cs.SI

    How Metaphors Impact Political Discourse: A Large-Scale Topic-Agnostic Study Using Neural Metaphor Detection

    Authors: Vinodkumar Prabhakaran, Marek Rei, Ekaterina Shutova

    Abstract: Metaphors are widely used in political rhetoric as an effective framing device. While the efficacy of specific metaphors such as the war metaphor in political discourse has been documented before, those studies often rely on small number of hand-coded instances of metaphor use. Larger-scale topic-agnostic studies are required to establish the general persuasiveness of metaphors as a device, and to… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Published at ICWSM 2021. Please cite that version for academic publications

    Journal ref: The International AAAI Conference on Web and Social Media (ICWSM) 2021

  27. arXiv:2103.14465  [pdf, other

    cs.CL cs.AI

    Zero-shot Sequence Labeling for Transformer-based Sentence Classifiers

    Authors: Kamil Bujel, Helen Yannakoudakis, Marek Rei

    Abstract: We investigate how sentence-level transformers can be modified into effective sequence labelers at the token level without any direct supervision. Existing approaches to zero-shot sequence labeling do not perform well when applied on transformer-based architectures. As transformers contain multiple layers of multi-head self-attention, information in the sentence gets distributed between many token… ▽ More

    Submitted 8 June, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

  28. Visual Cues and Error Correction for Translation Robustness

    Authors: Zhenhao Li, Marek Rei, Lucia Specia

    Abstract: Neural Machine Translation models are sensitive to noise in the input texts, such as misspelled words and ungrammatical constructions. Existing robustness techniques generally fail when faced with unseen types of noise and their performance degrades on clean texts. In this paper, we focus on three types of realistic noise that are commonly generated by humans and introduce the idea of visual conte… ▽ More

    Submitted 2 May, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted at Findings of EMNLP 2021; add acknowledgements

  29. arXiv:2011.00470  [pdf, other

    cs.CL

    Seeing Both the Forest and the Trees: Multi-head Attention for Joint Classification on Different Compositional Levels

    Authors: Miruna Pislar, Marek Rei

    Abstract: In natural languages, words are used in association to construct sentences. It is not words in isolation, but the appropriate combination of hierarchical structures that conveys the meaning of the whole sentence. Neural networks can capture expressive language features; however, insights into the link between words and sentences are difficult to acquire automatically. In this work, we design a dee… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Journal ref: COLING 2020

  30. arXiv:2010.12532  [pdf, other

    cs.CL

    GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

    Authors: Nicole Peinelt, Marek Rei, Maria Liakata

    Abstract: Large pre-trained language models such as BERT have been the driving force behind recent improvements across many NLP tasks. However, BERT is only trained to predict missing words - either behind masks or in the next sentence - and has no knowledge of lexical, syntactic or semantic information beyond what it picks up through unsupervised pre-training. We propose a novel method to explicitly inject… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  31. arXiv:2010.07574  [pdf, other

    cs.CL

    Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses

    Authors: Simon Flachs, Ophélie Lacroix, Helen Yannakoudakis, Marek Rei, Anders Søgaard

    Abstract: Evaluation of grammatical error correction (GEC) systems has primarily focused on essays written by non-native learners of English, which however is only part of the full spectrum of GEC applications. We aim to broaden the target domain of GEC and release CWEB, a new benchmark for GEC consisting of website text generated by English speakers of varying levels of proficiency. Website data is a commo… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP 2020

  32. arXiv:2005.05264  [pdf, other

    cs.CL

    Multidirectional Associative Optimization of Function-Specific Word Representations

    Authors: Daniela Gerz, Ivan Vulić, Marek Rei, Roi Reichart, Anna Korhonen

    Abstract: We present a neural framework for learning associations between interrelated groups of words such as the ones found in Subject-Verb-Object (SVO) structures. Our model induces a joint function-specific word vector space, where vectors of e.g. plausible SVO compositions lie close together. The model retains information about word group membership even in the joint space, and can thereby effectively… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: ACL 2020 (Long paper)

  33. arXiv:1911.11672  [pdf, other

    cs.CL

    Semi-supervised Bootstrapping of Dialogue State Trackers for Task Oriented Modelling

    Authors: Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard E. Turner, Bill Byrne, Anna Korhonen

    Abstract: Dialogue systems benefit greatly from optimizing on detailed annotations, such as transcribed utterances, internal dialogue state representations and dialogue act labels. However, collecting these annotations is expensive and time-consuming, holding back development in the area of dialogue modelling. In this paper, we investigate semi-supervised learning methods that are able to reduce the amount… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: This article is published at EMNLP-IJCNLP 2019

  34. arXiv:1910.00275  [pdf, other

    cs.CL cs.LG

    Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models

    Authors: Jeroen Van Hautte, Guy Emerson, Marek Rei

    Abstract: Word embeddings are an essential component in a wide range of natural language processing applications. However, distributional semantic models are known to struggle when only a small number of context sentences are available. Several methods have been proposed to obtain higher-quality vectors for these words, leveraging both this context information and sometimes the word forms themselves through… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: Accepted to the Proceedings of the Second Workshop on Deep Learning for Low-Resource NLP (DeepLo 2019)

  35. Context is Key: Grammatical Error Detection with Contextual Word Representations

    Authors: Samuel Bell, Helen Yannakoudakis, Marek Rei

    Abstract: Grammatical error detection (GED) in non-native writing requires systems to identify a wide range of errors in text written by language learners. Error detection as a purely supervised task can be challenging, as GED datasets are limited in size and the label distributions are highly imbalanced. Contextualized word representations offer a possible solution, as they can efficiently capture composit… ▽ More

    Submitted 1 May, 2020; v1 submitted 15 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2019), pp. 103-115

  36. arXiv:1811.12938  [pdf, other

    cs.LG q-bio.QM stat.ML

    Advance Prediction of Ventricular Tachyarrhythmias using Patient Metadata and Multi-Task Networks

    Authors: Marek Rei, Joshua Oppenheimer, Marek Sirendi

    Abstract: We describe a novel neural network architecture for the prediction of ventricular tachyarrhythmias. The model receives input features that capture the change in RR intervals and ectopic beats, along with features based on heart rate variability and frequency analysis. Patient age is also included as a trainable embedding, while the whole network is optimized with multi-task objectives. Each of the… ▽ More

    Submitted 30 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018

    Report number: ML4H/2018/158

  37. arXiv:1811.05949  [pdf, other

    cs.CL cs.LG cs.NE

    Jointly Learning to Label Sentences and Tokens

    Authors: Marek Rei, Anders Søgaard

    Abstract: Learning to construct text representations in end-to-end systems can be difficult, as natural languages are highly compositional and task-specific annotated datasets are often limited in size. Methods for directly supervising language composition can allow us to guide the models based on existing knowledge, regularizing them towards more robust and interpretable representations. In this paper, we… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: AAAI 2019

    ACM Class: I.2.7; I.2.6; I.5.1

  38. arXiv:1805.09355  [pdf, other

    cs.CL cs.LG cs.NE

    Scoring Lexical Entailment with a Supervised Directional Similarity Network

    Authors: Marek Rei, Daniela Gerz, Ivan Vulić

    Abstract: We present the Supervised Directional Similarity Network (SDSN), a novel neural architecture for learning task-specific transformation functions on top of general-purpose word embeddings. Relying on only a limited amount of supervision from task-specific scores on a subset of the vocabulary, our architecture is able to generalise and transform a general-purpose distributional vector space to model… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: ACL 2018

    ACM Class: I.2.7; I.2.6; I.5.1

  39. arXiv:1805.02214  [pdf, other

    cs.CL cs.LG cs.NE

    Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

    Authors: Marek Rei, Anders Søgaard

    Abstract: Can attention- or gradient-based visualization techniques be used to infer token-level labels for binary sequence tagging problems, using networks trained only on sentence-level labels? We construct a neural network architecture based on soft attention, train it as a binary sentence classifier and evaluate against token-level annotation on four different datasets. Inferring token labels from a net… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: NAACL 2018

    ACM Class: I.2.7; I.2.6; I.5.1

  40. arXiv:1801.06830  [pdf, other

    cs.CL

    Neural Multi-task Learning in Automated Assessment

    Authors: Ronan Cummins, Marek Rei

    Abstract: Grammatical error detection and automated essay scoring are two tasks in the area of automated assessment. Traditionally these tasks have been treated independently with different machine learning models and features used for each task. In this paper, we develop a multi-task neural network model that jointly optimises for both tasks, and in particular we show that neural automated essay scoring ca… ▽ More

    Submitted 21 January, 2018; originally announced January 2018.

    Comments: 9 pages

  41. arXiv:1709.00575  [pdf, other

    cs.CL cs.LG cs.NE

    Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection

    Authors: Marek Rei, Luana Bulat, Douwe Kiela, Ekaterina Shutova

    Abstract: The ubiquity of metaphor in our everyday communication makes it an important problem for natural language understanding. Yet, the majority of metaphor processing systems to date rely on hand-engineered features and there is still no consensus in the field as to which features are optimal for this task. In this paper, we present the first deep learning architecture designed to capture metaphorical… ▽ More

    Submitted 2 September, 2017; originally announced September 2017.

    Comments: EMNLP 2017

    ACM Class: I.2.7; I.2.6; I.5.1

  42. arXiv:1707.06841  [pdf, other

    cs.CL cs.LG cs.NE

    An Error-Oriented Approach to Word Embedding Pre-Training

    Authors: Youmna Farag, Marek Rei, Ted Briscoe

    Abstract: We propose a novel word embedding pre-training approach that exploits writing errors in learners' scripts. We compare our method to previous models that tune the embeddings based on script scores and the discrimination between correct and corrupt word contexts in addition to the generic commonly-used embeddings pre-trained on large corpora. The comparison is achieved by using the aforementioned mo… ▽ More

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: 10 pages, 2 figures, 4 tables, BEA 2017

    Journal ref: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

  43. arXiv:1707.05236  [pdf, other

    cs.CL cs.LG

    Artificial Error Generation with Machine Translation and Syntactic Patterns

    Authors: Marek Rei, Mariano Felice, Zheng Yuan, Ted Briscoe

    Abstract: Shortage of available training data is holding back progress in the area of automated error detection. This paper investigates two alternative methods for artificially generating writing errors, in order to create additional resources. We propose treating error generation as a machine translation task, where grammatically correct text is translated to contain errors. In addition, we explore a syst… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

    ACM Class: I.2.7; I.2.6; I.5.1

  44. arXiv:1707.05233  [pdf, other

    cs.CL cs.LG cs.NE

    Detecting Off-topic Responses to Visual Prompts

    Authors: Marek Rei

    Abstract: Automated methods for essay scoring have made great progress in recent years, achieving accuracies very close to human annotators. However, a known weakness of such automated scorers is not taking into account the semantic relevance of the submitted text. While there is existing work on detecting answer relevance given a textual prompt, very little previous research has been done to incorporate vi… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

    ACM Class: I.2.7; I.2.6; I.5.1

  45. arXiv:1707.05227  [pdf, other

    cs.CL cs.LG cs.NE

    Auxiliary Objectives for Neural Error Detection Models

    Authors: Marek Rei, Helen Yannakoudakis

    Abstract: We investigate the utility of different auxiliary objectives and training strategies within a neural sequence labeling approach to error detection in learner writing. Auxiliary costs provide the model with additional linguistic information, allowing it to learn general-purpose compositional features that can then be exploited for other objectives. Our experiments show that a joint learning approac… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

    ACM Class: I.2.7; I.2.6; I.5.1

  46. arXiv:1704.07156  [pdf, other

    cs.CL cs.LG cs.NE

    Semi-supervised Multitask Learning for Sequence Labeling

    Authors: Marek Rei

    Abstract: We propose a sequence labeling framework with a secondary training objective, learning to predict surrounding words for every word in the dataset. This language modeling objective incentivises the system to learn general-purpose patterns of semantic and syntactic composition, which are also useful for improving accuracy on different sequence labeling tasks. The architecture was evaluated on a rang… ▽ More

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: ACL 2017

    ACM Class: I.5.1; I.2.6; I.2.7

  47. arXiv:1611.04361  [pdf, other

    cs.CL cs.LG cs.NE

    Attending to Characters in Neural Sequence Labeling Models

    Authors: Marek Rei, Gamal K. O. Crichton, Sampo Pyysalo

    Abstract: Sequence labeling architectures use word embeddings for capturing similarity, but suffer when handling previously unseen or rare words. We investigate character-level extensions to such models and propose a novel architecture for combining alternative word representations. By using an attention mechanism, the model is able to dynamically decide how much information to use from a word- or character… ▽ More

    Submitted 14 November, 2016; originally announced November 2016.

    Comments: Proceedings of COLING 2016

    ACM Class: I.5.1; I.2.6; I.2.7

  48. Compositional Sequence Labeling Models for Error Detection in Learner Writing

    Authors: Marek Rei, Helen Yannakoudakis

    Abstract: In this paper, we present the first experiments using neural network models for the task of error detection in learner writing. We perform a systematic comparison of alternative compositional architectures and propose a framework for error detection based on bidirectional LSTMs. Experiments on the CoNLL-14 shared task dataset show the model is able to outperform other participants on detecting err… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: Proceedings of ACL 2016

    ACM Class: I.5.1; I.2.6; I.2.7

  49. arXiv:1606.04289  [pdf, other

    cs.CL cs.LG cs.NE

    Automatic Text Scoring Using Neural Networks

    Authors: Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei

    Abstract: Automated Text Scoring (ATS) provides a cost-effective and consistent alternative to human marking. However, in order to achieve good performance, the predictive features of the system need to be manually engineered by human experts. We introduce a model that forms word representations by learning the extent to which specific words contribute to the text's score. Using Long-Short Term Memory netwo… ▽ More

    Submitted 16 June, 2016; v1 submitted 14 June, 2016; originally announced June 2016.

    Comments: 11 pages, 3 figures, 2 tables, ACL-2016

    ACM Class: I.5.1; I.2.6; I.2.7

  50. arXiv:1606.03144  [pdf, other

    cs.CL cs.LG cs.NE

    Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays

    Authors: Marek Rei, Ronan Cummins

    Abstract: We investigate the task of assessing sentence-level prompt relevance in learner essays. Various systems using word overlap, neural embeddings and neural compositional models are evaluated on two datasets of learner writing. We propose a new method for sentence-level similarity calculation, which learns to adjust the weights of pre-trained word embeddings for a specific task, achieving substantiall… ▽ More

    Submitted 9 June, 2016; originally announced June 2016.

    Comments: Accepted for publication at BEA-2016