Skip to main content

Showing 1–8 of 8 results for author: Aina, L

.
  1. arXiv:2503.19114  [pdf, other

    cs.CL cs.IR cs.LG

    Understanding and Improving Information Preservation in Prompt Compression for LLMs

    Authors: Weronika Łajewska, Momchil Hardalov, Laura Aina, Neha Anna John, Hang Su, Lluís Màrquez

    Abstract: Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced t… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 21 pages, 6 figures, 23 tables

  2. Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators

    Authors: Matéo Mahaut, Laura Aina, Paula Czarnowska, Momchil Hardalov, Thomas Müller, Lluís Màrquez

    Abstract: Large Language Models (LLMs) tend to be unreliable in the factuality of their answers. To address this problem, NLP researchers have proposed a range of techniques to estimate LLM's confidence over facts. However, due to the lack of a systematic comparison, it is not clear how the different methods compare to one another. To fill this gap, we present a survey and empirical comparison of estimators… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: accepted on the main track of ACL 2024

  3. arXiv:2210.12022  [pdf, other

    cs.CL

    Performance-Efficiency Trade-Offs in Adapting Language Models to Text Classification Tasks

    Authors: Laura Aina, Nikos Voskarides, Roi Blanco

    Abstract: Pre-trained language models (LMs) obtain state-of-the-art performance when adapted to text classification tasks. However, when using such models in real-world applications, efficiency considerations are paramount. In this paper, we study how different training procedures that adapt LMs to text classification perform, as we vary model and train set size. More specifically, we compare standard fine-… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022

  4. arXiv:2109.13105  [pdf, other

    cs.CL

    Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution

    Authors: Laura Aina, Xixian Liao, Gemma Boleda, Matthijs Westera

    Abstract: It is often posited that more predictable parts of a speaker's meaning tend to be made less explicit, for instance using shorter, less informative words. Studying these dynamics in the domain of referring expressions has proven difficult, with existing studies, both psycholinguistic and corpus-based, providing contradictory results. We test the hypothesis that speakers produce less informative ref… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  5. arXiv:2109.07848  [pdf, other

    cs.CL

    The Language Model Understood the Prompt was Ambiguous: Probing Syntactic Uncertainty Through Generation

    Authors: Laura Aina, Tal Linzen

    Abstract: Temporary syntactic ambiguities arise when the beginning of a sentence is compatible with multiple syntactic analyses. We inspect to which extent neural language models (LMs) exhibit uncertainty over such analyses when processing temporarily ambiguous inputs, and how that uncertainty is modulated by disambiguating cues. We probe the LM's expectations by generating from it: we use stochastic decodi… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: To appear in Proceedings of BlackboxNLP 2021: Analyzing and Interpreting Neural Networks for NLP

  6. arXiv:1906.05149  [pdf, other

    cs.CL

    Putting words in context: LSTM language models and lexical ambiguity

    Authors: Laura Aina, Kristina Gulordava, Gemma Boleda

    Abstract: In neural network models of language, words are commonly represented using context-invariant representations (word embeddings) which are then put in context in the hidden layers. Since words are often ambiguous, representing the contextually relevant information is not trivial. We investigate how an LSTM language model deals with lexical ambiguity in English, designing a method to probe its hidden… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: To appear in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL)

  7. arXiv:1905.06649  [pdf, other

    cs.CL

    What do Entity-Centric Models Learn? Insights from Entity Linking in Multi-Party Dialogue

    Authors: Laura Aina, Carina Silberer, Matthijs Westera, Ionut-Teodor Sorodoc, Gemma Boleda

    Abstract: Humans use language to refer to entities in the external world. Motivated by this, in recent years several models that incorporate a bias towards learning entity representations have been proposed. Such entity-centric models have shown empirical success, but we still know little about why. In this paper we analyze the behavior of two recently proposed entity-centric models in a referential task, E… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: To appear in Proceedings of NAACL 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics

  8. arXiv:1805.05370  [pdf, other

    cs.CL

    AMORE-UPF at SemEval-2018 Task 4: BiLSTM with Entity Library

    Authors: Laura Aina, Carina Silberer, Ionut-Teodor Sorodoc, Matthijs Westera, Gemma Boleda

    Abstract: This paper describes our winning contribution to SemEval 2018 Task 4: Character Identification on Multiparty Dialogues. It is a simple, standard model with one key innovation, an entity library. Our results show that this innovation greatly facilitates the identification of infrequent characters. Because of the generic nature of our model, this finding is potentially relevant to any task that requ… ▽ More

    Submitted 14 May, 2018; originally announced May 2018.