Skip to main content

Showing 1–9 of 9 results for author: Hagström, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.16518  [pdf, ps, other

    cs.CL cs.AI

    CUB: Benchmarking Context Utilisation Techniques for Language Models

    Authors: Lovisa Hagström, Youna Kim, Haeun Yu, Sang-goo Lee, Richard Johansson, Hyunsoo Cho, Isabelle Augenstein

    Abstract: Incorporating external knowledge is crucial for knowledge-intensive tasks, such as question answering and fact checking. However, language models (LMs) may ignore relevant information that contradicts outdated parametric memory or be distracted by irrelevant contexts. While many context utilisation manipulation techniques (CMTs) that encourage or suppress context utilisation have recently been pro… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 27 pages

  2. arXiv:2502.17036  [pdf, ps, other

    cs.CL cs.AI

    Language Model Re-rankers are Fooled by Lexical Similarities

    Authors: Lovisa Hagström, Ercong Nie, Ruben Halifa, Helmut Schmid, Richard Johansson, Alexander Junge

    Abstract: Language model (LM) re-rankers are used to refine retrieval results for retrieval-augmented generation (RAG). They are more expensive than lexical matching methods like BM25 but assumed to better process semantic information and the relations between the query and the retrieved answers. To understand whether LM re-rankers always live up to this assumption, we evaluate 6 different LM re-rankers on… ▽ More

    Submitted 24 June, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: Accepted to FEVER 2025

  3. arXiv:2412.17031  [pdf, ps, other

    cs.CL cs.AI

    A Reality Check on Context Utilisation for Retrieval-Augmented Generation

    Authors: Lovisa Hagström, Sara Vera Marjanović, Haeun Yu, Arnav Arora, Christina Lioma, Maria Maistro, Pepa Atanasova, Isabelle Augenstein

    Abstract: Retrieval-augmented generation (RAG) helps address the limitations of parametric knowledge embedded within a language model (LM). In real world settings, retrieved information can vary in complexity, yet most investigations of LM utilisation of context has been limited to synthetic text. We introduce DRUID (Dataset of Retrieved Unreliable, Insufficient and Difficult-to-understand contexts) with re… ▽ More

    Submitted 29 May, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: Accepted at ACL 2025

  4. arXiv:2410.14405  [pdf, ps, other

    cs.CL

    Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion

    Authors: Denitsa Saynova, Lovisa Hagström, Moa Johansson, Richard Johansson, Marco Kuhlmann

    Abstract: Language models (LMs) can make a correct prediction based on many possible signals in a prompt, not all corresponding to recall of factual associations. However, current interpretations of LMs fail to take this into account. For example, given the query "Astrid Lindgren was born in" with the corresponding completion "Sweden", no difference is made between whether the prediction was based on knowin… ▽ More

    Submitted 1 July, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: accepted to ACL Findings 2025

  5. arXiv:2311.01307  [pdf, other

    cs.CL

    The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models

    Authors: Lovisa Hagström, Denitsa Saynova, Tobias Norlund, Moa Johansson, Richard Johansson

    Abstract: Large Language Models (LLMs) make natural interfaces to factual knowledge, but their usefulness is limited by their tendency to deliver inconsistent answers to semantically equivalent questions. For example, a model might predict both "Anne Redpath passed away in Edinburgh." and "Anne Redpath's life ended in London." In this work, we identify potential causes of inconsistency and evaluate the effe… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023

  6. arXiv:2209.08982  [pdf, other

    cs.CL

    How to Adapt Pre-trained Vision-and-Language Models to a Text-only Input?

    Authors: Lovisa Hagström, Richard Johansson

    Abstract: Current language models have been criticised for learning language from text alone without connection between words and their meaning. Consequently, multimodal training has been proposed as a way for creating models with better language understanding by providing the lacking connection. We focus on pre-trained multimodal vision-and-language (VL) models for which there already are some results on t… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  7. arXiv:2205.07065  [pdf, other

    cs.CL

    What do Models Learn From Training on More Than Text? Measuring Visual Commonsense Knowledge

    Authors: Lovisa Hagström, Richard Johansson

    Abstract: There are limitations in learning language from text alone. Therefore, recent focus has been on developing multimodal models. However, few benchmarks exist that can measure what language models learn about language from multimodal training. We hypothesize that training on a visual modality should improve on the visual commonsense knowledge in language models. Therefore, we introduce two evaluation… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted to the ACL Student Research Workshop 2022

  8. arXiv:2201.10665  [pdf, other

    cs.CV

    Writer Recognition Using Off-line Handwritten Single Block Characters

    Authors: Adrian Leo Hagström, Rustam Stanikzai, Josef Bigun, Fernando Alonso-Fernandez

    Abstract: Block characters are often used when filling paper forms for a variety of purposes. We investigate if there is biometric information contained within individual digits of handwritten text. In particular, we use personal identity numbers consisting of the six digits of the date of birth, DoB. We evaluate two recognition approaches, one based on handcrafted features that compute contour directional… ▽ More

    Submitted 7 March, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted for publication at IEEE International Workshop on Biometrics and Forensics IWBF 2022

  9. arXiv:2109.11321  [pdf, other

    cs.CL

    Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it?

    Authors: Tobias Norlund, Lovisa Hagström, Richard Johansson

    Abstract: Large language models are known to suffer from the hallucination problem in that they are prone to output statements that are false or inconsistent, indicating a lack of knowledge. A proposed solution to this is to provide the model with additional data modalities that complements the knowledge obtained through text. We investigate the use of visual data to complement the knowledge of large langua… ▽ More

    Submitted 30 September, 2021; v1 submitted 23 September, 2021; originally announced September 2021.