Skip to main content

Showing 1–26 of 26 results for author: Guillou, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03867  [pdf, ps, other

    cs.CL

    EuroGEST: Investigating gender stereotypes in multilingual language models

    Authors: Jacqueline Rowe, Mateusz Klimaszewski, Liane Guillou, Shannon Vallor, Alexandra Birch

    Abstract: Large language models increasingly support multiple languages, yet most benchmarks for gender bias remain English-centric. We introduce EuroGEST, a dataset designed to measure gender-stereotypical reasoning in LLMs across English and 29 European languages. EuroGEST builds on an existing expert-informed benchmark covering 16 gender stereotypes, expanded in this work using translation tools, quality… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 8 pages, 6 figures, 1 table

  2. arXiv:2505.21266  [pdf, ps, other

    cs.DC

    Distributed Discrete Morse Sandwich: Efficient Computation of Persistence Diagrams for Massive Scalar Data

    Authors: Eve Le Guillou, Pierre Fortin, Julien Tierny

    Abstract: The persistence diagram, which describes the topological features of a dataset, is a key descriptor in Topological Data Analysis. The "Discrete Morse Sandwich" (DMS) method has been reported to be the most efficient algorithm for computing persistence diagrams of 3D scalar fields on a single node, using shared-memory parallelism. In this work, we extend DMS to distributed-memory parallelism for th… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2504.20699  [pdf, other

    cs.CL cs.AI

    Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?

    Authors: Evangelia Gogoulou, Shorouq Zahra, Liane Guillou, Luise Dürlich, Joakim Nivre

    Abstract: A frequently observed problem with LLMs is their tendency to generate output that is nonsensical, illogical, or factually incorrect, often referred to broadly as hallucination. Building on the recently proposed HalluciGen task for hallucination detection and generation, we evaluate a suite of open-access LLMs on their ability to detect intrinsic hallucinations in two conditional generation tasks:… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  4. arXiv:2504.11975  [pdf, other

    cs.CL

    SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Raúl Vázquez, Timothee Mickus, Elaine Zosa, Teemu Vahtola, Jörg Tiedemann, Aman Sinha, Vincent Segonne, Fernando Sánchez-Vega, Alessandro Raganato, Jindřich Libovický, Jussi Karlgren, Shaoxiong Ji, Jindřich Helcl, Liane Guillou, Ona de Gibert, Jaione Bengoetxea, Joseph Attieh, Marianna Apidianaki

    Abstract: We present the Mu-SHROOM shared task which is focused on detecting hallucinations and other overgeneration mistakes in the output of instruction-tuned large language models (LLMs). Mu-SHROOM addresses general-purpose LLMs in 14 languages, and frames the hallucination detection problem as a span-labeling task. We received 2,618 submissions from 43 participating teams employing diverse methodologies… ▽ More

    Submitted 28 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Mu-SHROOM is part of SemEval-2025 (Task 3). TBP: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)

  5. arXiv:2503.10267  [pdf, ps, other

    cs.CL

    An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)

    Authors: Laurie Burchell, Ona de Gibert, Nikolay Arefyev, Mikko Aulamo, Marta Bañón, Pinzhen Chen, Mariia Fedorova, Liane Guillou, Barry Haddow, Jan Hajič, Jindřich Helcl, Erik Henriksson, Mateusz Klimaszewski, Ville Komulainen, Andrey Kutuzov, Joona Kytöniemi, Veronika Laippala, Petter Mæhlum, Bhavitvya Malik, Farrokh Mehryary, Vladislav Mikhailov, Nikita Moghe, Amanda Myntti, Dayyán O'Brien, Stephan Oepen , et al. (10 additional authors not shown)

    Abstract: Training state-of-the-art large language models requires vast amounts of clean and diverse textual data. However, building suitable multilingual datasets remains a challenge. In this work, we present HPLT v2, a collection of high-quality multilingual monolingual and parallel corpora, extending prior work of the HPLT project. The monolingual portion of the data contains 8T tokens covering 193 langu… ▽ More

    Submitted 4 June, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: ACL'2025 Main Proceedings

  6. arXiv:2502.10338  [pdf, ps, other

    cs.CL cs.AI

    Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering

    Authors: Nick Ferguson, Liane Guillou, Alan Bundy, Kwabena Nuamah

    Abstract: Large Language Models (LLMs) excel in natural language tasks but still face challenges in Question Answering (QA) tasks requiring complex, multi-step reasoning. We outline the types of reasoning required in some of these tasks, and reframe them in terms of meta-level reasoning (akin to high-level strategic reasoning or planning) and object-level reasoning (embodied in lower-level tasks such as mat… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: 8 pages. Accepted to the Workshop on Planning in the Era of LLMs (LM4Plan @ AAAI 2025)

  7. arXiv:2406.15202  [pdf, other

    cs.LO cs.MA

    Phase-Bounded Broadcast Networks over Topologies of Communication

    Authors: Lucie Guillou, Arnaud Sangnier, Nathalie Sznajder

    Abstract: We study networks of processes that all execute the same finite state protocol and that communicate through broadcasts. The processes are organized in a graph (a topology) and only the neighbors of a process in this graph can receive its broadcasts. The coverability problem asks, given a protocol and a state of the protocol, whether there is a topology for the processes such that one of them (at l… ▽ More

    Submitted 4 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: long version of a paper accepted to appear at CONCUR 2024

  8. arXiv:2403.18591  [pdf, ps, other

    cs.LO cs.CL cs.MA

    Safety Verification of Wait-Only Non-Blocking Broadcast Protocols

    Authors: Lucie Guillou, Arnaud Sangnier, Nathalie Sznajder

    Abstract: We study networks of processes that all execute the same finite protocol and communicate synchronously in two different ways: a process can broadcast one message to all other processes or send it to at most one other process. In both cases, if no process can receive the message, it will still be sent. We establish a precise complexity class for two coverability problems with a parameterised number… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Long version of a paper accepted to PetriNets 2024

  9. arXiv:2401.16313  [pdf, other

    cs.CL

    Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

    Authors: Nikita Moghe, Arnisa Fazla, Chantal Amrhein, Tom Kocmi, Mark Steedman, Alexandra Birch, Rico Sennrich, Liane Guillou

    Abstract: Recent machine translation (MT) metrics calibrate their effectiveness by correlating with human judgement but without any insights about their behaviour across different error types. Challenge sets are used to probe specific dimensions of metric behaviour but there are very few such datasets and they either focus on a limited number of phenomena or a limited number of language pairs. We introduce… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2210.15615

  10. arXiv:2311.13936  [pdf, other

    cs.DC

    Process-Commutative Distributed Objects: From Cryptocurrencies to Byzantine-Fault-Tolerant CRDTs

    Authors: Davide Frey, Lucie Guillou, Michel Raynal, François Taïani

    Abstract: This paper explores the territory that lies between best-effort Byzantine-Fault-Tolerant Conflict-free Replicated Data Types (BFT CRDTs) and totally ordered distributed ledgers, such as those implemented by Blockchains. It formally characterizes a novel class of distributed objects that only requires a First In First Out (FIFO) order on the object operations from each process (taken individually).… ▽ More

    Submitted 8 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: A preliminary version of this work appeared at the 2021 International Conference on Parallel Computing Technologies (PaCT 2021)

  11. arXiv:2311.01153  [pdf, other

    cs.CL

    ACES: Translation Accuracy Challenge Sets at WMT 2023

    Authors: Chantal Amrhein, Nikita Moghe, Liane Guillou

    Abstract: We benchmark the performance of segmentlevel metrics submitted to WMT 2023 using the ACES Challenge Set (Amrhein et al., 2022). The challenge set consists of 36K examples representing challenges from 68 phenomena and covering 146 language pairs. The phenomena range from simple perturbations at the word/character level to more complex errors based on discourse and real-world knowledge. For each met… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Camera Ready WMT 2023. arXiv admin note: text overlap with arXiv:2210.15615

  12. arXiv:2310.08339  [pdf, other

    cs.DC cs.CG cs.CV cs.LG cs.MS

    TTK is Getting MPI-Ready

    Authors: Eve Le Guillou, Michael Will, Pierre Guillou, Jonas Lukasczyk, Pierre Fortin, Christoph Garth, Julien Tierny

    Abstract: This system paper documents the technical foundations for the extension of the Topology ToolKit (TTK) to distributed-memory parallelism with the Message Passing Interface (MPI). While several recent papers introduced topology-based approaches for distributed-memory environments, these were reporting experiments obtained with tailored, mono-algorithm implementations. In contrast, we describe in thi… ▽ More

    Submitted 15 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 18 pages, 13 figures

  13. arXiv:2307.04546  [pdf, other

    cs.LO cs.MA

    Safety Analysis of Parameterised Networks with Non-Blocking Rendez-Vous

    Authors: Lucie Guillou, Arnaud Sangnier, Nathalie Sznajder

    Abstract: We consider networks of processes that all execute the same finite-state protocol and communicate via a rendez-vous mechanism. When a process requests a rendez-vous, another process can respond to it and they both change their control states accordingly. We focus here on a specific semantics, called non-blocking, where the process requesting a rendez-vous can change its state even if no process ca… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Long version of of a paper accepted at CONCUR 2023

    ACM Class: C.2.4; F.4.3

  14. arXiv:2306.01517  [pdf, ps, other

    cs.LO cs.DC cs.FL

    Parameterized Broadcast Networks with Registers: from NP to the Frontiers of Decidability

    Authors: Lucie Guillou, Corto Mascle, Nicolas Waldburger

    Abstract: We consider the parameterized verification of arbitrarily large networks of agents which communicate by broadcasting and receiving messages. In our model, the broadcast topology is reconfigurable so that a sent message can be received by any set of agents. In addition, agents have local registers which are initially distinct and may therefore be thought of as identifiers. When an agent broadcasts… ▽ More

    Submitted 4 March, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Long version of a paper published at FoSSaCS 2024

  15. arXiv:2212.10455  [pdf, other

    cs.CL

    MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue

    Authors: Nikita Moghe, Evgeniia Razumovskaia, Liane Guillou, Ivan Vulić, Anna Korhonen, Alexandra Birch

    Abstract: Task-oriented dialogue (TOD) systems have been widely deployed in many industries as they deliver more efficient customer support. These systems are typically constructed for a single domain or language and do not generalise well beyond this. To support work on Natural Language Understanding (NLU) in TOD across multiple languages and domains simultaneously, we constructed MULTI3NLU++, a multilingu… ▽ More

    Submitted 19 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (Findings) Camera Ready

  16. arXiv:2210.15615  [pdf, other

    cs.CL

    ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics

    Authors: Chantal Amrhein, Nikita Moghe, Liane Guillou

    Abstract: As machine translation (MT) metrics improve their correlation with human judgement every year, it is crucial to understand the limitations of such metrics at the segment level. Specifically, it is important to investigate metric behaviour when facing accuracy errors in MT because these can have dangerous consequences in certain contexts (e.g., legal, medical). We curate ACES, a translation accurac… ▽ More

    Submitted 6 December, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: preprint for WMT 2022 with updated tables

    ACM Class: I.2.7

  17. arXiv:2206.02737  [pdf, other

    cs.CL

    Investigating the use of Paraphrase Generation for Question Reformulation in the FRANK QA system

    Authors: Nick Ferguson, Liane Guillou, Kwabena Nuamah, Alan Bundy

    Abstract: We present a study into the ability of paraphrase generation methods to increase the variety of natural language questions that the FRANK Question Answering system can answer. We first evaluate paraphrase generation methods on the LC-QuAD 2.0 dataset using both automatic metrics and human judgement, and discuss their correlation. Error analysis on the dataset is also performed using both automatic… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 14 pages, 6 figures

  18. arXiv:2203.06264  [pdf, other

    cs.CL

    Cross-lingual Inference with A Chinese Entailment Graph

    Authors: Tianyi Li, Sabine Weber, Mohammad Javad Hosseini, Liane Guillou, Mark Steedman

    Abstract: Predicate entailment detection is a crucial task for question-answering from text, where previous work has explored unsupervised learning of entailment graphs from typed open relation triples. In this paper, we present the first pipeline for building Chinese entailment graphs, which involves a novel high-recall open relation extraction (ORE) method and the first Chinese fine-grained entity typing… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022

  19. arXiv:2201.10432  [pdf, ps, other

    cs.LO cs.MA

    Parameterized Analysis of Reconfigurable Broadcast Networks (Long Version)

    Authors: A. R. Balasubramanian, Lucie Guillou, Chana Weil-Kennedy

    Abstract: Reconfigurable broadcast networks (RBN) are a model of distributed computation in which agents can broadcast messages to other agents using some underlying communication topology which can change arbitrarily over the course of executions. In this paper, we conduct parameterized analysis of RBN. We consider cubes,(infinite) sets of configurations in the form of lower and upper bounds on the number… ▽ More

    Submitted 11 July, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: This is the long version of a paper accepted at FoSSaCS 2022. Erratum: The proof of Theorem 2 contains a mistake, kindly pointed out by Nicolas Waldburger. We are working on a solution

  20. arXiv:2109.10227  [pdf, other

    cs.CL

    Blindness to Modality Helps Entailment Graph Mining

    Authors: Liane Guillou, Sander Bijl de Vroe, Mark Johnson, Mark Steedman

    Abstract: Understanding linguistic modality is widely seen as important for downstream tasks such as Question Answering and Knowledge Graph Population. Entailment Graph learning might also be expected to benefit from attention to modality. We build Entailment Graphs using a news corpus filtered with a modality parser, and show that stripping modal modifiers from predicates in fact increases performance. Thi… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: To appear at the Workshop on Insights from Negative Results in NLP at EMNLP 2021

  21. arXiv:2109.09412  [pdf, other

    cs.CL

    Incorporating Temporal Information in Entailment Graph Mining

    Authors: Liane Guillou, Sander Bijl de Vroe, Mohammad Javad Hosseini, Mark Johnson, Mark Steedman

    Abstract: We present a novel method for injecting temporality into entailment graphs to address the problem of spurious entailments, which may arise from similar but temporally distinct events involving the same pair of entities. We focus on the sports domain in which the same pairs of teams play on different occasions, with different outcomes. We present an unsupervised model that aims to learn entailments… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: L. Guillou, S. Bijl de Vroe, M.J. Hosseini, M. Johnson, and M. Steedman. 2020. Incorporating temporal information in entailment graph mining. In Proceedings of the Graph-based Methods for Natural Language Processing (TextGraphs), pages 60-71, Barcelona, Spain (Online). Association for Computational Linguistics

    Journal ref: In Proceedings of TextGraphs 2020, pages 60-71, Barcelona, Spain (Online)

  22. Modality and Negation in Event Extraction

    Authors: Sander Bijl de Vroe, Liane Guillou, Miloš Stanojević, Nick McKenna, Mark Steedman

    Abstract: Language provides speakers with a rich system of modality for expressing thoughts about events, without being committed to their actual occurrence. Modality is commonly used in the political news domain, where both actual and possible courses of events are discussed. NLP systems struggle with these semantic phenomena, often incorrectly extracting events which did not happen, which can lead to issu… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: S. Bijl de Vroe, L. Guillou, M. Stanojević, N. McKenna, and M. Steedman. 2021. Modality and Negation in Event Extraction. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 31-42, online. Association for Computational Linguistics

    Journal ref: In Proceedings of CASE 2021, pages 31-42, online. Association for Computational Linguistics

  23. arXiv:2104.07846  [pdf, other

    cs.CL

    Multivalent Entailment Graphs for Question Answering

    Authors: Nick McKenna, Liane Guillou, Mohammad Javad Hosseini, Sander Bijl de Vroe, Mark Johnson, Mark Steedman

    Abstract: Drawing inferences between open-domain natural language predicates is a necessity for true language understanding. There has been much progress in unsupervised learning of entailment graphs for this purpose. We make three contributions: (1) we reinterpret the Distributional Inclusion Hypothesis to model entailment between predicates of different valencies, like DEFEAT(Biden, Trump) entails WIN(Bid… ▽ More

    Submitted 19 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  24. arXiv:1911.12091  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction

    Authors: Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jörg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber, Andrei Popescu-Belis

    Abstract: We describe the design, the evaluation setup, and the results of the 2016 WMT shared task on cross-lingual pronoun prediction. This is a classification task in which participants are asked to provide predictions on what pronoun class label should replace a placeholder value in the target-language text, provided in lemmatised and PoS-tagged form. We provided four subtasks, for the English-French an… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: cross-lingual pronoun prediction, WMT, shared task, English, German, French

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: WMT-2016

  25. arXiv:1808.10196  [pdf, ps, other

    cs.CL

    Pronoun Translation in English-French Machine Translation: An Analysis of Error Types

    Authors: Christian Hardmeier, Liane Guillou

    Abstract: Pronouns are a long-standing challenge in machine translation. We present a study of the performance of a range of rule-based, statistical and neural MT systems on pronoun translation based on an extensive manual evaluation using the PROTEST test suite, which enables a fine-grained analysis of different pronoun types and sheds light on the difficulties of the task. We find that the rule-based appr… ▽ More

    Submitted 30 August, 2018; originally announced August 2018.

  26. arXiv:1808.04164  [pdf, ps, other

    cs.CL

    Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point

    Authors: Liane Guillou, Christian Hardmeier

    Abstract: We compare the performance of the APT and AutoPRF metrics for pronoun translation against a manually annotated dataset comprising human judgements as to the correctness of translations of the PROTEST test suite. Although there is some correlation with the human judgements, a range of issues limit the performance of the automated metrics. Instead, we recommend the use of semi-automatic metrics and… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018