Skip to main content

Showing 1–21 of 21 results for author: Mickus, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.08966  [pdf, other

    cs.CL cs.LG cs.NE

    Pre-trained Language Models Learn Remarkably Accurate Representations of Numbers

    Authors: Marek Kadlčík, Michal Štefánik, Timothee Mickus, Michal Spiegel, Josef Kuchař

    Abstract: Pretrained language models (LMs) are prone to arithmetic errors. Existing work showed limited success in probing numeric values from models' representations, indicating that these errors can be attributed to the inherent unreliability of distributionally learned embeddings in representing exact quantities. However, we observe that previous probing methods are inadequate for the emergent structure… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2504.11975  [pdf, other

    cs.CL

    SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Raúl Vázquez, Timothee Mickus, Elaine Zosa, Teemu Vahtola, Jörg Tiedemann, Aman Sinha, Vincent Segonne, Fernando Sánchez-Vega, Alessandro Raganato, Jindřich Libovický, Jussi Karlgren, Shaoxiong Ji, Jindřich Helcl, Liane Guillou, Ona de Gibert, Jaione Bengoetxea, Joseph Attieh, Marianna Apidianaki

    Abstract: We present the Mu-SHROOM shared task which is focused on detecting hallucinations and other overgeneration mistakes in the output of instruction-tuned large language models (LLMs). Mu-SHROOM addresses general-purpose LLMs in 14 languages, and frames the hallucination detection problem as a span-labeling task. We received 2,618 submissions from 43 participating teams employing diverse methodologies… ▽ More

    Submitted 28 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Mu-SHROOM is part of SemEval-2025 (Task 3). TBP: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)

  3. arXiv:2503.01235  [pdf, other

    cs.CL

    Your Model is Overconfident, and Other Lies We Tell Ourselves

    Authors: Timothee Mickus, Aman Sinha, Raúl Vázquez

    Abstract: The difficulty intrinsic to a given example, rooted in its inherent ambiguity, is a key yet often overlooked factor in evaluating neural NLP models. We investigate the interplay and divergence among various metrics for assessing intrinsic difficulty, including annotator dissensus, training dynamics, and model confidence. Through a comprehensive analysis using 29 models on three datasets, we reveal… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  4. arXiv:2407.15489  [pdf, other

    cs.CL

    A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives

    Authors: Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann

    Abstract: Pretrained language models (PLMs) display impressive performances and have captured the attention of the NLP community. Establishing best practices in pretraining has, therefore, become a major focus of NLP research, especially since insights gained from monolingual English models may not necessarily apply to more complex multilingual models. One significant caveat of the current state of the art… ▽ More

    Submitted 7 October, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Proceedings of EMNLP 2024

  5. arXiv:2407.12626  [pdf, other

    cs.CL

    Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

    Authors: Aman Sinha, Timothee Mickus, Marianne Clausel, Mathieu Constant, Xavier Coubez

    Abstract: The success of pretrained language models (PLMs) across a spate of use-cases has led to significant investment from the NLP community towards building domain-specific foundational models. On the other hand, in mission critical settings such as biomedical applications, other aspects also factor in-chief of which is a model's ability to produce reasonable estimates of its own uncertainty. In the pre… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: BioNLP 2024

  6. arXiv:2407.04079  [pdf, other

    cs.CL

    AXOLOTL'24 Shared Task on Multilingual Explainable Semantic Change Modeling

    Authors: Mariia Fedorova, Timothee Mickus, Niko Partanen, Janine Siewert, Elena Spaziani, Andrey Kutuzov

    Abstract: This paper describes the organization and findings of AXOLOTL'24, the first multilingual explainable semantic change modeling shared task. We present new sense-annotated diachronic semantic change datasets for Finnish and Russian which were employed in the shared task, along with a surprise test-only German dataset borrowed from an existing source. The setup of AXOLOTL'24 is new to the semantic ch… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Proceedings of the 5th Workshop on Computational Approaches to Historical Language Change (ACL'24)

  7. arXiv:2404.17918  [pdf, other

    cs.CL

    I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures

    Authors: Timothee Mickus, Raúl Vázquez, Joseph Attieh

    Abstract: Modularity is a paradigm of machine translation with the potential of bringing forth models that are large at training time and small during inference. Within this field of study, modular approaches, and in particular attention bridges, have been argued to improve the generalization capabilities of models by fostering language-independent representations. In the present paper, we study whether mod… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

  8. arXiv:2403.16777  [pdf, other

    cs.CL

    Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?

    Authors: Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann

    Abstract: Multilingual pretraining and fine-tuning have remarkably succeeded in various natural language processing tasks. Transferring representations from one language to another is especially crucial for cross-lingual learning. One can expect machine translation objectives to be well suited to fostering such capabilities, as they involve the explicit alignment of semantically equivalent sentences from di… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  9. arXiv:2403.07726  [pdf, other

    cs.CL

    SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

    Abstract: This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applications, where correctness is often mission-critical. The shared task was conducted with a newly constructed dataset of 4000 model outputs labeled by 5 ann… ▽ More

    Submitted 29 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: SemEval 2024 shared task. Pre-review version

  10. arXiv:2403.07544  [pdf, other

    cs.CL

    MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki

    Authors: Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh, Michele Boggia, Ona De Gibert, Shaoxiong Ji, Niki Andreas Lopi, Alessandro Raganato, Raúl Vázquez, Jörg Tiedemann

    Abstract: NLP in the age of monolithic large language models is approaching its limits in terms of size and information that can be handled. The trend goes to modularization, a necessary step into the direction of designing smaller sub-networks and components with specialized functionality. In this paper, we present the MAMMOTH toolkit: a framework designed for training massively multilingual modular machin… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Presented as a demo at EACL 2024

  11. arXiv:2402.03191  [pdf, other

    cs.LG cs.CL

    Isotropy, Clusters, and Classifiers

    Authors: Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh

    Abstract: Whether embedding spaces use all their dimensions equally, i.e., whether they are isotropic, has been a recent subject of discussion. Evidence has been accrued both for and against enforcing isotropy in embedding spaces. In the present paper, we stress that isotropy imposes requirements on the embedding space that are not compatible with the presence of clusters -- which also negatively impacts li… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  12. arXiv:2310.11938  [pdf, other

    cs.CL

    Grounded and Well-rounded: A Methodological Approach to the Study of Cross-modal and Cross-lingual Grounding

    Authors: Timothee Mickus, Elaine Zosa, Denis Paperno

    Abstract: Grounding has been argued to be a crucial component towards the development of more complete and truly semantically competent artificial intelligence systems. Literature has divided into two camps: While some argue that grounding allows for qualitatively different generalizations, others believe it can be compensated by mono-modal data quantity. Limited empirical evidence has emerged for or agains… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: accepted to Findings of EMNLP 2023

  13. arXiv:2310.06977  [pdf, other

    cs.CL

    Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings

    Authors: Timothee Mickus, Raúl Vázquez

    Abstract: A recent body of work has demonstrated that Transformer embeddings can be linearly decomposed into well-defined sums of factors, that can in turn be related to specific network inputs or components. There is however still a dearth of work studying whether these mathematical reformulations are empirically meaningful. In the present work, we study representations from machine-translation decoders us… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to BlackBoxNLP 2023

  14. arXiv:2306.08433  [pdf, other

    cs.CL

    "Definition Modeling: To model definitions." Generating Definitions With Little to No Semantics

    Authors: Vincent Segonne, Timothee Mickus

    Abstract: Definition Modeling, the task of generating definitions, was first proposed as a means to evaluate the semantic quality of word embeddings-a coherent lexical semantic representations of a word in context should contain all the information necessary to generate its definition. The relative novelty of this task entails that we do not know which factors are actually relied upon by a Definition Modeli… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted at IWCS 2023

  15. arXiv:2206.03529  [pdf, other

    cs.CL

    How to Dissect a Muppet: The Structure of Transformer Embedding Spaces

    Authors: Timothee Mickus, Denis Paperno, Mathieu Constant

    Abstract: Pretrained embeddings based on the Transformer architecture have taken the NLP community by storm. We show that they can mathematically be reframed as a sum of vector factors and showcase how to use this reframing to study the impact of each component. We provide evidence that multi-head attentions and feed-forwards are not equally useful in all downstream applications, as well as a quantitative o… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: Accepted at TACL (pre-MIT Press publication version)

  16. arXiv:2205.13858  [pdf, other

    cs.CL

    Semeval-2022 Task 1: CODWOE -- Comparing Dictionaries and Word Embeddings

    Authors: Timothee Mickus, Kees van Deemter, Mathieu Constant, Denis Paperno

    Abstract: Word embeddings have advanced the state of the art in NLP across numerous tasks. Understanding the contents of dense neural representations is of utmost interest to the computational semantics community. We propose to focus on relating these opaque word vectors with human-readable definitions, as found in dictionaries. This problem naturally divides into two subtasks: converting definitions into e… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  17. arXiv:2108.07708  [pdf, other

    cs.CL

    A Game Interface to Study Semantic Grounding in Text-Based Models

    Authors: Timothee Mickus, Mathieu Constant, Denis Paperno

    Abstract: Can language models learn grounded representations from text distribution alone? This question is both central and recurrent in natural language processing; authors generally agree that grounding requires more than textual distribution. We propose to experimentally test this claim: if any two words have different meanings and yet cannot be distinguished from distribution alone, then grounding is o… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

  18. arXiv:2012.03833  [pdf, other

    cs.CL

    What Meaning-Form Correlation Has to Compose With

    Authors: Timothee Mickus, Timothée Bernard, Denis Paperno

    Abstract: Compositionality is a widely discussed property of natural languages, although its exact definition has been elusive. We focus on the proposal that compositionality can be assessed by measuring meaning-form correlation. We analyze meaning-form correlation on three sets of languages: (i) artificial toy languages tailored to be compositional, (ii) a set of English dictionary definitions, and (iii) a… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the 28th International Conference on Computational Linguistics (2020) 3737-3749

  19. arXiv:2012.00391  [pdf

    cs.NI

    IRIS: A Low Duty Cycle Cross-Layer Protocol for Long-Range Wireless Sensor Networks with Low Power Budget

    Authors: Yi Chu, Paul Mitchell, David Grace, Jonathan Roberts, Dominic White, Tautvydas Mickus

    Abstract: This paper presents a cross-layer protocol (IRIS) designed for long-range pipeline Wireless Sensor Networks with extremely low power budget, typically seen in a range of monitoring applications. IRIS uses ping packets initiated by a base station to travel through the multi-hop network and carry monitoring information. The protocol is able to operate with less than 1% duty cycle, thereby conforming… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  20. What do you mean, BERT? Assessing BERT as a Distributional Semantics Model

    Authors: Timothee Mickus, Denis Paperno, Mathieu Constant, Kees van Deemter

    Abstract: Contextualized word embeddings, i.e. vector representations for words in context, are naturally seen as an extension of previous noncontextual distributional semantic models. In this work, we focus on BERT, a deep neural network that produces contextualized embeddings and has set the state-of-the-art in several semantic tasks, and study the semantic coherence of its embedding space. While showing… ▽ More

    Submitted 8 May, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the Society for Computation in Linguistics: Vol. 3 (2020), Article 34

  21. arXiv:1911.05715  [pdf, other

    cs.CL

    Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling

    Authors: Timothee Mickus, Denis Paperno, Mathieu Constant

    Abstract: Defining words in a textual context is a useful task both for practical purposes and for gaining insight into distributed word representations. Building on the distributional hypothesis, we argue here that the most natural formalization of definition modeling is to treat it as a sequence-to-sequence task, rather than a word-to-sequence task: given an input sequence with a highlighted word, generat… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing, 30 September, 2019, University of Turku, Turku, Finland