Showing 1–17 of 17 results for author: Beinborn, L

  1. arXiv:2410.22906  [pdf, other]

    cs.CL

    From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes

    Authors: Zébulon Goriely, Richard Diehl Martinez, Andrew Caines, Lisa Beinborn, Paula Buttery

    Abstract: Language models are typically trained on large corpora of text in their default orthographic form. However, this is not the only option; representing data as streams of phonemes can offer unique advantages, from deeper insights into phonological language acquisition to improved performance on sound-based tasks. The challenge lies in evaluating the impact of phoneme-based training, as most benchmar…

    Submitted 30 October, 2024; originally announced October 2024.

  2. arXiv:2410.11462  [pdf, other]

    cs.CL

    Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing

    Authors: Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn

    Abstract: Language models strongly rely on frequency information because they maximize the likelihood of tokens during pre-training. As a consequence, language models tend to not generalize well to tokens that are seldom seen during training. Moreover, maximum likelihood training has been discovered to give rise to anisotropy: representations of tokens in a model tend to cluster tightly in a high-dimensiona…

    Submitted 15 October, 2024; originally announced October 2024.

  3. arXiv:2403.19424  [pdf, other]

    cs.CL cs.AI

    The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement

    Authors: Jonathan Kamp, Lisa Beinborn, Antske Fokkens

    Abstract: Post-hoc explanation methods are an important tool for increasing model transparency for users. Unfortunately, the currently used methods for attributing token importance often yield diverging patterns. In this work, we study potential sources of disagreement across methods from a linguistic perspective. We find that different methods systematically select different classes of words and that metho…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Long paper accepted to LREC-Coling 2024 main conference. Please cite the conference proceedings version when available

  4. arXiv:2311.08886  [pdf, other]

    cs.CL

    CLIMB: Curriculum Learning for Infant-inspired Model Building

    Authors: Richard Diehl Martinez, Zebulon Goriely, Hope McGovern, Christopher Davis, Andrew Caines, Paula Buttery, Lisa Beinborn

    Abstract: We describe our team's contribution to the STRICT-SMALL track of the BabyLM Challenge. The challenge requires training a language model from scratch using only a relatively small training dataset of ten million words. We experiment with three variants of cognitively-motivated curriculum learning and analyze their effect on the performance of the model on linguistic evaluation tasks. In the vocabul…

    Submitted 15 November, 2023; originally announced November 2023.

  5. arXiv:2310.13348  [pdf, other]

    cs.CL

    Analyzing Cognitive Plausibility of Subword Tokenization

    Authors: Lisa Beinborn, Yuval Pinter

    Abstract: Subword tokenization has become the de-facto standard for tokenization, although comparative evaluations of subword vocabulary quality across languages are scarce. Existing evaluation studies focus on the effect of a tokenization algorithm on the performance in downstream tasks, or on engineering criteria such as the compression rate. We present a new evaluation paradigm that focuses on the cognit…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (main)

  6. arXiv:2310.05619  [pdf, other]

    cs.CL cs.AI

    Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods

    Authors: Jonathan Kamp, Lisa Beinborn, Antske Fokkens

    Abstract: Feature attribution scores are used for explaining the prediction of a text classifier to users by highlighting a k number of tokens. In this work, we propose a way to determine the number of optimal k tokens that should be displayed from sequential properties of the attribution scores. Our approach is dynamic across sentences, method-agnostic, and deals with sentence length bias. We compare agree…

    Submitted 3 November, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Short paper accepted to EMNLP 2023 main conference. Please cite the EMNLP version when available

  7. arXiv:2302.12695  [pdf, other]

    cs.CL cs.LG

    Cross-Lingual Transfer of Cognitive Processing Complexity

    Authors: Charlotte Pouw, Nora Hollenstein, Lisa Beinborn

    Abstract: When humans read a text, their eye movements are influenced by the structural complexity of the input sentences. This cognitive phenomenon holds across languages and recent studies indicate that multilingual language models utilize structural similarities between languages to facilitate cross-lingual transfer. We use sentence-level eye-tracking patterns as a cognitive indicator for structural comp…

    Submitted 27 February, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted at Findings of EACL 2023

    ACM Class: I.2.7

  8. arXiv:2209.14780  [pdf, other]

    cs.CL

    Perturbations and Subpopulations for Testing Robustness in Token-Based Argument Unit Recognition

    Authors: Jonathan Kamp, Lisa Beinborn, Antske Fokkens

    Abstract: Argument Unit Recognition and Classification aims at identifying argument units from text and classifying them as pro or against. One of the design choices that need to be made when developing systems for this task is what the unit of classification should be: segments of tokens or full sentences. Previous research suggests that fine-tuning language models on the token-level yields more robust res…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted at the 9th Workshop on Argument Mining, co-located with COLING 2022. Please cite the published version when available

  9. arXiv:2106.03471  [pdf, other]

    cs.CL cs.AI

    Relative Importance in Sentence Processing

    Authors: Nora Hollenstein, Lisa Beinborn

    Abstract: Determining the relative importance of the elements in a sentence is a key factor for effortless natural language understanding. For human language processing, we can approximate patterns of relative importance by measuring reading fixations using eye-tracking technology. In neural language models, gradient-based saliency methods indicate the relative importance of a token for the target objective…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: accepted at ACL 2021

  10. arXiv:2104.05433  [pdf, other]

    cs.CL

    Multilingual Language Models Predict Human Reading Behavior

    Authors: Nora Hollenstein, Federico Pirovano, Ce Zhang, Lena Jäger, Lisa Beinborn

    Abstract: We analyze if large language models are able to predict patterns of human reading behavior. We compare the performance of language-specific and multilingual pretrained transformer models to predict reading time measures reflecting natural human sentence processing on Dutch, English, German, and Russian texts. This results in accurate models of human reading behavior, which indicates that transform…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: accepted at NAACL 2021

  11. arXiv:2011.04592  [pdf, other]

    cs.CL cs.CV

    Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze

    Authors: Ece Takmaz, Sandro Pezzelle, Lisa Beinborn, Raquel Fernández

    Abstract: When speakers describe an image, they tend to look at objects before mentioning them. In this paper, we investigate such sequential cross-modal alignment by modelling the image description generation process computationally. We take as our starting point a state-of-the-art image captioning system and develop several model variants that exploit information from human gaze patterns recorded during l…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

  12. arXiv:2011.02070  [pdf, other]

    cs.CL

    Probing Multilingual BERT for Genetic and Typological Signals

    Authors: Taraka Rama, Lisa Beinborn, Steffen Eger

    Abstract: We probe the layers in multilingual BERT (mBERT) for phylogenetic and geographic language signals across 100 languages and compute language distances based on the mBERT representations. We 1) employ the language distances to infer and evaluate language trees, finding that they are close to the reference family tree in terms of quartet tree distance, 2) perform distance matrix regression analysis,…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: COLING 2020

  13. arXiv:2002.08880  [pdf, other]

    cs.CL

    The Fluidity of Concept Representations in Human Brain Signals

    Authors: Eva Hendrikx, Lisa Beinborn

    Abstract: Cognitive theories of human language processing often distinguish between concrete and abstract concepts. In this work, we analyze the discriminability of concrete and abstract concepts in fMRI data using a range of analysis methods. We find that the distinction can be decoded from the signal with an accuracy significantly above chance, but it is not found to be a relevant structuring factor in cl…

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: 12 pages, 5 figures, 1 table

  14. arXiv:1906.01539  [pdf, other]

    cs.AI cs.CL q-bio.NC

    Blackbox meets blackbox: Representational Similarity and Stability Analysis of Neural Language Models and Brains

    Authors: Samira Abnar, Lisa Beinborn, Rochelle Choenni, Willem Zuidema

    Abstract: In this paper, we define and apply representational stability analysis (ReStA), an intuitive way of analyzing neural language models. ReStA is a variant of the popular representational similarity analysis (RSA) in cognitive neuroscience. While RSA can be used to compare representations in models, model components, and human brains, ReStA compares instances of the same model, while systematically v…

    Submitted 5 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Journal ref: 2nd BlackBoxNLP workshop @ACL2019

  15. arXiv:1904.10820  [pdf, other]

    cs.CL cs.AI

    Semantic Drift in Multilingual Representations

    Authors: Lisa Beinborn, Rochelle Choenni

    Abstract: Multilingual representations have mostly been evaluated based on their performance on specific tasks. In this article, we look beyond engineering goals and analyze the relations between languages in computational representations. We introduce a methodology for comparing languages based on their organization of semantic concepts. We propose to conduct an adapted version of representational similari…

    Submitted 16 November, 2020; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Almost final version. Paper will appear in the Computational Linguistics Journal, Volume 46, Issue 3

  16. arXiv:1904.02547  [pdf, other]

    cs.CL cs.AI cs.LG

    Robust Evaluation of Language-Brain Encoding Experiments

    Authors: Lisa Beinborn, Samira Abnar, Rochelle Choenni

    Abstract: Language-brain encoding experiments evaluate the ability of language models to predict brain responses elicited by language stimuli. The evaluation scenarios for this task have not yet been standardized which makes it difficult to compare and interpret results. We perform a series of evaluation experiments with a consistent encoding setup and compute the results for multiple fMRI datasets. In addi…

    Submitted 4 April, 2019; originally announced April 2019.

  17. arXiv:1806.06371  [pdf, other]

    cs.CL cs.AI

    Multimodal Grounding for Language Processing

    Authors: Lisa Beinborn, Teresa Botschen, Iryna Gurevych

    Abstract: This survey discusses how recent developments in multimodal processing facilitate conceptual grounding of language. We categorize the information flow in multimodal processing with respect to cognitive models of human information processing and analyze different methods for combining multimodal representations. Based on this methodological inventory, we discuss the benefit of multimodal grounding…

    Submitted 3 July, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: The paper has been published in the Proceedings of the 27th International Conference on Computational Linguistics. Please refer to this version for citations: https://www.aclweb.org/anthology/papers/C/C18/C18-1197/