Skip to main content

Showing 1–21 of 21 results for author: Zarriess, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11807  [pdf, ps, other

    cs.CL

    Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?

    Authors: Simeon Junker, Manar Ali, Larissa Koch, Sina Zarrieß, Hendrik Buschmeier

    Abstract: We investigate the linguistic abilities of multimodal large language models in reference resolution tasks featuring simple yet abstract visual stimuli, such as color patches and color grids. Although the task may not seem challenging for today's language models, being straightforward for human dyads, we consider it to be a highly relevant probe of the pragmatic capabilities of MLLMs. Our results a… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: To appear in ACL Findings 2025

  2. arXiv:2506.11631  [pdf, ps, other

    cs.CL

    SceneGram: Conceptualizing and Describing Tangrams in Scene Context

    Authors: Simeon Junker, Sina Zarrieß

    Abstract: Research on reference and naming suggests that humans can come up with very different ways of conceptualizing and referring to the same object, e.g. the same abstract tangram shape can be a "crab", "sink" or "space ship". Another common assumption in cognitive science is that scene context fundamentally shapes our visual perception of objects and conceptual expectations. This paper contributes Sce… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: To appear in ACL Findings 2025

  3. arXiv:2506.08952  [pdf, ps, other

    cs.CL cs.AI

    Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions

    Authors: Clara Lachenmaier, Judith Sieker, Sina Zarrieß

    Abstract: Communication among humans relies on conversational grounding, allowing interlocutors to reach mutual understanding even when they do not have perfect knowledge and must resolve discrepancies in each other's beliefs. This paper investigates how large language models (LLMs) manage common ground in cases where they (don't) possess knowledge, focusing on facts in the political domain where the risk o… ▽ More

    Submitted 11 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: Preprint accepted at ACL Main Conference 2025

  4. arXiv:2505.22354  [pdf, ps, other

    cs.CL

    LLMs Struggle to Reject False Presuppositions when Misinformation Stakes are High

    Authors: Judith Sieker, Clara Lachenmaier, Sina Zarrieß

    Abstract: This paper examines how LLMs handle false presuppositions and whether certain linguistic factors influence their responses to falsely presupposed content. Presuppositions subtly introduce information as given, making them highly effective at embedding disputable or false information. This raises concerns about whether LLMs, like humans, may fail to detect and correct misleading assumptions introdu… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 8 pages (including References). Accepted at CogSci 2025

  5. arXiv:2503.22006  [pdf, other

    cs.CL cs.LG

    Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them

    Authors: Marc Brinner, Tarek Al Mustafa, Sina Zarrieß

    Abstract: We investigate the use of LLM-generated data for continual pretraining of encoder models in specialized domains with limited training data, using the scientific domain of invasion biology as a case study. To this end, we leverage domain-specific ontologies by enriching them with LLM-generated data and pretraining the encoder model as an ontology-informed embedding model for concept definitions. To… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  6. arXiv:2503.11593  [pdf, ps, other

    cs.CL

    Do Construction Distributions Shape Formal Language Learning In German BabyLMs?

    Authors: Bastian Bunzeck, Daniel Duran, Sina Zarrieß

    Abstract: We analyze the influence of utterance-level construction distributions in German child-directed/child-available speech on the resulting word-level, syntactic and semantic competence (and their underlying learning trajectories) in small LMs, which we train on a novel collection of developmentally plausible language data for German. We find that trajectories are surprisingly robust for markedly diff… ▽ More

    Submitted 17 June, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: Accepted at CoNNL 2025

  7. arXiv:2502.12835  [pdf, ps, other

    cs.CL

    Subword models struggle with word learning, but surprisal hides it

    Authors: Bastian Bunzeck, Sina Zarrieß

    Abstract: We study word learning in subword and character language models with the psycholinguistic lexical decision task. While subword LMs struggle to discern words and non-words with high accuracy, character LMs solve this task easily and consistently. Only when supplied with further contexts do subword LMs perform similarly to character models. Additionally, when looking at word-level and syntactic lear… ▽ More

    Submitted 2 June, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: Accepted to ACL 2025 (Main)

  8. arXiv:2502.06551  [pdf, other

    cs.CL

    Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments

    Authors: Marc Felix Brinner, Sina Zarrieß

    Abstract: This study explores strategies for efficiently classifying scientific full texts using both small, BERT-based models and local large language models like Llama-3.1 8B. We focus on developing methods for selecting subsets of input sentences to reduce input size while simultaneously enhancing classification performance. To this end, we compile a novel dataset consisting of full-text scientific paper… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  9. arXiv:2501.18287  [pdf, other

    cs.CL cs.AI cs.DL

    Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models

    Authors: Jennifer D'Souza, Zachary Laubach, Tarek Al Mustafa, Sina Zarrieß, Robert Frühstückl, Phyllis Illari

    Abstract: This paper presents an exploratory study that harnesses the capabilities of large language models (LLMs) to mine key ecological entities from invasion biology literature. Specifically, we focus on extracting species names, their locations, associated habitats, and ecosystems, information that is critical for understanding species spread, predicting future invasions, and informing conservation effo… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 8 pages, 2 figures, accepted to the NLP4Ecology Workshop 2025 (https://nlp4ecology2025.di.unito.it/) co-located with the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies

  10. arXiv:2501.12980  [pdf, other

    cs.CL

    Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities

    Authors: Florian Kankowski, Torgrim Solstad, Sina Zarriess, Oliver Bott

    Abstract: In this paper, we compare data generated with mono- and multilingual LLMs spanning a range of model sizes with data provided by human participants in an experimental setting investigating well-established discourse biases. Beyond the comparison as such, we aim to develop a benchmark to assess the capabilities of LLMs with discourse biases as a robust proxy for more general discourse understanding… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: 38 pages, 8 figures

  11. arXiv:2412.02427  [pdf, other

    cs.CL cs.AI

    GerPS-Compare: Comparing NER methods for legal norm analysis

    Authors: Sarah T. Bachinger, Christoph Unger, Robin Erd, Leila Feddoul, Clara Lachenmaier, Sina Zarrieß, Birgitta König-Ries

    Abstract: We apply NER to a particular sub-genre of legal texts in German: the genre of legal norms regulating administrative processes in public service administration. The analysis of such texts involves identifying stretches of text that instantiate one of ten classes identified by public service administration professionals. We investigate and compare three methods for performing Named Entity Recognitio… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  12. arXiv:2410.01487  [pdf

    cs.CL

    Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas

    Authors: Bastian Bunzeck, Daniel Duran, Leonie Schade, Sina Zarrieß

    Abstract: Recent work investigates whether LMs learn human-like linguistic generalizations and representations from developmentally plausible amounts of data. Yet, the basic linguistic units processed in these LMs are determined by subword-based tokenization, which limits their validity as models of learning at and below the word level. In this paper, we explore the potential of tokenization-free, phoneme-… ▽ More

    Submitted 3 January, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted at COLING 2025

  13. The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems

    Authors: Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß

    Abstract: We examine how users perceive the limitations of an AI system when it encounters a task that it cannot perform perfectly and whether providing explanations alongside its answers aids users in constructing an appropriate mental model of the system's capabilities and limitations. We employ a visual question answer and explanation task where we control the AI system's limitations by manipulating the… ▽ More

    Submitted 21 October, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 17 pages (including Appendix). Accepted at EMNLP 2024 main

    Journal ref: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp. 19459-19475

  14. arXiv:2406.15267  [pdf, other

    cs.CL

    Evaluating Diversity in Automatic Poetry Generation

    Authors: Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger

    Abstract: Natural Language Generation (NLG), and more generally generative AI, are among the currently most impactful research fields. Creative NLG, such as automatic poetry generation, is a fascinating niche in this area. While most previous research has focused on forms of the Turing test when evaluating automatic poetry generation -- can humans distinguish between automatic and human generated poetry --… ▽ More

    Submitted 8 November, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024 main; camera-ready

  15. arXiv:2404.12289  [pdf, other

    cs.CL

    Resilience through Scene Context in Visual Referring Expression Generation

    Authors: Simeon Junker, Sina Zarrieß

    Abstract: Scene context is well known to facilitate humans' perception of visible objects. In this paper, we investigate the role of context in Referring Expression Generation (REG) for objects in images, where existing research has often focused on distractor contexts that exert pressure on the generator. We take a new perspective on scene context in REG and hypothesize that contextual information can be c… ▽ More

    Submitted 23 August, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  16. arXiv:2302.10282  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions

    Authors: Henrik Voigt, Jan Hombeck, Monique Meuschke, Kai Lawonn, Sina Zarrieß

    Abstract: Existing language and vision models achieve impressive performance in image-text understanding. Yet, it is an open question to what extent they can be used for language understanding in 3D environments and whether they implicitly acquire 3D object knowledge, e.g. about different views of an object. In this paper, we investigate whether a state-of-the-art language and vision model, CLIP, is able to… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  17. arXiv:2101.12338  [pdf, other

    cs.RO cs.AI

    Enabling Robots to Draw and Tell: Towards Visually Grounded Multimodal Description Generation

    Authors: Ting Han, Sina Zarrieß

    Abstract: Socially competent robots should be equipped with the ability to perceive the world that surrounds them and communicate about it in a human-like manner. Representative skills that exhibit such ability include generating image descriptions and visually grounded referring expressions. In the NLG community, these generation tasks are largely investigated in non-interactive and language-only settings.… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: The 2nd Workshop on NLG for HRI colocated with The 13th International Conference on Natural Language Generation

  18. arXiv:1907.05084  [pdf, other

    cs.CL cs.CV

    MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment

    Authors: Nikolai Ilinykh, Sina Zarrieß, David Schlangen

    Abstract: Building computer systems that can converse about their visual environment is one of the oldest concerns of research in Artificial Intelligence and Computational Linguistics (see, for example, Winograd's 1972 SHRDLU system). Only recently, however, have methods from computer vision and natural language processing become powerful enough to make this vision seem more attainable. Pushed especially by… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (semdial / LondonLogue), London, September 2019

  19. arXiv:1906.05518  [pdf, other

    cs.CL

    Know What You Don't Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories

    Authors: Sina Zarrieß, David Schlangen

    Abstract: Zero-shot learning in Language & Vision is the task of correctly labelling (or naming) objects of novel categories. Another strand of work in L&V aims at pragmatically informative rather than ``correct'' object descriptions, e.g. in reference games. We combine these lines of research and model zero-shot reference games, where a speaker needs to successfully refer to a novel object in an image. Ins… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Accepted at ACL 2019

  20. The Code2Text Challenge: Text Generation in Source Code Libraries

    Authors: Kyle Richardson, Sina Zarrieß, Jonas Kuhn

    Abstract: We propose a new shared task for tactical data-to-text generation in the domain of source code libraries. Specifically, we focus on text generation of function descriptions from example software projects. Data is drawn from existing resources used for studying the related problem of semantic parser induction (Richardson and Kuhn, 2017b; Richardson and Kuhn, 2017a), and spans a wide variety of both… ▽ More

    Submitted 31 July, 2017; originally announced August 2017.

    Comments: Proceedings of INLG 2017, shared task track

  21. arXiv:1510.02125  [pdf, other

    cs.CL

    Resolving References to Objects in Photographs using the Words-As-Classifiers Model

    Authors: David Schlangen, Sina Zarriess, Casey Kennington

    Abstract: A common use of language is to refer to visually present objects. Modelling it in computers requires modelling the link between language and perception. The "words as classifiers" model of grounded semantics views words as classifiers of perceptual contexts, and composes the meaning of a phrase through composition of the denotations of its component words. It was recently shown to perform well in… ▽ More

    Submitted 3 June, 2016; v1 submitted 7 October, 2015; originally announced October 2015.

    Comments: 11 pages; as in Proceedings of ACL 2016, Berlin, 2016