Skip to main content

Showing 1–4 of 4 results for author: Orlikowski, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.08217  [pdf, ps, other

    cs.CL cs.AI

    Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions

    Authors: Eve Fleisig, Matthias Orlikowski, Philipp Cimiano, Dan Klein

    Abstract: For machine learning datasets to accurately represent diverse opinions in a population, they must preserve variation in data labels while filtering out spam or low-quality responses. How can we balance annotator reliability and representation? We empirically evaluate how a range of heuristics for annotator filtering affect the preservation of variation on subjective tasks. We find that these metho… ▽ More

    Submitted 9 September, 2025; originally announced September 2025.

  2. arXiv:2502.20897  [pdf, other

    cs.CL

    Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions

    Authors: Matthias Orlikowski, Jiaxin Pei, Paul Röttger, Philipp Cimiano, David Jurgens, Dirk Hovy

    Abstract: People naturally vary in their annotations for subjective questions and some of this variation is thought to be due to the person's sociodemographic characteristics. LLMs have also been used to label data, but recent work has shown that models perform poorly when prompted with sociodemographic attributes, suggesting limited inherent sociodemographic knowledge. Here, we ask whether LLMs can be trai… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: Reviewed ARR December 2024

    ACM Class: I.2.7

  3. Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It's Best to Relate Perspectives!

    Authors: Philipp Heinisch, Matthias Orlikowski, Julia Romberg, Philipp Cimiano

    Abstract: Many annotation tasks in natural language processing are highly subjective in that there can be different valid and justified perspectives on what is a proper label for a given example. This also applies to the judgment of argument quality, where the assignment of a single ground truth is often questionable. At the same time, there are generally accepted concepts behind argumentation that form a c… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 main conference. First two authors contributed equally

  4. The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics

    Authors: Matthias Orlikowski, Paul Röttger, Philipp Cimiano, Dirk Hovy

    Abstract: Many NLP tasks exhibit human label variation, where different annotators give different labels to the same texts. This variation is known to depend, at least in part, on the sociodemographics of annotators. Recent research aims to model individual annotator behaviour rather than predicting aggregated labels, and we would expect that sociodemographic information is useful for these models. On the o… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: ACL2023 Camera-Ready