Skip to main content

Showing 1–21 of 21 results for author: Spinde, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.00343  [pdf, ps, other

    cs.CL

    Leveraging Large Language Models for Automated Definition Extraction with TaxoMatic A Case Study on Media Bias

    Authors: Timo Spinde, Luyang Lin, Smi Hinterreiter, Isao Echizen

    Abstract: This paper introduces TaxoMatic, a framework that leverages large language models to automate definition extraction from academic literature. Focusing on the media bias domain, the framework encompasses data collection, LLM-based relevance classification, and extraction of conceptual definitions. Evaluated on a dataset of 2,398 manually rated articles, the study demonstrates the frameworks effecti… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM'25) (2025)

  2. arXiv:2503.16674  [pdf, other

    cs.CL

    Through the LLM Looking Glass: A Socratic Probing of Donkeys, Elephants, and Markets

    Authors: Molly Kennedy, Ayyoob Imani, Timo Spinde, Hinrich Schütze

    Abstract: While detecting and avoiding bias in LLM-generated text is becoming increasingly important, media bias often remains subtle and subjective, making it particularly difficult to identify and mitigate. In this study, we assess media bias in LLM-generated content and LLMs' ability to detect subtle ideological bias. We conduct this evaluation using two datasets, PoliGen and EconoLex, covering political… ▽ More

    Submitted 22 May, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

  3. arXiv:2412.19545  [pdf

    cs.HC

    Enhancing Media Literacy: The Effectiveness of (Human) Annotations and Bias Visualizations on Bias Detection

    Authors: Timo Spinde, Fei Wu, Wolfgang Gaissmaier, Gianluca Demartini, Helge Giese

    Abstract: Marking biased texts is a practical approach to increase media bias awareness among news consumers. However, little is known about the generalizability of such awareness to new topics or unmarked news articles, and the role of machine-generated bias labels in enhancing awareness remains unclear. This study tests how news consumers may be trained and pre-bunked to detect media bias with bias labels… ▽ More

    Submitted 30 December, 2024; v1 submitted 27 December, 2024; originally announced December 2024.

  4. arXiv:2411.11081  [pdf, other

    cs.CL

    The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection

    Authors: Tomas Horych, Christoph Mandl, Terry Ruas, Andre Greiner-Petter, Bela Gipp, Akiko Aizawa, Timo Spinde

    Abstract: High annotation costs from hiring or crowdsourcing complicate the creation of large, high-quality datasets needed for training reliable text classifiers. Recent research suggests using Large Language Models (LLMs) to automate the annotation process, reducing these costs while maintaining data quality. LLMs have shown promising results in annotating downstream tasks like hate speech detection and p… ▽ More

    Submitted 24 January, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

  5. arXiv:2407.17111  [pdf, other

    cs.HC

    News Ninja: Gamified Annotation of Linguistic Bias in Online News

    Authors: Smi Hinterreiter, Timo Spinde, Sebastian Oberdörfer, Isao Echizen, Marc Erich Latoschik

    Abstract: Recent research shows that visualizing linguistic bias mitigates its negative effects. However, reliable automatic detection methods to generate such visualizations require costly, knowledge-intensive training data. To facilitate data collection for media bias datasets, we present News Ninja, a game employing data-collecting game mechanics to generate a crowdsourced dataset. Before annotating sent… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  6. arXiv:2407.17045  [pdf, other

    cs.HC

    NewsUnfold: Creating a News-Reading Application That Indicates Linguistic Media Bias and Collects Feedback

    Authors: Smi Hinterreiter, Martin Wessel, Fabian Schliski, Isao Echizen, Marc Erich Latoschik, Timo Spinde

    Abstract: Media bias is a multifaceted problem, leading to one-sided views and impacting decision-making. A way to address digital media bias is to detect and indicate it automatically through machine-learning methods. However, such detection is limited due to the difficulty of obtaining reliable training data. Human-in-the-loop-based feedback mechanisms have proven an effective way to facilitate the data-g… ▽ More

    Submitted 29 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  7. arXiv:2403.07910  [pdf, ps, other

    cs.CY cs.CL

    MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions

    Authors: Tomáš Horych, Martin Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp, Timo Spinde

    Abstract: Media bias detection poses a complex, multifaceted problem traditionally tackled using single-task models and small in-domain datasets, consequently lacking generalizability. To address this, we introduce MAGPIE, the first large-scale multi-task pre-training approach explicitly tailored for media bias detection. To enable pre-training at scale, we present Large Bias Mixture (LBM), a compilation of… ▽ More

    Submitted 12 June, 2025; v1 submitted 26 February, 2024; originally announced March 2024.

    Journal ref: LREC-COLING 2024

  8. arXiv:2312.16148  [pdf, other

    cs.CL

    The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias

    Authors: Timo Spinde, Smi Hinterreiter, Fabian Haak, Terry Ruas, Helge Giese, Norman Meuschke, Bela Gipp

    Abstract: The way the media presents events can significantly affect public perception, which in turn can alter people's beliefs and views. Media bias describes a one-sided or polarizing perspective on a topic. This article summarizes the research on computational methods to detect media bias by systematically reviewing 3140 research papers published between 2019 and 2022. To structure our review and suppor… ▽ More

    Submitted 10 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  9. Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection

    Authors: Martin Wessel, Tomáš Horych, Terry Ruas, Akiko Aizawa, Bela Gipp, Timo Spinde

    Abstract: Although media bias detection is a complex multi-task problem, there is, to date, no unified benchmark grouping these evaluation tasks. We introduce the Media Bias Identification Benchmark (MBIB), a comprehensive benchmark that groups different types of media bias (e.g., linguistic, cognitive, political) under a common framework to test how prospective detection techniques generalize. After review… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: To be published in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23)

  10. A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents

    Authors: Norman Meuschke, Apurva Jagdale, Timo Spinde, Jelena Mitrović, Bela Gipp

    Abstract: Extracting information from academic PDF documents is crucial for numerous indexing, retrieval, and analysis use cases. Choosing the best tool to extract specific content elements is difficult because many, technically diverse tools are available, but recent performance benchmarks are rare. Moreover, such benchmarks typically cover only a few content elements like header metadata or bibliographic… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: iConference 2023

  11. Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles

    Authors: Timo Spinde, Jan-David Krieger, Terry Ruas, Jelena Mitrović, Franz Götz-Hahn, Akiko Aizawa, Bela Gipp

    Abstract: Media has a substantial impact on the public perception of events. A one-sided or polarizing perspective on any topic is usually described as media bias. One of the ways how bias in news articles can be introduced is by altering word choice. Biased word choices are not always obvious, nor do they exhibit high context-dependency. Hence, detecting bias is often difficult. We propose a Transformer-ba… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the iConference 2022

  12. Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

    Authors: Timo Spinde, Manuel Plank, Jan-David Krieger, Terry Ruas, Bela Gipp, Akiko Aizawa

    Abstract: Media coverage has a substantial effect on the public perception of events. Nevertheless, media outlets are often biased. One way to bias news articles is by altering the word choice. The automatic identification of bias by word choice is challenging, primarily due to the lack of a gold standard data set and high context dependencies. This paper presents BABE, a robust and diverse data set created… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: substantial text overlap with Ph.D. proposal by same author, part of dissertation arXiv:2112.13352

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP 2021

  13. A Domain-adaptive Pre-training Approach for Language Bias Detection in News

    Authors: Jan-David Krieger, Timo Spinde, Terry Ruas, Juhi Kulshrestha, Bela Gipp

    Abstract: Media bias is a multi-faceted construct influencing individual behavior and collective decision-making. Slanted news reporting is the result of one-sided and polarized writing which can occur in various forms. In this work, we focus on an important form of media bias, i.e. bias by word choice. Detecting biased word choices is a challenging task due to its linguistic complexity and the lack of repr… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Journal ref: Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries 2022 (JCDL)

  14. arXiv:2112.13352  [pdf, other

    cs.CL

    An Interdisciplinary Approach for the Automated Detection and Visualization of Media Bias in News Articles

    Authors: Timo Spinde

    Abstract: Media coverage has a substantial effect on the public perception of events. Nevertheless, media outlets are often biased. One way to bias news articles is by altering the word choice. The automatic identification of bias by word choice is challenging, primarily due to the lack of gold-standard data sets and high context dependencies. In this research project, I aim to devise data sets and methods… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

    Journal ref: 2021 IEEE International Conference on Data Mining Workshops (ICDMW)

  15. arXiv:2112.07421  [pdf, other

    cs.CL

    Towards A Reliable Ground-Truth For Biased Language Detection

    Authors: Timo Spinde, David Krieger, Manuel Plank, Bela Gipp

    Abstract: Reference texts such as encyclopedias and news articles can manifest biased language when objective reporting is substituted by subjective writing. Existing methods to detect bias mostly rely on annotated data to train machine learning models. However, low annotator agreement and comparability is a substantial drawback in available media bias corpora. To evaluate data collection options, we collec… ▽ More

    Submitted 17 December, 2021; v1 submitted 14 December, 2021; originally announced December 2021.

  16. arXiv:2112.07392  [pdf

    cs.CL

    Do You Think It's Biased? How To Ask For The Perception Of Media Bias

    Authors: Timo Spinde, Christina Kreuter, Wolfgang Gaissmaier, Felix Hamborg, Bela Gipp, Helge Giese

    Abstract: Media coverage possesses a substantial effect on the public perception of events. The way media frames events can significantly alter the beliefs and perceptions of our society. Nevertheless, nearly all media outlets are known to report news in a biased way. While such bias can be introduced by altering the word choice or omitting information, the perception of bias also varies largely depending o… ▽ More

    Submitted 16 December, 2021; v1 submitted 14 December, 2021; originally announced December 2021.

  17. arXiv:2112.07391  [pdf

    cs.CL

    TASSY -- A Text Annotation Survey System

    Authors: Timo Spinde, Kanishka Sinha, Norman Meuschke, Bela Gipp

    Abstract: We present a free and open-source tool for creating web-based surveys that include text annotation tasks. Existing tools offer either text annotation or survey functionality but not both. Combining the two input types is particularly relevant for investigating a reader's perception of a text which also depends on the reader's background, such as age, gender, and education. Our tool caters primaril… ▽ More

    Submitted 16 December, 2021; v1 submitted 14 December, 2021; originally announced December 2021.

  18. Identification of Biased Terms in News Articles by Comparison of Outlet-specific Word Embeddings

    Authors: Timo Spinde, Lada Rudnitckaia, Felix Hamborg, Bela Gipp

    Abstract: Slanted news coverage, also called media bias, can heavily influence how news consumers interpret and react to the news. To automatically identify biased language, we present an exploratory approach that compares the context of related words. We train two word embedding models, one on texts of left-wing, the other on right-wing news outlets. Our hypothesis is that a word's representations in both… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  19. arXiv:2110.09151  [pdf, other

    cs.CY cs.AI cs.HC

    How to Effectively Identify and Communicate Person-Targeting Media Bias in Daily News Consumption?

    Authors: Felix Hamborg, Timo Spinde, Kim Heinser, Karsten Donnay, Bela Gipp

    Abstract: Slanted news coverage strongly affects public opinion. This is especially true for coverage on politics and related issues, where studies have shown that bias in the news may influence elections and other collective decisions. Due to its viable importance, news coverage has long been studied in the social sciences, resulting in comprehensive models to describe it and effective yet costly methods t… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  20. arXiv:2105.11910  [pdf

    cs.CL

    MBIC -- A Media Bias Annotation Dataset Including Annotator Characteristics

    Authors: T. Spinde, L. Rudnitckaia, K. Sinha, F. Hamborg, B. Gipp, K. Donnay

    Abstract: Many people consider news articles to be a reliable source of information on current events. However, due to the range of factors influencing news agencies, such coverage may not always be impartial. Media bias, or slanted news coverage, can have a substantial impact on public perception of events, and, accordingly, can potentially alter the beliefs and views of the public. The main data gap in cu… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  21. Enabling News Consumers to View and Understand Biased News Coverage: A Study on the Perception and Visualization of Media Bias

    Authors: Timo Spinde, Felix Hamborg, Karsten Donnay, Angelica Becerra, Bela Gipp

    Abstract: Traditional media outlets are known to report political news in a biased way, potentially affecting the political beliefs of the audience and even altering their voting behaviors. Many researchers focus on automatically detecting and identifying media bias in the news, but only very few studies exist that systematically analyze how theses biases can be best visualized and communicated. We create t… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.