Showing 1–7 of 7 results for author: Tonneau, M

  1. arXiv:2504.20519  [pdf]

    cs.CY cs.HC

    Conversations with AI Chatbots Increase Short-Term Vaccine Intentions But Do Not Outperform Standard Public Health Messaging

    Authors: Neil K. R. Sehgal, Sunny Rai, Manuel Tonneau, Anish K. Agarwal, Joseph Cappella, Melanie Kornides, Lyle Ungar, Alison Buttenheim, Sharath Chandra Guntuku

    Abstract: Large language model (LLM) based chatbots show promise in persuasive communication, but existing studies often rely on weak controls or focus on belief change rather than behavioral intentions or outcomes. This pre-registered multi-country (US, Canada, UK) randomized controlled trial involving 930 vaccine-hesitant parents evaluated brief (three-minute) multi-turn conversations with LLM-based chatb…

    Submitted 26 June, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

  2. arXiv:2503.03417  [pdf, ps, other]

    cs.CL cs.AI

    When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits

    Authors: Jabez Magomere, Emanuele La Malfa, Manuel Tonneau, Ashkan Kazemi, Scott Hale

    Abstract: Online misinformation remains a critical challenge, and fact-checkers increasingly rely on claim matching systems that use sentence embedding models to retrieve relevant fact-checks. However, as users interact with claims online, they often introduce edits, and it remains unclear whether current embedding models used in retrieval are robust to such edits. To investigate this, we introduce a pertur…

    Submitted 5 June, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted to ACL 2025 Findings

  3. arXiv:2411.15462  [pdf, ps, other]

    cs.CL

    HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter

    Authors: Manuel Tonneau, Diyi Liu, Niyati Malhotra, Scott A. Hale, Samuel P. Fraiberger, Victor Orozco-Olvera, Paul Röttger

    Abstract: To address the global challenge of online hate speech, prior research has developed detection models to flag such content on social media. However, due to systematic biases in evaluation datasets, the real-world effectiveness of these models remains unclear, particularly across geographies. We introduce HateDay, the first global hate speech dataset representative of social media settings, construc…

    Submitted 3 June, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

    Comments: ACL 2025 main conference. Data available at https://huggingface.co/datasets/manueltonneau/hateday

  4. arXiv:2404.17874  [pdf, other]

    cs.CL

    From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets

    Authors: Manuel Tonneau, Diyi Liu, Samuel Fraiberger, Ralph Schroeder, Scott A. Hale, Paul Röttger

    Abstract: Perceptions of hate can vary greatly across cultural contexts. Hate speech (HS) datasets, however, have traditionally been developed by language. This hides potential cultural biases, as one language may be spoken in different countries home to different cultures. In this work, we evaluate cultural bias in HS datasets by leveraging two interrelated cultural proxies: language and geography. We cond…

    Submitted 19 May, 2025; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted at WOAH (NAACL 2024). Please cite the ACL Anthology version: https://aclanthology.org/2024.woah-1.23/

  5. arXiv:2403.19260  [pdf, other]

    cs.CL

    NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

    Authors: Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Farouq, Lakshminarayanan Subramanian, Victor Orozco-Olvera, Samuel P. Fraiberger

    Abstract: To address the global issue of online hate, hate speech detection (HSD) systems are typically developed on datasets from the United States, thereby failing to generalize to English dialects from the Majority World. Furthermore, HSD models are often evaluated on non-representative samples, raising concerns about overestimating model performance in real-world settings. In this work, we introduce Nai…

    Submitted 24 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: ACL 2024 main conference. Data and models available at https://github.com/worldbank/NaijaHate

  6. Indian-BhED: A Dataset for Measuring India-Centric Biases in Large Language Models

    Authors: Khyati Khandelwal, Manuel Tonneau, Andrew M. Bean, Hannah Rose Kirk, Scott A. Hale

    Abstract: Large Language Models (LLMs), now used daily by millions, can encode societal biases, exposing their users to representational harms. A large body of scholarship on LLM bias exists but it predominantly adopts a Western-centric frame and attends comparatively less to bias levels and potential harms in the Global South. In this paper, we quantify stereotypical bias in popular LLMs according to an In…

    Submitted 9 August, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: To be published in GoodIT '24, doi:10.1145/3677525.3678666. 14 pages

  7. Multilingual Detection of Personal Employment Status on Twitter

    Authors: Manuel Tonneau, Dhaval Adjodah, João Palotti, Nir Grinberg, Samuel Fraiberger

    Abstract: Detecting disclosures of individuals' employment status on social media can provide valuable information to match job seekers with suitable vacancies, offer social protection, or measure labor market flows. However, identifying such personal disclosures is a challenging task due to their rarity in a sea of social media content and the variety of linguistic forms used to describe them. Here, we exa…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: ACL 2022 main conference. Data and models available at https://github.com/manueltonneau/twitter-unemployment