Skip to main content

Showing 1–1 of 1 results for author: Sagun, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2411.08135  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    On the Role of Speech Data in Reducing Toxicity Detection Bias

    Authors: Samuel J. Bell, Mariano Coria Meglioli, Megan Richards, Eduardo Sánchez, Christophe Ropers, Skyler Wang, Adina Williams, Levent Sagun, Marta R. Costa-jussà

    Abstract: Text toxicity detection systems exhibit significant biases, producing disproportionate rates of false positives on samples mentioning demographic groups. But what about toxicity detection in speech? To investigate the extent to which text-based biases are mitigated by speech-based systems, we produce a set of high-quality group annotations for the multilingual MuTox dataset, and then leverage thes… ▽ More

    Submitted 16 May, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: Accepted at NAACL 2025

    Journal ref: In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (Volume 1), pages 1454-1468