Skip to main content

Showing 1–1 of 1 results for author: Murris, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.09948  [pdf, other

    cs.CL

    Mitigating Text Toxicity with Counterfactual Generation

    Authors: Milan Bhan, Jean-Noel Vittaut, Nina Achache, Victor Legrand, Nicolas Chesneau, Annabelle Blangero, Juliette Murris, Marie-Jeanne Lesot

    Abstract: Toxicity mitigation consists in rephrasing text in order to remove offensive or harmful meaning. Neural natural language processing (NLP) models have been widely used to target and mitigate textual toxicity. However, existing methods fail to detoxify text while preserving the initial non-toxic meaning at the same time. In this work, we propose to apply counterfactual generation methods from the eX… ▽ More

    Submitted 28 May, 2025; v1 submitted 16 May, 2024; originally announced May 2024.