Skip to main content

Showing 1–2 of 2 results for author: Bethke, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2004.09456  [pdf, other

    cs.CL cs.AI cs.CY

    StereoSet: Measuring stereotypical bias in pretrained language models

    Authors: Moin Nadeem, Anna Bethke, Siva Reddy

    Abstract: A stereotype is an over-generalized belief about a particular group of people, e.g., Asians are good at math or Asians are bad drivers. Such beliefs (biases) are known to hurt target groups. Since pretrained language models are trained on large real world data, they are known to capture stereotypical biases. In order to assess the adverse effects of these models, it is important to quantify the bi… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 9 pages, 6 tables, and 3 figures

  2. arXiv:1909.04251  [pdf, other

    cs.CL cs.AI cs.CY

    A Benchmark Dataset for Learning to Intervene in Online Hate Speech

    Authors: Jing Qian, Anna Bethke, Yinyin Liu, Elizabeth Belding, William Yang Wang

    Abstract: Countering online hate speech is a critical yet challenging task, but one which can be aided by the use of Natural Language Processing (NLP) techniques. Previous research has primarily focused on the development of NLP methods to automatically and effectively detect online hate speech while disregarding further action needed to calm and discourage individuals from using hate speech in the future.… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.