Showing 1–6 of 6 results for author: Vann, D

Searching in archive cs.
  1. arXiv:2509.19143  [pdf, ps, other]

    cs.CL cs.AI cs.CY

    Anecdoctoring: Automated Red-Teaming Across Language and Place

    Authors: Alejandro Cuevas, Saloni Dash, Bharat Kumar Nayak, Dan Vann, Madeleine I. G. Daepp

    Abstract: Disinformation is among the top risks of generative artificial intelligence (AI) misuse. Global adoption of generative AI necessitates red-teaming evaluations (i.e., systematic adversarial probing) that are robust across diverse languages and cultures, but red-teaming datasets are commonly US- and English-centric. To address this gap, we propose "anecdoctoring", a novel red-teaming approach that a…

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: To be published in EMNLP 2025

  2. arXiv:2504.00928  [pdf, ps, other]

    cs.CL cs.CY

    Taxonomizing Representational Harms using Speech Act Theory

    Authors: Emily Corvi, Hannah Washington, Stefanie Reed, Chad Atalla, Alexandra Chouldechova, P. Alex Dow, Jean Garcia-Gathright, Nicholas Pangakis, Emily Sheng, Dan Vann, Matthew Vogel, Hanna Wallach

    Abstract: Representational harms are widely recognized among fairness-related harms caused by generative language systems. However, their definitions are commonly under-specified. We make a theoretical contribution to the specification of representational harms by introducing a framework, grounded in speech act theory (Austin, 1962), that conceptualizes representational harms caused by generative language s…

    Submitted 7 June, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

    Comments: In Findings of the Association for Computational Linguistics (ACL), 2025

  3. arXiv:2502.00561  [pdf, ps, other]

    cs.CY

    Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge

    Authors: Hanna Wallach, Meera Desai, A. Feder Cooper, Angelina Wang, Chad Atalla, Solon Barocas, Su Lin Blodgett, Alexandra Chouldechova, Emily Corvi, P. Alex Dow, Jean Garcia-Gathright, Alexandra Olteanu, Nicholas Pangakis, Stefanie Reed, Emily Sheng, Dan Vann, Jennifer Wortman Vaughan, Matthew Vogel, Hannah Washington, Abigail Z. Jacobs

    Abstract: The measurement tasks involved in evaluating generative AI (GenAI) systems lack sufficient scientific rigor, leading to what has been described as "a tangle of sloppy tests [and] apples-to-oranges comparisons" (Roose, 2024). In this position paper, we argue that the ML community would benefit from learning from and drawing on the social sciences when developing and using measurement instruments fo…

    Submitted 6 June, 2025; v1 submitted 1 February, 2025; originally announced February 2025.

    Comments: In Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025

  4. arXiv:2412.01934  [pdf, other]

    cs.CY

    A Shared Standard for Valid Measurement of Generative AI Systems' Capabilities, Risks, and Impacts

    Authors: Alexandra Chouldechova, Chad Atalla, Solon Barocas, A. Feder Cooper, Emily Corvi, P. Alex Dow, Jean Garcia-Gathright, Nicholas Pangakis, Stefanie Reed, Emily Sheng, Dan Vann, Matthew Vogel, Hannah Washington, Hanna Wallach

    Abstract: The valid measurement of generative AI (GenAI) systems' capabilities, risks, and impacts forms the bedrock of our ability to evaluate these systems. We introduce a shared standard for valid measurement that helps place many of the disparate-seeming evaluation practices in use today on a common footing. Our framework, grounded in measurement theory from the social sciences, extends the work of Adco…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: NeurIPS 2024 Workshop on Statistical Foundations of LLMs and Foundation Models (SFLLM)

  5. arXiv:2411.10939  [pdf, other]

    cs.CY

    Evaluating Generative AI Systems is a Social Science Measurement Challenge

    Authors: Hanna Wallach, Meera Desai, Nicholas Pangakis, A. Feder Cooper, Angelina Wang, Solon Barocas, Alexandra Chouldechova, Chad Atalla, Su Lin Blodgett, Emily Corvi, P. Alex Dow, Jean Garcia-Gathright, Alexandra Olteanu, Stefanie Reed, Emily Sheng, Dan Vann, Jennifer Wortman Vaughan, Matthew Vogel, Hannah Washington, Abigail Z. Jacobs

    Abstract: Across academia, industry, and government, there is an increasing awareness that the measurement tasks involved in evaluating generative AI (GenAI) systems are especially difficult. We argue that these measurement tasks are highly reminiscent of measurement tasks found throughout the social sciences. With this in mind, we present a framework, grounded in measurement theory from the social sciences…

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 Workshop on Evaluating Evaluations (EvalEval)

  6. arXiv:2310.17750  [pdf, other]

    cs.CL

    A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications

    Authors: Ahmed Magooda, Alec Helyar, Kyle Jackson, David Sullivan, Chad Atalla, Emily Sheng, Dan Vann, Richard Edgar, Hamid Palangi, Roman Lutz, Hongliang Kong, Vincent Yun, Eslam Kamal, Federico Zarfati, Hanna Wallach, Sarah Bird, Mei Chen

    Abstract: We present a framework for the automated measurement of responsible AI (RAI) metrics for large language models (LLMs) and associated products and services. Our framework for automatically measuring harms from LLMs builds on existing technical and sociotechnical expertise and leverages the capabilities of state-of-the-art LLMs, such as GPT-4. We use this framework to run through several case studie…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: This is a living document