Showing 1–3 of 3 results for author: Westerhoff, V

  1. arXiv:2503.16861  [pdf, other]

    cs.AI

    In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

    Authors: Shayne Longpre, Kevin Klyman, Ruth E. Appel, Sayash Kapoor, Rishi Bommasani, Michelle Sahar, Sean McGregor, Avijit Ghosh, Borhane Blili-Hamelin, Nathan Butters, Alondra Nelson, Amit Elazari, Andrew Sellars, Casey John Ellis, Dane Sherrets, Dawn Song, Harley Geiger, Ilona Cohen, Lauren McIlvenny, Madhulika Srikumar, Mark M. Jaycox, Markus Anderljung, Nadine Farid Johnson, Nicholas Carlini, Nicolas Miailhe, et al. (9 additional authors not shown)

    Abstract: The widespread deployment of general-purpose AI (GPAI) systems introduces significant new risks. Yet the infrastructure, practices, and norms for reporting flaws in GPAI systems remain seriously underdeveloped, lagging far behind more established fields like software security. Based on a collaboration between experts from the fields of software security, machine learning, law, social science, and…

    Submitted 25 March, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2501.07238  [pdf, other]

    cs.AI

    Lessons From Red Teaming 100 Generative AI Products

    Authors: Blake Bullwinkel, Amanda Minnich, Shiven Chawla, Gary Lopez, Martin Pouliot, Whitney Maxwell, Joris de Gruyter, Katherine Pratt, Saphir Qi, Nina Chikanov, Roman Lutz, Raja Sekhar Rao Dheekonda, Bolor-Erdene Jagdagdorj, Eugenia Kim, Justin Song, Keegan Hines, Daniel Jones, Giorgio Severi, Richard Lundeen, Sam Vaughan, Victoria Westerhoff, Pete Bryan, Ram Shankar Siva Kumar, Yonatan Zunger, Chang Kawaguchi, et al. (1 additional author not shown)

    Abstract: In recent years, AI red teaming has emerged as a practice for probing the safety and security of generative AI systems. Due to the nascency of the field, there are many open questions about how red teaming operations should be conducted. Based on our experience red teaming over 100 generative AI products at Microsoft, we present our internal threat model ontology and eight main lessons we have lea…

    Submitted 13 January, 2025; originally announced January 2025.

  3. arXiv:math/0211111  [pdf, ps, other]

    math.AP q-bio

    Control of Spatially Heterogeneous and Time-Varying Cellular Reaction Networks: A New Summation Law

    Authors: Mark A. Peletier, Hans V. Westerhoff, Boris N. Kholodenko

    Abstract: A hallmark of a plethora of intracellular signaling pathways is the spatial separation of activation and deactivation processes that potentially results in precipitous gradients of activated proteins. The classical Metabolic Control Analysis (MCA), which quantifies the influence of an individual process on a system variable as the control coefficient, cannot be applied to spatially separated pro…

    Submitted 6 November, 2002; originally announced November 2002.

    Comments: 19 pages, AMS-LaTeX, 6 eps figures included with geompsfi.sty

    Report number: MAS-R0226

    MSC Class: 35B30 (Primary) 35J60; 35K55 (Secondary)

    Journal ref: Journal of Theoretical Biology, Vol. 225, pp. 477-487 (2003)