Identifying and Mitigating the Security Risks of Generative AI

Barrett, Clark; Boyd, Brad; Burzstein, Elie; Carlini, Nicholas; Chen, Brad; Choi, Jihye; Chowdhury, Amrita Roy; Christodorescu, Mihai; Datta, Anupam; Feizi, Soheil; Fisher, Kathleen; Hashimoto, Tatsunori; Hendrycks, Dan; Jha, Somesh; Kang, Daniel; Kerschbaum, Florian; Mitchell, Eric; Mitchell, John; Ramzan, Zulfikar; Shams, Khawaja; Song, Dawn; Taly, Ankur; Yang, Diyi

doi:10.1561/3300000041


arXiv:2308.14840 (cs)

[Submitted on 28 Aug 2023 (v1), last revised 29 Dec 2023 (this version, v4)]


Abstract: Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well by attackers to generate new attacks and increase the velocity and efficacy of existing attacks.
This paper reports the findings of a workshop held at Google (co-organized by Stanford University and the University of Wisconsin-Madison) on the dual-use dilemma posed by GenAI. This paper is not meant to be comprehensive, but is rather an attempt to synthesize some of the interesting findings from the workshop. We discuss short-term and long-term goals for the community on this topic. We hope this paper provides both a launching point for a discussion on this important topic as well as interesting problems that the research community can work to address.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.14840 [cs.AI]
	(or arXiv:2308.14840v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2308.14840
Journal reference:	Foundations and Trends in Privacy and Security 6 (2023) 1-52
Related DOI:	https://doi.org/10.1561/3300000041

Submission history

From: Mihai Christodorescu
[v1] Mon, 28 Aug 2023 18:51:09 UTC (4,090 KB)
[v2] Sun, 15 Oct 2023 05:05:12 UTC (4,093 KB)
[v3] Tue, 17 Oct 2023 23:27:11 UTC (4,093 KB)
[v4] Fri, 29 Dec 2023 00:30:34 UTC (1,157 KB)
