Aligned with Whom? Direct and social goals for AI systems

Korinek, Anton; Balwit, Avital

Computer Science > Computers and Society

arXiv:2205.04279 (cs)

[Submitted on 9 May 2022]

Title:Aligned with Whom? Direct and social goals for AI systems

Authors:Anton Korinek, Avital Balwit

View PDF

Abstract:As artificial intelligence (AI) becomes more powerful and widespread, the AI alignment problem - how to ensure that AI systems pursue the goals that we want them to pursue - has garnered growing attention. This article distinguishes two types of alignment problems depending on whose goals we consider, and analyzes the different solutions necessitated by each. The direct alignment problem considers whether an AI system accomplishes the goals of the entity operating it. In contrast, the social alignment problem considers the effects of an AI system on larger groups or on society more broadly. In particular, it also considers whether the system imposes externalities on others. Whereas solutions to the direct alignment problem center around more robust implementation, social alignment problems typically arise because of conflicts between individual and group-level goals, elevating the importance of AI governance to mediate such conflicts. Addressing the social alignment problem requires both enforcing existing norms on their developers and operators and designing new norms that apply directly to AI systems.

Comments:	Prepared for the Oxford Handbook of AI Governance (23 pages, 2 figures)
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.04279 [cs.CY]
	(or arXiv:2205.04279v1 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2205.04279

Submission history

From: Anton Korinek [view email]
[v1] Mon, 9 May 2022 13:49:47 UTC (273 KB)

Computer Science > Computers and Society

Title:Aligned with Whom? Direct and social goals for AI systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Aligned with Whom? Direct and social goals for AI systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators