PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories

Aroca-Ouellette, Stephane; Mackraz, Natalie; Theobald, Barry-John; Metcalf, Katherine

arXiv:2410.06273 (cs)

[Submitted on 8 Oct 2024]

Abstract: Accommodating human preferences is essential for creating AI agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs to infer preferences from user interactions, but they often produce broad and generic preferences, failing to capture the unique and individualized nature of human preferences. This paper introduces PREDICT, a method designed to enhance the precision and adaptability of preference inference. PREDICT incorporates three key elements: (1) iterative refinement of inferred preferences, (2) decomposition of preferences into constituent components, and (3) validation of preferences across multiple trajectories. We evaluate PREDICT on two distinct environments: a gridworld setting and a new text-domain environment (PLUME). PREDICT more accurately infers nuanced human preferences, improving over existing baselines by 66.2% (gridworld environment) and 41.0% (PLUME).

Subjects:	Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2410.06273 [cs.AI]
	(or arXiv:2410.06273v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2410.06273

Submission history

From: Katherine Metcalf
[v1] Tue, 8 Oct 2024 18:16:41 UTC (3,261 KB)
