"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)

Brindise, Noel; Hebbar, Vijeth; Shah, Riya; Langbort, Cedric

Computer Science > Machine Learning

arXiv:2506.09901 (cs)

[Submitted on 11 Jun 2025]

Title:"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)

Authors:Noel Brindise, Vijeth Hebbar, Riya Shah, Cedric Langbort

View PDF

Abstract:In this work, we provide an extended discussion of a new approach to explainable Reinforcement Learning called Diverse Near-Optimal Alternatives (DNA), first proposed at L4DC 2025. DNA seeks a set of reasonable "options" for trajectory-planning agents, optimizing policies to produce qualitatively diverse trajectories in Euclidean space. In the spirit of explainability, these distinct policies are used to "explain" an agent's options in terms of available trajectory shapes from which a human user may choose. In particular, DNA applies to value function-based policies on Markov decision processes where agents are limited to continuous trajectories. Here, we describe DNA, which uses reward shaping in local, modified Q-learning problems to solve for distinct policies with guaranteed epsilon-optimality. We show that it successfully returns qualitatively different policies that constitute meaningfully different "options" in simulation, including a brief comparison to related approaches in the stochastic optimization field of Quality Diversity. Beyond the explanatory motivation, this work opens new possibilities for exploration and adaptive planning in RL.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2506.09901 [cs.LG]
	(or arXiv:2506.09901v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.09901
Journal reference:	Proceedings of the 7th Annual Learning for Dynamics & Control Conference, PMLR 283:1194-1205, 2025

Submission history

From: Noel Brindise [view email]
[v1] Wed, 11 Jun 2025 16:15:56 UTC (163 KB)

Computer Science > Machine Learning

Title:"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators