Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning

Town, Jared; Morrison, Zachary; Kamalapurkar, Rushikesh

arXiv:2210.16299 (eess)

[Submitted on 28 Oct 2022 (v1), last revised 30 May 2024 (this version, v4)]

Abstract: A key challenge in solving the deterministic inverse reinforcement learning (IRL) problem online and in real time is the existence of multiple solutions. Nonuniqueness necessitates the study of equivalent solutions, i.e., solutions that result in a different cost functional but the same feedback matrix, and of convergence to such solutions. While offline algorithms that converge to equivalent solutions have been developed in the literature, online, real-time techniques that address nonuniqueness are not available. In this paper, a regularized history stack observer that converges to approximately equivalent solutions of the IRL problem is developed. Novel data-richness conditions are developed to facilitate the analysis, and simulation results are provided to demonstrate the effectiveness of the developed technique.
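
The notion of equivalent solutions can be illustrated with a minimal sketch that is not taken from the paper. It assumes a scalar linear-quadratic setting (dynamics dx/dt = a x + b u, cost integrand q x^2 + r u^2), which the abstract's reference to a "feedback matrix" suggests but does not state. The function name `scalar_lqr_gain` and all numerical values are hypothetical. The sketch shows two different cost functionals that produce the same optimal feedback gain, so an observer of the resulting behavior cannot tell them apart.

```python
import math

def scalar_lqr_gain(a, b, q, r):
    """Optimal gain k (u = -k x) for dx/dt = a x + b u, cost integrand q x^2 + r u^2."""
    # Stabilizing root of the scalar Riccati equation 2 a p - (b^2 / r) p^2 + q = 0.
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    # Optimal feedback gain k = b p / r; it depends on q and r only through q / r.
    return b * p / r

# Hypothetical system parameters (illustrative assumptions only).
a, b = -1.0, 2.0

# Two different cost functionals: the second scales both weights by 10.
k1 = scalar_lqr_gain(a, b, q=3.0, r=1.0)
k2 = scalar_lqr_gain(a, b, q=30.0, r=10.0)

# The costs differ, but the feedback gains agree up to rounding.
print(math.isclose(k1, k2))
```

In this simplified setting, scaling the cost weights by a common positive factor is one source of nonuniqueness; the paper's analysis concerns the general matrix-valued case and is not reproduced here.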

Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG)
Cite as:	arXiv:2210.16299 [eess.SY]
	(or arXiv:2210.16299v4 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2210.16299

Submission history

From: Rushikesh Kamalapurkar
[v1] Fri, 28 Oct 2022 17:52:18 UTC (1,763 KB)
[v2] Tue, 18 Jul 2023 00:26:30 UTC (729 KB)
[v3] Thu, 20 Jul 2023 05:27:03 UTC (594 KB)
[v4] Thu, 30 May 2024 17:31:41 UTC (53 KB)
