Adaptive Reward Design for Reinforcement Learning

Kwon, Minjae; ElSayed-Aly, Ingy; Feng, Lu

Computer Science > Robotics

arXiv:2412.10917 (cs)

[Submitted on 14 Dec 2024 (v1), last revised 17 May 2025 (this version, v2)]

Title:Adaptive Reward Design for Reinforcement Learning

Authors:Minjae Kwon, Ingy ElSayed-Aly, Lu Feng

View PDF HTML (experimental)

Abstract:There is a surge of interest in using formal languages such as Linear Temporal Logic (LTL) to precisely and succinctly specify complex tasks and derive reward functions for Reinforcement Learning (RL). However, existing methods often assign sparse rewards (e.g., giving a reward of 1 only if a task is completed and 0 otherwise). By providing feedback solely upon task completion, these methods fail to encourage successful subtask completion. This is particularly problematic in environments with inherent uncertainty, where task completion may be unreliable despite progress on intermediate goals. To address this limitation, we propose a suite of reward functions that incentivize an RL agent to complete a task specified by an LTL formula as much as possible, and develop an adaptive reward shaping approach that dynamically updates reward functions during the learning process. Experimental results on a range of benchmark RL environments demonstrate that the proposed approach generally outperforms baselines, achieving earlier convergence to a better policy with higher expected return and task completion rate.

Comments:	UAI 2025 Camera Ready Version
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
MSC classes:	68T40 (Primary) 93E35, 03B44 (Secondary)
Cite as:	arXiv:2412.10917 [cs.RO]
	(or arXiv:2412.10917v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2412.10917

Submission history

From: Minjae Kwon [view email]
[v1] Sat, 14 Dec 2024 18:04:18 UTC (10,517 KB)
[v2] Sat, 17 May 2025 21:14:03 UTC (12,081 KB)

Computer Science > Robotics

Title:Adaptive Reward Design for Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Adaptive Reward Design for Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators