Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

Tan, Daniel C. H.; Acero, Fernando; McCarthy, Robert; Kanoulas, Dimitrios; Li, Zhibin

Computer Science > Machine Learning

arXiv:2306.04026 (cs)

[Submitted on 6 Jun 2023 (v1), last revised 5 Dec 2023 (this version, v4)]

Title:Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

Authors:Daniel C.H. Tan, Fernando Acero, Robert McCarthy, Dimitrios Kanoulas, Zhibin Li

View PDF HTML (experimental)

Abstract:Guaranteeing safe behaviour of reinforcement learning (RL) policies poses significant challenges for safety-critical applications, despite RL's generality and scalability. To address this, we propose a new approach to apply verification methods from control theory to learned value functions. By analyzing task structures for safety preservation, we formalize original theorems that establish links between value functions and control barrier functions. Further, we propose novel metrics for verifying value functions in safe control tasks and practical implementation details to improve learning. Our work presents a novel method for certificate learning, which unlocks a diversity of verification techniques from control theory for RL policies, and marks a significant step towards a formal framework for the general, scalable, and verifiable design of RL-based control systems. Code and videos are available at this https url: this https URL

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2306.04026 [cs.LG]
	(or arXiv:2306.04026v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.04026

Submission history

From: Daniel Chee Hian Tan [view email]
[v1] Tue, 6 Jun 2023 21:41:31 UTC (1,589 KB)
[v2] Thu, 8 Jun 2023 22:53:10 UTC (1,642 KB)
[v3] Wed, 12 Jul 2023 09:26:38 UTC (1,642 KB)
[v4] Tue, 5 Dec 2023 10:47:31 UTC (16,919 KB)

Computer Science > Machine Learning

Title:Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators