Policy Verification in Stochastic Dynamical Systems Using Logarithmic Neural Certificates

Badings, Thom; Koops, Wietze; Junges, Sebastian; Jansen, Nils

arXiv:2406.00826 (cs)

[Submitted on 2 Jun 2024 (v1), last revised 23 Feb 2025 (this version, v2)]

Abstract: We consider the verification of neural network policies for discrete-time stochastic systems with respect to reach-avoid specifications. We use a learner-verifier procedure that learns a certificate for the specification, represented as a neural network. Verifying that this neural network certificate is a so-called reach-avoid supermartingale (RASM) proves the satisfaction of a reach-avoid specification. Existing approaches for such a verification task rely on computed Lipschitz constants of neural networks. These approaches struggle with large Lipschitz constants, especially for reach-avoid specifications with high threshold probabilities. We present two key contributions to obtain smaller Lipschitz constants than existing approaches. First, we introduce logarithmic RASMs (logRASMs), which take exponentially smaller values than RASMs and hence have lower theoretical Lipschitz constants. Second, we present a fast method to compute tighter upper bounds on Lipschitz constants based on weighted norms. Our empirical evaluation shows we can consistently verify the satisfaction of reach-avoid specifications with probabilities as high as 99.9999%.
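
To illustrate why weighted norms can tighten a Lipschitz bound, here is a minimal sketch. It is not the authors' algorithm. It assumes a small ReLU network without biases and bounds its Lipschitz constant by the product of layer operator norms, measured between weighted infinity-norms ||x||_w = max_i |x_i| / w_i. The function names, the toy matrices W1 and W2, and the hand-picked hidden-layer weights are all hypothetical choices made for this example.

```python
# Sketch under stated assumptions: product-of-norms Lipschitz bound for a
# bias-free ReLU network, using weighted infinity-norms on each layer.
# This is an illustration only, not the method from the paper.

def weighted_op_norm(W, in_w, out_w):
    """Operator norm of matrix W from (R^n, ||.||_in_w) to (R^m, ||.||_out_w),
    where ||x||_w = max_i |x_i| / w_i and all weights are positive.
    It equals max_i (1 / out_w[i]) * sum_j |W[i][j]| * in_w[j]."""
    return max(
        sum(abs(a) * u for a, u in zip(row, in_w)) / v
        for row, v in zip(W, out_w)
    )

def lipschitz_bound(layers, weights):
    """Upper bound on the Lipschitz constant of x -> W_k relu(... relu(W_1 x)).
    weights[0] weighs the input, weights[-1] the output, and the entries in
    between weigh the hidden layers. ReLU acts componentwise, so it is
    1-Lipschitz under any weighted infinity-norm and contributes no factor."""
    bound = 1.0
    for W, in_w, out_w in zip(layers, weights[:-1], weights[1:]):
        bound *= weighted_op_norm(W, in_w, out_w)
    return bound

# Hypothetical toy network: 2 inputs, 2 hidden units, 1 output.
W1 = [[10.0, 0.0], [0.0, 0.1]]
W2 = [[0.1, 10.0]]

# All weights equal to 1 recovers the usual product of infinity-norms.
plain = lipschitz_bound([W1, W2], [[1.0, 1.0], [1.0, 1.0], [1.0]])

# Rescaling the hidden layer (weights chosen by hand here) gives a
# smaller bound for the same input and output norms.
weighted = lipschitz_bound([W1, W2], [[1.0, 1.0], [10.0, 0.1], [1.0]])

print(plain, weighted)
```

Both calls bound the same quantity, because the input and output norms are unchanged and only the norm on the hidden layer differs. In this toy case the unweighted product is about 101, while the weighted one is about 2, which matches the Lipschitz constant of the linear map W2 W1 = [1, 1] in the infinity-norm. How to choose good weights efficiently is the part the paper addresses and is not shown here.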

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2406.00826 [cs.LG]
	(or arXiv:2406.00826v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.00826

Submission history

From: Thom Badings
[v1] Sun, 2 Jun 2024 18:19:19 UTC (1,408 KB)
[v2] Sun, 23 Feb 2025 16:11:34 UTC (1,735 KB)
