Skip to main content

Showing 1–50 of 104 results for author: Abate, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17602  [pdf, ps, other

    cs.LO eess.SY

    ARCH-COMP25 Category Report: Stochastic Models

    Authors: Alessandro Abate, Omid Akbarzadeh, Henk A. P. Blom, Sofie Haesaert, Sina Hassani, Abolfazl Lavaei, Frederik Baymler Mathiesen, Rahul Misra, Amy Nejati, Mathis Niehage, Fie Ørum, Anne Remke, Behrad Samari, Ruohan Wang, Rafal Wisniewski, Ben Wooding, Mahdieh Zaker

    Abstract: This report is concerned with a friendly competition for formal verification and policy synthesis of stochastic models. The main goal of the report is to introduce new benchmarks and their properties within this category and recommend next steps toward next year's edition of the competition. In particular, this report introduces three recently developed software tools, a new water distribution net… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  2. arXiv:2505.15497  [pdf, ps, other

    cs.LG eess.SY

    Certified Neural Approximations of Nonlinear Dynamics

    Authors: Frederik Baymler Mathiesen, Nikolaus Vertovec, Francesco Fabiano, Luca Laurenti, Alessandro Abate

    Abstract: Neural networks hold great potential to act as approximate models of nonlinear dynamical systems, with the resulting neural approximations enabling verification and control of such systems. However, in safety-critical contexts, the use of neural approximations requires formal bounds on their closeness to the underlying system. To address this fundamental challenge, we propose a novel, adaptive, an… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: first and second author contributed equally

  3. arXiv:2504.12914  [pdf, other

    cs.CY

    In Which Areas of Technical AI Safety Could Geopolitical Rivals Cooperate?

    Authors: Ben Bucknall, Saad Siddiqui, Lara Thurnherr, Conor McGurk, Ben Harack, Anka Reuel, Patricia Paskov, Casey Mahoney, Sören Mindermann, Scott Singer, Vinay Hiremath, Charbel-Raphaël Segerie, Oscar Delaney, Alessandro Abate, Fazl Barez, Michael K. Cohen, Philip Torr, Ferenc Huszár, Anisoara Calinescu, Gabriel Davis Jones, Yoshua Bengio, Robert Trager

    Abstract: International cooperation is common in AI research, including between geopolitical rivals. While many experts advocate for greater international cooperation on AI safety to address shared global risks, some view cooperation on AI with suspicion, arguing that it can pose unacceptable risks to national security. However, the extent to which cooperation on AI safety poses such risks, as well as provi… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Accepted to ACM Conference on Fairness, Accountability, and Transparency (FAccT 2025)

  4. arXiv:2504.12246  [pdf, other

    cs.LO

    Branching Bisimulation Learning

    Authors: Alessandro Abate, Mirco Giacobbe, Christian Micheletti, Yannik Schnitzer

    Abstract: We introduce a bisimulation learning algorithm for non-deterministic transition systems. We generalise bisimulation learning to systems with bounded branching and extend its applicability to model checking branching-time temporal logic, while previously it was limited to deterministic systems and model checking linear-time properties. Our method computes a finite stutter-insensitive bisimulation q… ▽ More

    Submitted 22 May, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  5. arXiv:2504.06386  [pdf, ps, other

    cs.LG

    SPoRt -- Safe Policy Ratio: Certified Training and Deployment of Task Policies in Model-Free RL

    Authors: Jacques Cloete, Nikolaus Vertovec, Alessandro Abate

    Abstract: To apply reinforcement learning to safety-critical applications, we ought to provide safety guarantees during both policy training and deployment. In this work, we present theoretical results that place a bound on the probability of violating a safety property for a new task-specific policy in a model-free, episodic setting. This bound, based on a maximum policy ratio computed with respect to a 's… ▽ More

    Submitted 23 June, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: 9 pages + 16 pages supplementary material, 3 figures + 6 figures supplementary material

  6. arXiv:2504.05065  [pdf, other

    cs.LO eess.SY

    Quantitative Supermartingale Certificates

    Authors: Alessandro Abate, Mirco Giacobbe, Diptarko Roy

    Abstract: We introduce a general methodology for quantitative model checking and control synthesis with supermartingale certificates. We show that every specification that is invariant to time shifts admits a stochastic invariant that bounds its probability from below; for systems with general state space, the stochastic invariant bounds this probability as closely as desired; for systems with finite state… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: To appear at CAV'25

  7. arXiv:2504.00484  [pdf, other

    eess.SY cs.CE

    Aggregate Flexibility of Thermostatically Controlled Loads using Generalized Polymatroids

    Authors: Karan Mukhi, Alessandro Abate

    Abstract: Leveraging populations of thermostatically controlled loads could provide vast storage capacity to the grid. To realize this potential, their flexibility must be accurately aggregated and represented to the system operator as a single, controllable virtual device. Mathematically this is computed by calculating the Minkowski sum of the individual flexibility of each of the devices. Previous work sh… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  8. arXiv:2503.23912  [pdf, other

    eess.SY cs.LG math.OC

    Certified Approximate Reachability (CARe): Formal Error Bounds on Deep Learning of Reachable Sets

    Authors: Prashant Solanki, Nikolaus Vertovec, Yannik Schnitzer, Jasper Van Beers, Coen de Visser, Alessandro Abate

    Abstract: Recent approaches to leveraging deep learning for computing reachable sets of continuous-time dynamical systems have gained popularity over traditional level-set methods, as they overcome the curse of dimensionality. However, as with level-set methods, considerable care needs to be taken in limiting approximation errors, particularly since no guarantees are provided during training on the accuracy… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

  9. arXiv:2503.23458  [pdf, ps, other

    eess.SY cs.CE

    Exact Characterization of Aggregate Flexibility via Generalized Polymatroids

    Authors: Karan Mukhi, Georg Loho, Alessandro Abate

    Abstract: It is well established that the aggregate flexibility inherent in populations of distributed energy resources (DERs) can be leveraged to mitigate the intermittency and uncertainty associated with renewable generation, while also providing ancillary grid services. To enable this, aggregators must effectively represent the flexibility in the populations they control to the market or system operator.… ▽ More

    Submitted 17 June, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

  10. arXiv:2503.09400  [pdf, other

    cs.MA

    Networked Communication for Decentralised Cooperative Agents in Mean-Field Control

    Authors: Patrick Benjamin, Alessandro Abate

    Abstract: We introduce networked communication to mean-field control (MFC) - the cooperative counterpart to mean-field games (MFGs) - and in particular to the setting where decentralised agents learn online from a single, non-episodic run of the empirical system. We adapt recent algorithms for MFGs to this new setting, as well as contributing a novel sub-routine allowing networked agents to estimate the glo… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  11. arXiv:2502.12042  [pdf, other

    cs.GT

    Multi-agent coordination via communication partitions

    Authors: Wei-Chen Lee, Alessandro Abate, Michael Wooldridge

    Abstract: Coordinating the behaviour of self-interested agents in the presence of multiple Nash equilibria is a major research challenge for multi-agent systems. Pre-game communication between all the players can aid coordination in cases where the Pareto-optimal payoff is unique, but can lead to deadlocks when there are multiple payoffs on the Pareto frontier. We consider a communication partition, where o… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  12. arXiv:2502.02470  [pdf, other

    cs.LG cs.AI

    Modular Training of Neural Networks aids Interpretability

    Authors: Satvik Golechha, Maheep Chaudhary, Joan Velja, Alessandro Abate, Nandi Schoots

    Abstract: An approach to improve neural network interpretability is via clusterability, i.e., splitting a model into disjoint clusters that can be studied independently. We define a measure for clusterability and show that pre-trained models form highly enmeshed clusters via spectral graph clustering. We thus train models to be more modular using a "clusterability loss" function that encourages the formatio… ▽ More

    Submitted 6 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 8 pages, under review. arXiv admin note: text overlap with arXiv:2409.15747 (author note: this is an extension of that workshop paper but has different authors)

  13. arXiv:2412.12480  [pdf, other

    cs.LG cs.AI

    Subversion Strategy Eval: Can language models statelessly strategize to subvert control protocols?

    Authors: Alex Mallen, Charlie Griffin, Misha Wagner, Alessandro Abate, Buck Shlegeris

    Abstract: An AI control protocol is a plan for usefully deploying AI systems that aims to prevent an AI from intentionally causing some unacceptable outcome. This paper investigates how well AI systems can generate and act on their own strategies for subverting control protocols whilst operating statelessly (without shared memory between contexts). To do this, an AI system may need to reliably generate opti… ▽ More

    Submitted 4 April, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

  14. arXiv:2412.11155  [pdf, ps, other

    cs.LG cs.AI

    Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

    Authors: Joar Skalse, Alessandro Abate

    Abstract: The aim of inverse reinforcement learning (IRL) is to infer an agent's preferences from observing their behaviour. Usually, preferences are modelled as a reward function, $R$, and behaviour is modelled as a policy, $π$. One of the central difficulties in IRL is that multiple preferences may lead to the same observed behaviour. That is, $R$ is typically underdetermined by $π$, which means that $R$… ▽ More

    Submitted 15 December, 2024; originally announced December 2024.

  15. arXiv:2411.19729  [pdf, other

    cs.LG

    Risk-Averse Certification of Bayesian Neural Networks

    Authors: Xiyue Zhang, Zifan Wang, Yulong Gao, Licio Romao, Alessandro Abate, Marta Kwiatkowska

    Abstract: In light of the inherently complex and dynamic nature of real-world environments, incorporating risk measures is crucial for the robustness evaluation of deep learning models. In this work, we propose a Risk-Averse Certification framework for Bayesian neural networks called RAC-BNN. Our method leverages sampling and optimisation to compute a sound approximation of the output set of a BNN, represen… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

  16. arXiv:2411.15951  [pdf, other

    cs.LG cs.AI

    Partial Identifiability and Misspecification in Inverse Reinforcement Learning

    Authors: Joar Skalse, Alessandro Abate

    Abstract: The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function $R$ from a policy $π$. This problem is difficult, for several reasons. First of all, there are typically multiple reward functions which are compatible with a given policy; this means that the reward function is only *partially identifiable*, and that IRL contains a certain fundamental degree of ambiguity. Secondly, in o… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

  17. arXiv:2410.07812  [pdf, other

    cs.LG cs.AI

    Temporal-Difference Variational Continual Learning

    Authors: Luckeciano C. Melo, Alessandro Abate, Yarin Gal

    Abstract: Machine Learning models in real-world applications must continuously learn new tasks to adapt to shifts in the data-generating distribution. Yet, for Continual Learning (CL), models often struggle to balance learning new tasks (plasticity) with retaining previous knowledge (memory stability). Consequently, they are susceptible to Catastrophic Forgetting, which degrades performance and undermines t… ▽ More

    Submitted 14 May, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

  18. arXiv:2410.04631  [pdf, other

    cs.AI cs.LG

    DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL

    Authors: Mathias Jackermeier, Alessandro Abate

    Abstract: Linear temporal logic (LTL) has recently been adopted as a powerful formalism for specifying complex, temporally extended tasks in multi-task reinforcement learning (RL). However, learning policies that efficiently satisfy arbitrary specifications not observed during training remains a challenging problem. Existing approaches suffer from several shortcomings: they are often only applicable to fini… ▽ More

    Submitted 29 March, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: ICLR'25 (Oral)

  19. arXiv:2409.07985  [pdf, other

    cs.AI cs.LG

    Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

    Authors: Charlie Griffin, Louis Thomson, Buck Shlegeris, Alessandro Abate

    Abstract: To evaluate the safety and usefulness of deployment protocols for untrusted AIs, AI Control uses a red-teaming exercise played between a protocol designer and an adversary. This paper introduces AI-Control Games, a formal decision-making model of the red-teaming exercise as a multi-objective, partially observable, stochastic game. We also introduce methods for finding optimal protocols in AI-Contr… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 7 pages, with appendices

  20. arXiv:2408.11607  [pdf, other

    cs.MA cs.AI cs.GT cs.LG eess.SY

    Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation

    Authors: Patrick Benjamin, Alessandro Abate

    Abstract: Recent algorithms allow decentralised agents, possibly connected via a communication network, to learn equilibria in Mean-Field Games from a non-episodic run of the empirical system. However, these algorithms are for tabular settings: this computationally limits the size of agents' observation space, meaning the algorithms cannot handle anything but small state spaces, nor generalise beyond polici… ▽ More

    Submitted 13 March, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

  21. arXiv:2408.03093  [pdf, other

    cs.LG cs.AI eess.SY

    Certifiably Robust Policies for Uncertain Parametric Environments

    Authors: Yannik Schnitzer, Alessandro Abate, David Parker

    Abstract: We present a data-driven approach for producing policies that are provably robust across unknown stochastic environments. Existing approaches can learn models of a single environment as an interval Markov decision processes (IMDP) and produce a robust policy with a probably approximately correct (PAC) guarantee on its performance. However these are unable to reason about the impact of environmenta… ▽ More

    Submitted 23 March, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

  22. arXiv:2407.10971  [pdf, other

    cs.LG

    Walking the Values in Bayesian Inverse Reinforcement Learning

    Authors: Ondrej Bajgar, Alessandro Abate, Konstantinos Gatsis, Michael A. Osborne

    Abstract: The goal of Bayesian inverse reinforcement learning (IRL) is recovering a posterior distribution over reward functions using a set of demonstrations from an expert optimizing for a reward unknown to the learner. The resulting posterior over rewards can then be used to synthesize an apprentice policy that performs well on the same or a similar task. A key challenge in Bayesian IRL is bridging the c… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Published at the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  23. arXiv:2406.15753  [pdf, other

    cs.LG cs.AI stat.ML

    The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

    Authors: Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

    Abstract: In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error on the data distribution, and yet subsequently produce a policy with large regret. We say that such a reward model has an error-regret mismatch. The main source… ▽ More

    Submitted 4 March, 2025; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 70 pages, 4 figures

  24. arXiv:2406.10023  [pdf, other

    cs.LG cs.CL stat.ML

    Deep Bayesian Active Learning for Preference Modeling in Large Language Models

    Authors: Luckeciano C. Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal

    Abstract: Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at large scale. Hence, selecting the most informative points for acquiring human feedback may considerably reduce the cost of preference labeling and unleash the furth… ▽ More

    Submitted 28 October, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  25. arXiv:2405.17304  [pdf, ps, other

    cs.LO eess.SY

    Stochastic Omega-Regular Verification and Control with Supermartingales

    Authors: Alessandro Abate, Mirco Giacobbe, Diptarko Roy

    Abstract: We present for the first time a supermartingale certificate for $ω$-regular specifications. We leverage the Robbins & Siegmund convergence theorem to characterize supermartingale certificates for the almost-sure acceptance of Streett conditions on general stochastic processes, which we call Streett supermartingales. This enables effective verification and control of discrete-time stochastic dynami… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: The conference version of this manuscript appeared at CAV'24

  26. arXiv:2405.15723  [pdf, other

    cs.LO cs.LG

    Bisimulation Learning

    Authors: Alessandro Abate, Mirco Giacobbe, Yannik Schnitzer

    Abstract: We introduce a data-driven approach to computing finite bisimulations for state transition systems with very large, possibly infinite state space. Our novel technique computes stutter-insensitive bisimulations of deterministic systems, which we characterize as the problem of learning a state classifier together with a ranking function for each class. Our procedure learns a candidate state classifi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  27. Robust Aggregation of Electric Vehicle Flexiblity

    Authors: Karan Mukhi, Chengrui Qu, Pengcheng You, Alessandro Abate

    Abstract: We address the problem of characterizing the aggregate flexibility in populations of electric vehicles (EVs) with uncertain charging requirements. Extending upon prior results that provide exact characterizations of aggregate flexibility in populations of electric vehicle (EVs), we adapt the framework to encompass more general charging requirements. In doing so we give a characterization of the ex… ▽ More

    Submitted 11 March, 2025; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, conference

  28. arXiv:2405.06624  [pdf, other

    cs.AI

    Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum

    Abstract: Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro… ▽ More

    Submitted 8 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  29. arXiv:2404.18813  [pdf, other

    eess.SY cs.LG cs.LO

    Safe Reach Set Computation via Neural Barrier Certificates

    Authors: Alessandro Abate, Sergiy Bogomolov, Alec Edwards, Kostiantyn Potomkin, Sadegh Soudjani, Paolo Zuliani

    Abstract: We present a novel technique for online safety verification of autonomous systems, which performs reachability analysis efficiently for both bounded and unbounded horizons by employing neural barrier certificates. Our approach uses barrier certificates given by parameterized neural networks that depend on a given initial set, unsafe sets, and time horizon. Such networks are trained efficiently off… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: IFAC Conference on Analysis and Design of Hybrid Systems

  30. arXiv:2404.03314  [pdf, other

    cs.GT eess.SY

    Learning to Bid in Forward Electricity Markets Using a No-Regret Algorithm

    Authors: Arega Getaneh Abate, Dorsa Majdi, Jalal Kazempour, Maryam Kamgarpour

    Abstract: It is a common practice in the current literature of electricity markets to use game-theoretic approaches for strategic price bidding. However, they generally rely on the assumption that the strategic bidders have prior knowledge of rival bids, either perfectly or with some uncertainty. This is not necessarily a realistic assumption. This paper takes a different approach by relaxing such an assump… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  31. arXiv:2403.15398  [pdf

    cs.CY

    An International and Multidisciplinary Teaching Experience with Real Industrial Team Project Development

    Authors: Martin Mellado, Eduardo Vendrell, Filomena Ferrucci, Andrea Abate, Detlef Zuhlke, Bernard Riera

    Abstract: This paper presents the design, objectives, experiences, and results of an international cooperation project funded by the European Commission in the context of the Erasmus Intensive Programme (IP, for short) designed to improve students' curricula. An IP is a short programme of study (minimum 2 weeks) that brings together university students and staff from at least three countries in order to enc… ▽ More

    Submitted 17 February, 2024; originally announced March 2024.

    Comments: 21 pages

  32. arXiv:2403.06854  [pdf, other

    cs.LG

    Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

    Authors: Joar Skalse, Alessandro Abate

    Abstract: Inverse reinforcement learning (IRL) aims to infer an agent's preferences (represented as a reward function $R$) from their behaviour (represented as a policy $π$). To do this, we need a behavioural model of how $π$ relates to $R$. In the current literature, the most common behavioural models are optimality, Boltzmann-rationality, and causal entropy maximisation. However, the true relationship bet… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  33. arXiv:2401.15838  [pdf, other

    stat.ML cs.LG cs.MA math.OC stat.CO

    Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

    Authors: Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

    Abstract: Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literatur… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  34. arXiv:2401.14811  [pdf, ps, other

    cs.AI cs.LG

    On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks

    Authors: Joar Skalse, Alessandro Abate

    Abstract: In this paper, we study the expressivity of scalar, Markovian reward functions in Reinforcement Learning (RL), and identify several limitations to what they can express. Specifically, we look at three classes of RL tasks; multi-objective RL, risk-sensitive RL, and modal RL. For each class, we derive necessary and sufficient conditions that describe when a problem in this class can be expressed usi… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, PMLR 216:1974-1984, 2023

  35. arXiv:2312.11314  [pdf, other

    cs.LG cs.LO eess.SY

    Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

    Authors: Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate

    Abstract: This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. In a variety of RL applications the safety of the agent is particularly important, e.g. autonomous platforms or robots that work in proximity of humans. As enforcing safety during training might severely limit th… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  36. arXiv:2312.06344  [pdf, other

    eess.SY cs.LO

    Learning Robust Policies for Uncertain Parametric Markov Decision Processes

    Authors: Luke Rickard, Alessandro Abate, Kostas Margellos

    Abstract: Synthesising verifiably correct controllers for dynamical systems is crucial for safety-critical problems. To achieve this, it is important to account for uncertainty in a robust manner, while at the same time it is often of interest to avoid being overly conservative with the view of achieving a better cost. We propose a method for verifiably safe policy synthesis for a class of finite state mode… ▽ More

    Submitted 15 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 10 pages, accepted for oral presentation at L4DC

  37. arXiv:2311.09793  [pdf, other

    eess.SY cs.LG cs.LO

    Fossil 2.0: Formal Certificate Synthesis for the Verification and Control of Dynamical Models

    Authors: Alec Edwards, Andrea Peruffo, Alessandro Abate

    Abstract: This paper presents Fossil 2.0, a new major release of a software tool for the synthesis of certificates (e.g., Lyapunov and barrier functions) for dynamical systems modelled as ordinary differential and difference equations. Fossil 2.0 is much improved from its original release, including new interfaces, a significantly expanded certificate portfolio, controller synthesis and enhanced extensibili… ▽ More

    Submitted 16 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: HSCC 2024 Tool Paper

  38. arXiv:2311.09786  [pdf, other

    eess.SY cs.AI cs.LO

    Correct-by-Construction Control for Stochastic and Uncertain Dynamical Models via Formal Abstractions

    Authors: Thom Badings, Nils Jansen, Licio Romao, Alessandro Abate

    Abstract: Automated synthesis of correct-by-construction controllers for autonomous systems is crucial for their deployment in safety-critical scenarios. Such autonomous systems are naturally modeled as stochastic dynamical models. The general problem is to compute a controller that provably satisfies a given task, represented as a probabilistic temporal logic specification. However, factors such as stochas… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: In Proceedings FMAS 2023, arXiv:2311.08987. arXiv admin note: text overlap with arXiv:2301.01526

    Journal ref: EPTCS 395, 2023, pp. 144-152

  39. arXiv:2310.01951  [pdf, other

    cs.LG cs.AI

    Probabilistic Reach-Avoid for Bayesian Neural Networks

    Authors: Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate, Marta Kwiatkowska

    Abstract: Model-based reinforcement learning seeks to simultaneously learn the dynamics of an unknown stochastic environment and synthesise an optimal policy for acting in it. Ensuring the safety and robustness of sequential decisions made through a policy in such an environment is a key challenge for policies intended for safety-critical scenarios. In this work, we investigate two complementary problems: f… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 47 pages, 10 figures. arXiv admin note: text overlap with arXiv:2105.10134

  40. arXiv:2309.15257  [pdf, other

    cs.LG cs.AI

    STARC: A General Framework For Quantifying Differences Between Reward Functions

    Authors: Joar Skalse, Lucy Farnik, Sumeet Ramesh Motwani, Erik Jenner, Adam Gleave, Alessandro Abate

    Abstract: In order to solve a task using reinforcement learning, it is necessary to first formalise the goal of that task as a reward function. However, for many real-world tasks, it is very difficult to manually specify a reward function that never incentivises undesirable behaviour. As a result, it is increasingly popular to use reward learning algorithms, which attempt to learn a reward function from dat… ▽ More

    Submitted 12 December, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  41. arXiv:2309.06090  [pdf, other

    eess.SY cs.LG cs.LO

    A General Framework for Verification and Control of Dynamical Models via Certificate Synthesis

    Authors: Alec Edwards, Andrea Peruffo, Alessandro Abate

    Abstract: An emerging branch of control theory specialises in certificate learning, concerning the specification of a desired (possibly complex) system behaviour for an autonomous or control model, which is then analytically verified by means of a function-based proof. However, the synthesis of controllers abiding by these complex requirements is in general a non-trivial task and may elude the most expert c… ▽ More

    Submitted 28 October, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  42. arXiv:2308.10587  [pdf, other

    cs.FL

    Formal Analysis and Verification of Max-Plus Linear Systems

    Authors: Muhammad Syifa'ul Mufid, Andrea Micheli, Alessandro Abate, Alessandro Cimatti

    Abstract: Max-Plus Linear (MPL) systems are an algebraic formalism with practical applications in transportation networks, manufacturing and biological systems. In this paper, we investigate the problem of automatically analyzing the properties of MPL, taking into account both structural properties such as transient and cyclicity, and the open problem of user-defined temporal properties. We propose Time-Dif… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 28 pages (including appendixes)

  43. arXiv:2307.15546  [pdf, other

    cs.LO cs.LG eess.SY

    On the Trade-off Between Efficiency and Precision of Neural Abstraction

    Authors: Alec Edwards, Mirco Giacobbe, Alessandro Abate

    Abstract: Neural abstractions have been recently introduced as formal approximations of complex, nonlinear dynamical models. They comprise a neural ODE and a certified upper bound on the error between the abstract neural network and the concrete dynamical model. So far neural abstractions have exclusively been obtained as neural networks consisting entirely of $ReLU$ activation functions, resulting in neura… ▽ More

    Submitted 2 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Appeared at QEST 2023. Added codebase link; corrected Eq. 11

  44. arXiv:2307.05059  [pdf, ps, other

    cs.GT cs.AI cs.MA

    On Imperfect Recall in Multi-Agent Influence Diagrams

    Authors: James Fox, Matt MacDermott, Lewis Hammond, Paul Harrenstein, Alessandro Abate, Michael Wooldridge

    Abstract: Multi-agent influence diagrams (MAIDs) are a popular game-theoretic model based on Bayesian networks. In some settings, MAIDs offer significant advantages over extensive-form game representations. Previous work on MAIDs has assumed that agents employ behavioural policies, which set independent conditional probability distributions over actions for each of their decisions. In settings with imperfec… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: In Proceedings TARK 2023, arXiv:2307.04005

    Journal ref: EPTCS 379, 2023, pp. 201-220

  45. arXiv:2306.02766  [pdf, other

    cs.MA cs.AI cs.LG cs.SI eess.SY

    Networked Communication for Decentralised Agents in Mean-Field Games

    Authors: Patrick Benjamin, Alessandro Abate

    Abstract: We introduce networked communication to the mean-field game framework, in particular to oracle-free settings where $N$ decentralised agents learn along a single, non-episodic run of the empirical system. We prove that our architecture has sample guarantees bounded between those of the centralised- and independent-learning cases. We provide the order of the difference in these bounds in terms of ne… ▽ More

    Submitted 13 March, 2025; v1 submitted 5 June, 2023; originally announced June 2023.

  46. arXiv:2303.17618  [pdf, other

    cs.LG eess.SY

    Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

    Authors: Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

    Abstract: We introduce an adaptive refinement procedure for smart, and scalable abstraction of dynamical systems. Our technique relies on partitioning the state space depending on the observation of future outputs. However, this knowledge is dynamically constructed in an adaptive, asymmetric way. In order to learn the optimal structure, we define a Kantorovich-inspired metric between Markov chains, and we u… ▽ More

    Submitted 30 October, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: This paper is an extended version of a CDC2023 submission

  47. arXiv:2303.13657  [pdf, other

    math.OC cs.LG

    Policy Evaluation in Distributional LQR

    Authors: Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

    Abstract: Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard RL. At the same time, a main challenge in DRL is that policy evaluation in DRL typically relies on the representation of the return distribution, which needs to be carefu… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 12pages

  48. arXiv:2302.13888  [pdf, other

    cs.GT cs.CC

    k-Prize Weighted Voting Games

    Authors: Wei-Chen Lee, David Hyland, Alessandro Abate, Edith Elkind, Jiarui Gan, Julian Gutierrez, Paul Harrenstein, Michael Wooldridge

    Abstract: We introduce a natural variant of weighted voting games, which we refer to as k-Prize Weighted Voting Games. Such games consist of n players with weights, and k prizes, of possibly differing values. The players form coalitions, and the i-th largest coalition (by the sum of weights of its members) wins the i-th largest prize, which is then shared among its members. We present four solution concepts… ▽ More

    Submitted 2 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to AAMAS 2023

  49. arXiv:2301.11683  [pdf, other

    cs.LO cs.LG eess.SY

    Neural Abstractions

    Authors: Alessandro Abate, Alec Edwards, Mirco Giacobbe

    Abstract: We present a novel method for the safety verification of nonlinear dynamical models that uses neural networks to represent abstractions of their dynamics. Neural networks have extensively been used before as approximators; in this work, we make a step further and use them for the first time as abstractions. For a given dynamical model, our method synthesises a neural network that overapproximates… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022

  50. Quantitative Verification with Neural Networks

    Authors: Alessandro Abate, Alec Edwards, Mirco Giacobbe, Hashan Punchihewa, Diptarko Roy

    Abstract: We present a data-driven approach to the quantitative verification of probabilistic programs and stochastic dynamical models. Our approach leverages neural networks to compute tight and sound bounds for the probability that a stochastic process hits a target condition within finite time. This problem subsumes a variety of quantitative verification questions, from the reachability and safety analys… ▽ More

    Submitted 29 May, 2025; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: The conference version of this manuscript appeared at CONCUR 2023

    ACM Class: F.3.1; D.2.4