Skip to main content

Showing 1–21 of 21 results for author: Könighofer, B

.
  1. arXiv:2412.11994  [pdf, other

    cs.AI

    Fairness Shields: Safeguarding against Biased Decision Makers

    Authors: Filip Cano, Thomas A. Henzinger, Bettina Könighofer, Konstantin Kueffner, Kaushik Mallik

    Abstract: As AI-based decision-makers increasingly influence human lives, it is a growing concern that their decisions are often unfair or biased with respect to people's sensitive attributes, such as gender and race. Most existing bias prevention measures provide probabilistic fairness guarantees in the long run, and it is possible that the decisions are biased on specific instances of short decision seque… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: To appear in AAAI 2025

  2. arXiv:2411.07700  [pdf, other

    cs.LG

    Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning

    Authors: Stefan Pranger, Hana Chockler, Martin Tappler, Bettina Könighofer

    Abstract: In many Deep Reinforcement Learning (RL) problems, decisions in a trained policy vary in significance for the expected safety and performance of the policy. Since RL policies are very complex, testing efforts should concentrate on states in which the agent's decisions have the highest impact on the expected outcome. In this paper, we propose a novel model-based method to rigorously compute a ranki… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  3. arXiv:2405.13583  [pdf, other

    cs.LO

    Tools at the Frontiers of Quantitative Verification

    Authors: Roman Andriushchenko, Alexander Bork, Carlos E. Budde, Milan Češka, Kush Grover, Ernst Moritz Hahn, Arnd Hartmanns, Bryant Israelsen, Nils Jansen, Joshua Jeppson, Sebastian Junges, Maximilian A. Köhl, Bettina Könighofer, Jan Křetínský, Tobias Meggendorfer, David Parker, Stefan Pranger, Tim Quatmann, Enno Ruijters, Landon Taylor, Matthias Volk, Maximilian Weininger, Zhen Zhang

    Abstract: The analysis of formal models that include quantitative aspects such as timing or probabilistic choices is performed by quantitative verification tools. Broad and mature tool support is available for computing basic properties such as expected rewards on basic models such as Markov chains. Previous editions of QComp, the comparison of tools for the analysis of quantitative formal models, focused o… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. Safety Shielding under Delayed Observation

    Authors: Filip Cano Córdoba, Alexander Palmisano, Martin Fränzle, Roderick Bloem, Bettina Könighofer

    Abstract: Agents operating in physical environments need to be able to handle delays in the input and output signals since neither data transmission nor sensing or actuating the environment are instantaneous. Shields are correct-by-construction runtime enforcers that guarantee safe execution by correcting any action that may cause a violation of a formal safety specification. Besides providing safety guaran… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 6 pages, Published at ICAPS 2023 (Main Track)

  5. arXiv:2307.01532  [pdf, other

    cs.AI

    Analyzing Intentional Behavior in Autonomous Agents under Uncertainty

    Authors: Filip Cano Córdoba, Samuel Judson, Timos Antonopoulos, Katrine Bjørner, Nicholas Shoemaker, Scott J. Shapiro, Ruzica Piskac, Bettina Könighofer

    Abstract: Principled accountability for autonomous decision-making in uncertain environments requires distinguishing intentional outcomes from negligent designs from actual accidents. We propose analyzing the behavior of autonomous agents through a quantitative measure of the evidence of intentional behavior. We model an uncertain environment as a Markov Decision Process (MDP). For a given scenario, we rely… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 10 pages. Accepted for publication at IJCAI 2023 (Main Track)

  6. arXiv:2306.17204  [pdf, other

    cs.LG cs.FL

    Learning Environment Models with Continuous Stochastic Dynamics

    Authors: Martin Tappler, Edi Muškardin, Bernhard K. Aichernig, Bettina Könighofer

    Abstract: Solving control tasks in complex environments automatically through learning offers great potential. While contemporary techniques from deep reinforcement learning (DRL) provide effective solutions, their decision-making is not transparent. We aim to provide insights into the decisions faced by the agent by learning an automaton model of environmental behavior under the control of an agent. Howeve… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  7. arXiv:2305.05731  [pdf, other

    cs.LO cs.CY cs.PL

    'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

    Authors: Samuel Judson, Matthew Elacqua, Filip Cano, Timos Antonopoulos, Bettina Könighofer, Scott J. Shapiro, Ruzica Piskac

    Abstract: Principled accountability in the aftermath of harms is essential to the trustworthy design and governance of algorithmic decision making. Legal theory offers a paramount method for assessing culpability: putting the agent 'on the stand' to subject their actions and intentions to cross-examination. We show that under minimal assumptions automated reasoning can rigorously interrogate algorithmic beh… ▽ More

    Submitted 29 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

  8. arXiv:2212.01861  [pdf, other

    cs.LG cs.LO

    Online Shielding for Reinforcement Learning

    Authors: Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

    Abstract: Besides the recent impressive results on reinforcement learning (RL), safety is still one of the major research challenges in RL. RL is a machine-learning approach to determine near-optimal policies in Markov decision processes (MDPs). In this paper, we consider the setting where the safety-relevant fragment of the MDP together with a temporal logic safety specification is given and many safety vi… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.09539

  9. arXiv:2212.01838  [pdf, other

    cs.LG cs.LO

    Automata Learning meets Shielding

    Authors: Martin Tappler, Stefan Pranger, Bettina Könighofer, Edi Muškardin, Roderick Bloem, Kim Larsen

    Abstract: Safety is still one of the major research challenges in reinforcement learning (RL). In this paper, we address the problem of how to avoid safety violations of RL agents during exploration in probabilistic and partially unknown environments. Our approach combines automata learning for Markov Decision Processes (MDPs) and shield synthesis in an iterative approach. Initially, the MDP representing th… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  10. arXiv:2208.14426  [pdf, ps, other

    cs.AI cs.LO

    Correct-by-Construction Runtime Enforcement in AI -- A Survey

    Authors: Bettina Könighofer, Roderick Bloem, Rüdiger Ehlers, Christian Pek

    Abstract: Runtime enforcement refers to the theories, techniques, and tools for enforcing correct behavior with respect to a formal specification of systems at runtime. In this paper, we are interested in techniques for constructing runtime enforcers for the concrete application domain of enforcing safety in AI. We discuss how safety is traditionally handled in the field of AI and how more formal guarantees… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

  11. arXiv:2205.04887  [pdf, other

    cs.LG cs.AI cs.SE

    Search-Based Testing of Reinforcement Learning

    Authors: Martin Tappler, Filip Cano Córdoba, Bernhard K. Aichernig, Bettina Könighofer

    Abstract: Evaluation of deep reinforcement learning (RL) is inherently challenging. Especially the opaqueness of learned policies and the stochastic nature of both agents and environments make testing the behavior of deep RL agents difficult. We present a search-based testing framework that enables a wide range of novel analysis capabilities for evaluating the safety and performance of deep RL agents. For s… ▽ More

    Submitted 14 May, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: 11 pages, 15 figures, Accepted at IJCAI-ECAI 2022 (Main Track)

  12. arXiv:2105.12588  [pdf, other

    cs.LO

    TEMPEST -- Synthesis Tool for Reactive Systems and Shields in Probabilistic Environments

    Authors: Stefan Pranger, Bettina Könighofer, Lukas Posch, Roderick Bloem

    Abstract: We present Tempest, a synthesis tool to automatically create correct-by-construction reactive systems and shields from qualitative or quantitative specifications in probabilistic environments. A shield is a special type of reactive system used for run-time enforcement; i.e., a shield enforces a given qualitative or quantitative specification of a running system while interfering with its operation… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  13. arXiv:2012.09539  [pdf, other

    cs.LO

    Online Shielding for Stochastic Systems

    Authors: Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

    Abstract: In this paper, we propose a method to develop trustworthy reinforcement learning systems. To ensure safety especially during exploration, we automatically synthesize a correct-by-construction runtime enforcer, called a shield, that blocks all actions that are unsafe with respect to a temporal logic specification from the agent. Our main contribution is a new synthesis algorithm for computing the s… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: 18 Pages, 6 Figures, under submission

  14. arXiv:2010.03842  [pdf, other

    cs.LO

    Adaptive Shielding under Uncertainty

    Authors: Stefan Pranger, Bettina Könighofer, Martin Tappler, Martin Deixelberger, Nils Jansen, Roderick Bloem

    Abstract: This paper targets control problems that exhibit specific safety and performance requirements. In particular, the aim is to ensure that an agent, operating under uncertainty, will at runtime strictly adhere to such requirements. Previous works create so-called shields that correct an existing controller for the agent if it is about to take unbearable safety risks. However, so far, shields do not c… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: 8 pages, 6 figures, 1 table

  15. arXiv:2006.16688  [pdf, other

    cs.LO cs.LG

    It's Time to Play Safe: Shield Synthesis for Timed Systems

    Authors: Roderick Bloem, Peter Gjøl Jensen, Bettina Könighofer, Kim Guldstrand Larsen, Florian Lorber, Alexander Palmisano

    Abstract: Erroneous behaviour in safety critical real-time systems may inflict serious consequences. In this paper, we show how to synthesize timed shields from timed safety properties given as timed automata. A timed shield enforces the safety of a running system while interfering with the system as little as possible. We present timed post-shields and timed pre-shields. A timed pre-shield is placed before… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Submitted to RV2020

  16. Synthesis of Admissible Shields

    Authors: Laura Humphrey, Bettina Könighofer, Robert Könighofer, Ufuk Topcu

    Abstract: Shield synthesis is an approach to enforce a set of safety-critical properties of a reactive system at runtime. A shield monitors the system and corrects any erroneous output values instantaneously. The shield deviates from the given outputs as little as it can and recovers to hand back control to the system as soon as possible. This paper takes its inspiration from a case study on mission plannin… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Journal ref: Hardware and Software: Verification and Testing - 12th International Haifa Verification Conference, {HVC} 2016, Haifa, Israel, November 14-17, 2016, Proceedings

  17. arXiv:1807.06096  [pdf, other

    cs.AI

    Safe Reinforcement Learning via Probabilistic Shields

    Authors: Nils Jansen, Bettina Könighofer, Sebastian Junges, Alexandru C. Serban, Roderick Bloem

    Abstract: This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent models to capture such planning problems. Reinforcement learning (RL) is a machine learning technique to determine near-optimal policies in MDPs that may be unknown prior to exploring the model. However, during exploration, R… ▽ More

    Submitted 25 November, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

  18. arXiv:1708.08611  [pdf, other

    cs.LO cs.AI cs.LG

    Safe Reinforcement Learning via Shielding

    Authors: Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu

    Abstract: Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily guarantee safety during learning or execution phases. We introduce a new approach to learn optimal policies while enforcing properties expressed in temporal logic. To this end, given the temporal logic specification that is to be obeyed by the learning system, we propose to synthesize a reactive system… ▽ More

    Submitted 3 September, 2017; v1 submitted 29 August, 2017; originally announced August 2017.

  19. arXiv:1501.02573  [pdf, ps, other

    cs.LO

    Shield Synthesis: Runtime Enforcement for Reactive Systems

    Authors: Roderick Bloem, Bettina Koenighofer, Robert Koenighofer, Chao Wang

    Abstract: Scalability issues may prevent users from verifying critical properties of a complex hardware design. In this situation, we propose to synthesize a "safety shield" that is attached to the design to enforce the properties at run time. Shield synthesis can succeed where model checking and reactive synthesis fail, because it only considers a small set of critical properties, as opposed to the complex… ▽ More

    Submitted 16 January, 2015; v1 submitted 12 January, 2015; originally announced January 2015.

    Comments: This is an extended version of [5], featuring an additional appendix

  20. arXiv:1308.4767  [pdf, other

    cs.LO

    Synthesizing Multiple Boolean Functions using Interpolation on a Single Proof

    Authors: Georg Hofferek, Ashutosh Gupta, Bettina Könighofer, Jie-Hong Roland Jiang, Roderick Bloem

    Abstract: It is often difficult to correctly implement a Boolean controller for a complex system, especially when concurrency is involved. Yet, it may be easy to formally specify a controller. For instance, for a pipelined processor it suffices to state that the visible behavior of the pipelined system should be identical to a non-pipelined reference system (Burch-Dill paradigm). We present a novel procedur… ▽ More

    Submitted 22 August, 2013; originally announced August 2013.

    Comments: This paper originally appeared in FMCAD 2013, http://www.cs.utexas.edu/users/hunt/FMCAD/FMCAD13/index.shtml. This version includes an appendix that is missing in the conference version

  21. Synthesizing Robust Systems with RATSY

    Authors: Roderick Bloem, Hans-Jürgen Gamauf, Georg Hofferek, Bettina Könighofer, Robert Könighofer

    Abstract: Specifications for reactive systems often consist of environment assumptions and system guarantees. An implementation should not only be correct, but also robust in the sense that it behaves reasonably even when the assumptions are (temporarily) violated. We present an extension of the requirements analysis and synthesis tool RATSY that is able to synthesize robust systems from GR(1) specification… ▽ More

    Submitted 3 July, 2012; originally announced July 2012.

    Comments: In Proceedings SYNT 2012, arXiv:1207.0554

    Journal ref: EPTCS 84, 2012, pp. 47-53