Skip to main content

Showing 1–3 of 3 results for author: Kanav, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.06420  [pdf, other

    cs.AI eess.SY

    Explaining Control Policies through Predicate Decision Diagrams

    Authors: Debraj Chakraborty, Clemens Dubslaff, Sudeep Kanav, Jan Kretinsky, Christoph Weinhuber

    Abstract: Safety-critical controllers of complex systems are hard to construct manually. Automated approaches such as controller synthesis or learning provide a tempting alternative but usually lack explainability. To this end, learning decision trees (DTs) have been prevalently used towards an interpretable model of the generated controllers. However, DTs do not exploit shared decision-making, a key concep… ▽ More

    Submitted 25 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: Extended version of the HSCC 2025 paper

  2. arXiv:2411.13365  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes

    Authors: Muqsit Azeem, Debraj Chakraborty, Sudeep Kanav, Jan Kretinsky

    Abstract: Partially Observable Markov Decision Processes (POMDPs) are a fundamental framework for decision-making under uncertainty and partial observability. Since in general optimal policies may require infinite memory, they are hard to implement and often render most problems undecidable. Consequently, finite-memory policies are mostly considered instead. However, the algorithms for computing them are ty… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: Preprint -- Under Review

  3. arXiv:2410.18293  [pdf, other

    cs.AI cs.LG cs.LO eess.SY

    1-2-3-Go! Policy Synthesis for Parameterized Markov Decision Processes via Decision-Tree Learning and Generalization

    Authors: Muqsit Azeem, Debraj Chakraborty, Sudeep Kanav, Jan Kretinsky, Mohammadsadegh Mohagheghi, Stefanie Mohr, Maximilian Weininger

    Abstract: Despite the advances in probabilistic model checking, the scalability of the verification methods remains limited. In particular, the state space often becomes extremely large when instantiating parameterized Markov decision processes (MDPs) even with moderate values. Synthesizing policies for such \emph{huge} MDPs is beyond the reach of available tools. We propose a learning-based approach to obt… ▽ More

    Submitted 1 April, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: Extended version of the paper accepted at VMCAI 2025