Skip to main content

Showing 1–17 of 17 results for author: Koehler, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.08322  [pdf, other

    cs.LG cs.AI

    Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

    Authors: Hector Kohler, Quentin Delfosse, Waris Radji, Riad Akrour, Philippe Preux

    Abstract: There exist applications of reinforcement learning like medicine where policies need to be ''interpretable'' by humans. User studies have shown that some policy classes might be more interpretable than others. However, it is costly to conduct human studies of policy interpretability. Furthermore, there is no clear definition of policy interpretabiliy, i.e., no clear metrics for interpretability an… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 12 pages of main text, under review

  2. arXiv:2501.17379  [pdf

    cs.DS cs.DB

    Stable Tree Labelling for Accelerating Distance Queries on Dynamic Road Networks

    Authors: Henning Koehler, Muhammad Farhan, Qing Wang

    Abstract: Finding the shortest-path distance between two arbitrary vertices is an important problem in road networks. Due to real-time traffic conditions, road networks undergo dynamic changes all the time. Current state-of-the-art methods incrementally maintain a distance labelling based on a hierarchy among vertices to support efficient distance computation. However, their labelling sizes are often large… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  3. arXiv:2405.14956  [pdf, other

    cs.AI cs.LG

    Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning

    Authors: Hector Kohler, Quentin Delfosse, Riad Akrour, Kristian Kersting, Philippe Preux

    Abstract: Deep reinforcement learning agents are prone to goal misalignments. The black-box nature of their policies hinders the detection and correction of such misalignments, and the trust necessary for real-world deployment. So far, solutions learning interpretable policies are inefficient or require many human priors. We propose INTERPRETER, a fast distillation method producing INTerpretable Editable tR… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. PID Tuning using Cross-Entropy Deep Learning: a Lyapunov Stability Analysis

    Authors: Hector Kohler, Benoit Clement, Thomas Chaffre, Gilles Le Chenadec

    Abstract: Underwater Unmanned Vehicles (UUVs) have to constantly compensate for the external disturbing forces acting on their body. Adaptive Control theory is commonly used there to grant the control law some flexibility in its response to process variation. Today, learning-based (LB) adaptive methods are leading the field where model-based control structures are combined with deep model-free learning algo… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Journal ref: IFAC-PapersOnLine, Volume 55, Issue 31, 2022

  5. arXiv:2404.10906  [pdf, other

    cs.AI cs.HC cs.LG cs.SC

    Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop

    Authors: Hector Kohler, Quentin Delfosse, Paul Festor, Philippe Preux

    Abstract: Embracing the pursuit of intrinsically explainable reinforcement learning raises crucial questions: what distinguishes explainability from interpretability? Should explainable and interpretable agents be developed outside of domains where transparency is imperative? What advantages do interpretable policies offer over neural networks? How can we rigorously define and measure interpretability in po… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  6. arXiv:2311.11063  [pdf, other

    cs.DS cs.DB

    Hierarchical Cut Labelling -- Scaling Up Distance Queries on Road Networks

    Authors: Muhammad Farhan, Henning Koehler, Robert Ohms, Qing Wang

    Abstract: Answering the shortest-path distance between two arbitrary locations is a fundamental problem in road networks. Labelling-based solutions are the current state-of-the-arts to render fast response time, which can generally be categorised into hub-based labellings, highway-based labellings, and tree decomposition labellings. Hub-based and highway-based labellings exploit hierarchical structures of r… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  7. arXiv:2309.13365  [pdf, other

    cs.LG cs.AI

    Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: Interpretability of AI models allows for user safety checks to build trust in such AIs. In particular, Decision Trees (DTs) provide a global look at the learned model and transparently reveal which features of the input are critical for making a decision. However, interpretability is hindered if the DT is too large. To learn compact trees, a recent Reinforcement Learning (RL) framework has been pr… ▽ More

    Submitted 21 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: To be included in an other submission. arXiv admin note: text overlap with arXiv:2304.05839

  8. Breiman meets Bellman: Non-Greedy Decision Trees with MDPs

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: In supervised learning, decision trees are valued for their interpretability and performance. While greedy decision tree algorithms like CART remain widely used due to their computational efficiency, they often produce sub-optimal solutions with respect to a regularized training loss. Conversely, optimal decision tree methods can find better solutions but are computationally intensive and typicall… ▽ More

    Submitted 1 June, 2025; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: ACM SIGKDD 2025, 12 pages

  9. arXiv:2306.10882  [pdf, other

    cs.LG stat.ML

    AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents

    Authors: Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

    Abstract: Recently, the scientific community has questioned the statistical reproducibility of many empirical results, especially in the field of machine learning. To contribute to the resolution of this reproducibility crisis, we propose a theoretically sound methodology for comparing the performance of a set of algorithms. We exemplify our methodology in Deep Reinforcement Learning (Deep RL). The performa… ▽ More

    Submitted 12 December, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Journal ref: TMLR 2024

  10. arXiv:2305.09385  [pdf, other

    stat.ML cs.LG

    Lp- and Risk Consistency of Localized SVMs

    Authors: Hannes Köhler

    Abstract: Kernel-based regularized risk minimizers, also called support vector machines (SVMs), are known to possess many desirable properties but suffer from their super-linear computational requirements when dealing with large data sets. This problem can be tackled by using localized SVMs instead, which also offer the additional advantage of being able to apply different hyperparameters to different regio… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 29 pages, 1 figure

  11. arXiv:2304.05839  [pdf, other

    cs.LG cs.AI

    Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning

    Authors: Hector Kohler, Riad Akrour, Philippe Preux

    Abstract: Interpretability of AI models allows for user safety checks to build trust in these models. In particular, decision trees (DTs) provide a global view on the learned model and clearly outlines the role of the features that are critical to classify a given data. However, interpretability is hindered if the DT is too large. To learn compact trees, a Reinforcement Learning (RL) framework has been rece… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  12. arXiv:2303.15210  [pdf, other

    stat.ML cs.LG

    On the Connection between $L_p$ and Risk Consistency and its Implications on Regularized Kernel Methods

    Authors: Hannes Köhler

    Abstract: As a predictor's quality is often assessed by means of its risk, it is natural to regard risk consistency as a desirable property of learning methods, and many such methods have indeed been shown to be risk consistent. The first aim of this paper is to establish the close connection between risk consistency and $L_p$-consistency for a considerably wider class of loss functions than has been done b… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 33 pages, 1 figure

  13. arXiv:2204.11012  [pdf, other

    cs.DB cs.DS

    BatchHL: Answering Distance Queries on Batch-Dynamic Networks at Scale

    Authors: Muhammad Farhan, Qing Wang, Henning Koehler

    Abstract: Many real-world applications operate on dynamic graphs that undergo rapid changes in their topological structure over time. However, it is challenging to design dynamic algorithms that are capable of supporting such graph changes efficiently. To circumvent the challenge, we propose a batch-dynamic framework for answering distance queries, which combines offline labelling and online searching to le… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

  14. Query-by-Sketch: Scaling Shortest Path Graph Queries on Very Large Networks

    Authors: Ye Wang, Qing Wang, Henning Koehler, Yu Lin

    Abstract: Computing shortest paths is a fundamental operation in processing graph data. In many real-world applications, discovering shortest paths between two vertices empowers us to make full use of the underlying structure to understand how vertices are related in a graph, e.g. the strength of social ties between individuals in a social network. In this paper, we study the shortest-path-graph problem tha… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  15. arXiv:2101.12678  [pdf, other

    stat.ML cs.LG

    Total Stability of SVMs and Localized SVMs

    Authors: Hannes Köhler, Andreas Christmann

    Abstract: Regularized kernel-based methods such as support vector machines (SVMs) typically depend on the underlying probability measure $\mathrm{P}$ (respectively an empirical measure $\mathrm{D}_n$ in applications) as well as on the regularization parameter $λ$ and the kernel $k$. Whereas classical statistical robustness only considers the effect of small perturbations in $\mathrm{P}$, the present paper i… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: 30 pages, 1 figure

  16. arXiv:1904.03558  [pdf, ps, other

    cs.DM math.CO

    A characterization of maximal 2-dimensional subgraphs of transitive graphs

    Authors: Henning Koehler

    Abstract: A transitive graph is 2-dimensional if it can be represented as the intersection of two linear orders. Such representations make answering of reachability queries trivial, and allow many problems that are NP-hard on arbitrary graphs to be solved in polynomial time. One may therefore be interested in finding 2-dimensional graphs that closely approximate a given graph of arbitrary order dimension.… ▽ More

    Submitted 6 April, 2019; originally announced April 2019.

    Comments: 10 pages, submitted to Discrete Mathematics and Theoretical Computer Science

    MSC Class: 05C85 ACM Class: G.2.2

  17. arXiv:1710.04333  [pdf, ps, other

    cs.DM math.CO

    Modular decomposition of transitive graphs and transitively orienting their complements

    Authors: Henning Koehler

    Abstract: The modular decomposition of a graph is a canonical representation of its modules. Algorithms for computing the modular decomposition of directed and undirected graphs differ significantly, with the undirected case being simpler, and algorithms for directed graphs often work by reducing the problem to decomposing undirected graphs. In this paper we show that transitive acyclic digraphs have the sa… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 12 pages, submitted to Discrete Mathematics and Theoretical Computer Science

    MSC Class: 05C85 ACM Class: G.2.2