Showing 1–2 of 2 results for author: Smith, T M S

  1. arXiv:2505.07797  [pdf, ps, other]

    cs.LG

    A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values

    Authors: Daniel Beechey, Thomas M. S. Smith, Özgür Şimşek

    Abstract: Reinforcement learning agents can achieve superhuman performance, but their decisions are often difficult to interpret. This lack of transparency limits deployment, especially in safety-critical settings where human trust and accountability are essential. In this work, we develop a theoretical framework for explaining reinforcement learning through the influence of state features, which represent…

    Submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2306.05810  [pdf, other]

    cs.LG

    Explaining Reinforcement Learning with Shapley Values

    Authors: Daniel Beechey, Thomas M. S. Smith, Özgür Şimşek

    Abstract: For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Rei…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures. Accepted at ICML 2023