Showing 1–1 of 1 results for author: Alshiekh, M

  1. arXiv:1708.08611

    cs.LO cs.AI cs.LG

    Safe Reinforcement Learning via Shielding

    Authors: Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu

    Abstract: Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily guarantee safety during learning or execution phases. We introduce a new approach to learn optimal policies while enforcing properties expressed in temporal logic. To this end, given the temporal logic specification that is to be obeyed by the learning system, we propose to synthesize a reactive system…

    Submitted 3 September, 2017; v1 submitted 29 August, 2017; originally announced August 2017.