-
arXiv:1708.03871
A Game-Theoretic Analysis of the Off-Switch Game
Abstract: The off-switch game is a game-theoretic model of a highly intelligent robot interacting with a human. In the original paper by Hadfield-Menell et al. (2016), the analysis is not fully game-theoretic, as the human is modelled as an irrational player and the robot's best action is calculated only under unrealistic normality and soft-max assumptions. In this paper, we make the analysis fully game the…
Submitted 13 August, 2017; originally announced August 2017.
Journal ref: Artificial General Intelligence: 10th International Conference, AGI 2017, Melbourne, VIC, Australia, August 15-18, 2017, Proceedings, pages 167-177