Skip to main content

Showing 1–4 of 4 results for author: Spielberg, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.05034  [pdf, other

    cs.LG cs.AI

    Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning

    Authors: Yitzhak Spielberg, Amos Azaria

    Abstract: In the context of reinforcement learning we introduce the concept of criticality of a state, which indicates the extent to which the choice of action in that particular state influences the expected return. That is, a state in which the choice of action is more likely to influence the final outcome is considered as more critical than a state in which it is less likely to influence the final outcom… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:1810.07254

    Journal ref: International Journal on Artificial Intelligence Tools, vol. 30, 2021

  2. arXiv:2201.04633  [pdf, other

    cs.HC cs.AI

    Revelation of Task Difficulty in AI-aided Education

    Authors: Yitzhak Spielberg, Amos Azaria

    Abstract: When a student is asked to perform a given task, her subjective estimate of the difficulty of that task has a strong influence on her performance. There exists a rich literature on the impact of perceived task difficulty on performance and motivation. Yet, there is another topic that is closely related to the subject of the influence of perceived task difficulty that did not receive any attention… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  3. arXiv:2201.04632  [pdf, other

    cs.HC cs.AI

    The Concept of Criticality in AI Safety

    Authors: Yitzhak Spielberg, Amos Azaria

    Abstract: When AI agents don't align their actions with human values they may cause serious harm. One way to solve the value alignment problem is by including a human operator who monitors all of the agent's actions. Despite the fact, that this solution guarantees maximal safety, it is very inefficient, since it requires the human operator to dedicate all of his attention to the agent. In this paper, we pro… ▽ More

    Submitted 12 June, 2023; v1 submitted 12 January, 2022; originally announced January 2022.

  4. arXiv:1810.07254  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    The Concept of Criticality in Reinforcement Learning

    Authors: Yitzhak Spielberg, Amos Azaria

    Abstract: Reinforcement learning methods carry a well known bias-variance trade-off in n-step algorithms for optimal control. Unfortunately, this has rarely been addressed in current research. This trade-off principle holds independent of the choice of the algorithm, such as n-step SARSA, n-step Expected SARSA or n-step Tree backup. A small n results in a large bias, while a large n leads to large variance.… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.