Skip to main content

Showing 1–1 of 1 results for author: Choset, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:1912.00330  [pdf, other

    cs.LG stat.ML

    Adversary A3C for Robust Reinforcement Learning

    Authors: Zhaoyuan Gu, Zhenzhong Jia, Howie Choset

    Abstract: Asynchronous Advantage Actor Critic (A3C) is an effective Reinforcement Learning (RL) algorithm for a wide range of tasks, such as Atari games and robot control. The agent learns policies and value function through trial-and-error interactions with the environment until converging to an optimal policy. Robustness and stability are critical in RL; however, neural network can be vulnerable to noise… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.