Skip to main content

Showing 1–1 of 1 results for author: Kämmerer, M M

.
  1. arXiv:1911.04817  [pdf, other

    cs.LG stat.ML

    On Policy Gradients

    Authors: Mattis Manfred Kämmerer

    Abstract: The goal of policy gradient approaches is to find a policy in a given class of policies which maximizes the expected return. Given a differentiable model of the policy, we want to apply a gradient-ascent technique to reach a local optimum. We mainly use gradient ascent, because it is theoretically well researched. The main issue is that the policy gradient with respect to the expected return is no… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.