Skip to main content

Showing 1–1 of 1 results for author: Thai, H L

Searching in archive cs. Search in all archives.
.
  1. arXiv:1902.02823  [pdf, other

    cs.LG stat.ML

    Compatible Natural Gradient Policy Search

    Authors: Joni Pajarinen, Hong Linh Thai, Riad Akrour, Jan Peters, Gerhard Neumann

    Abstract: Trust-region methods have yielded state-of-the-art results in policy search. A common approach is to use KL-divergence to bound the region of trust resulting in a natural gradient policy update. We show that the natural gradient and trust region optimization are equivalent if we use the natural parameterization of a standard exponential policy distribution in combination with compatible value func… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.