Showing 1–1 of 1 results for author: Soprani, A

  1. arXiv:2005.12864

    cs.LG stat.ML

    Time-Variant Variational Transfer for Value Functions

    Authors: Giuseppe Canonaco, Andrea Soprani, Manuel Roveri, Marcello Restelli

    Abstract: In most of the transfer learning approaches to reinforcement learning (RL) the distribution over the tasks is assumed to be stationary. Therefore, the target and source tasks are i.i.d. samples of the same distribution. In the context of this work, we consider the problem of transferring value functions through a variational method when the distribution that generates the tasks is time-variant, pr…

    Submitted 18 June, 2020; v1 submitted 26 May, 2020; originally announced May 2020.