Showing 1–1 of 1 results for author: Baldazo, D

  1. arXiv:1710.10363  [pdf, other]

    cs.LG cs.MA math.OC stat.ML

    Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning

    Authors: Sergio Valcarcel Macua, Aleksi Tukiainen, Daniel García-Ocaña Hernández, David Baldazo, Enrique Munoz de Cote, Santiago Zazo

    Abstract: We propose a fully distributed actor-critic algorithm approximated by deep neural networks, named \textit{Diff-DAC}, with application to single-task and to average multitask reinforcement learning (MRL). Each agent has access to data from its local task only, but it aims to learn a policy that performs well on average for the whole set of tasks. During the learning process, agents communicate thei…

    Submitted 25 October, 2020; v1 submitted 27 October, 2017; originally announced October 2017.

    Journal ref: Presented at Adaptive Learning Agents workshop (ALA2018), July 14th, 2018, Stockholm, Sweden