Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Galatolo, Federico A.; Cimino, Mario G. C. A.; Vaglini, Gigliola

Computer Science > Machine Learning

arXiv:2004.04120v1 (cs)

[Submitted on 8 Apr 2020 (this version), latest version 1 Oct 2021 (v4)]

Title:Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Authors:Federico A. Galatolo, Mario G.C.A. Cimino, Gigliola Vaglini

View PDF

Abstract:In this paper we investigate some of the issues that arise from the scalarization of the multi-objective optimization problem in the Advantage Actor Critic (A2C) reinforcement learning algorithm. We show how a naive scalarization leads to gradients overlapping and we also argue that the entropy regularization term just inject uncontrolled noise into the system. We propose two methods: one to avoid gradient overlapping (NOG) but keeping the same loss formulation; and one to avoid the noise injection (TE) but generating action distributions with a desired entropy. A comprehensive pilot experiment has been carried out showing how using our proposed methods speeds up the training of 210%. We argue how the proposed solutions can be applied to all the Advantage based reinforcement learning algorithms.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2004.04120 [cs.LG]
	(or arXiv:2004.04120v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.04120

Submission history

From: Federico Galatolo [view email]
[v1] Wed, 8 Apr 2020 17:03:21 UTC (921 KB)
[v2] Thu, 21 May 2020 14:58:25 UTC (1,716 KB)
[v3] Wed, 7 Jul 2021 14:30:46 UTC (1,445 KB)
[v4] Fri, 1 Oct 2021 14:53:15 UTC (1,464 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-04

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Federico A. Galatolo
Mario G. C. A. Cimino
Gigliola Vaglini

export BibTeX citation

Computer Science > Machine Learning

Title:Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators