Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies

Li, Oscar; Harrison, James; Sohl-Dickstein, Jascha; Smith, Virginia; Metz, Luke

Computer Science > Neural and Evolutionary Computing

arXiv:2304.12180 (cs)

[Submitted on 21 Apr 2023 (v1), last revised 9 Dec 2023 (this version, v2)]

Title:Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies

Authors:Oscar Li, James Harrison, Jascha Sohl-Dickstein, Virginia Smith, Luke Metz

View PDF HTML (experimental)

Abstract:Unrolled computation graphs are prevalent throughout machine learning but present challenges to automatic differentiation (AD) gradient estimation methods when their loss functions exhibit extreme local sensitivtiy, discontinuity, or blackbox characteristics. In such scenarios, online evolution strategies methods are a more capable alternative, while being more parallelizable than vanilla evolution strategies (ES) by interleaving partial unrolls and gradient updates. In this work, we propose a general class of unbiased online evolution strategies methods. We analytically and empirically characterize the variance of this class of gradient estimators and identify the one with the least variance, which we term Noise-Reuse Evolution Strategies (NRES). Experimentally, we show NRES results in faster convergence than existing AD and ES methods in terms of wall-clock time and number of unroll steps across a variety of applications, including learning dynamical systems, meta-training learned optimizers, and reinforcement learning.

Comments:	NeurIPS 2023. 41 pages. Code available at this https URL
Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2304.12180 [cs.NE]
	(or arXiv:2304.12180v2 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2304.12180

Submission history

From: Oscar Li [view email]
[v1] Fri, 21 Apr 2023 17:53:05 UTC (3,355 KB)
[v2] Sat, 9 Dec 2023 22:20:16 UTC (2,763 KB)

Computer Science > Neural and Evolutionary Computing

Title:Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators