History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Ji, Kaiyi; Wang, Zhe; Weng, Bowen; Zhou, Yi; Zhang, Wei; Liang, Yingbin

Mathematics > Optimization and Control

arXiv:1910.09670 (math)

[Submitted on 21 Oct 2019 (v1), last revised 27 Jul 2020 (this version, v4)]

Title:History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Authors:Kaiyi Ji, Zhe Wang, Bowen Weng, Yi Zhou, Wei Zhang, Yingbin Liang

View PDF

Abstract:Variance-reduced algorithms, although achieve great theoretical performance, can run slowly in practice due to the periodic gradient estimation with a large batch of data. Batch-size adaptation thus arises as a promising approach to accelerate such algorithms. However, existing schemes either apply prescribed batch-size adaption rule or exploit the information along optimization path via additional backtracking and condition verification steps. In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along optimization path by adapting the batch size via history stochastic gradients. We further theoretically show that such a scheme substantially reduces the overall complexity for popular variance-reduced algorithms SVRG and SARAH/SPIDER for both conventional nonconvex optimization and reinforcement learning problems. To this end, we develop a new convergence analysis framework to handle the dependence of the batch size on history stochastic gradients. Extensive experiments validate the effectiveness of the proposed batch-size adaptation scheme.

Comments:	46 pages, 23 figures; Published in ICML 2020
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1910.09670 [math.OC]
	(or arXiv:1910.09670v4 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1910.09670

Submission history

From: Kaiyi Ji [view email]
[v1] Mon, 21 Oct 2019 21:58:07 UTC (189 KB)
[v2] Thu, 13 Feb 2020 21:41:27 UTC (1,322 KB)
[v3] Fri, 28 Feb 2020 18:53:30 UTC (1,322 KB)
[v4] Mon, 27 Jul 2020 03:12:54 UTC (11,564 KB)

Mathematics > Optimization and Control

Title:History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators