Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Jiang, Wei; Yang, Sifan; Yang, Wenhao; Zhang, Lijun

Computer Science > Machine Learning

arXiv:2406.00489 (cs)

[Submitted on 1 Jun 2024 (v1), last revised 13 Dec 2024 (this version, v3)]

Title:Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Authors:Wei Jiang, Sifan Yang, Wenhao Yang, Lijun Zhang

View PDF HTML (experimental)

Abstract:Sign stochastic gradient descent (signSGD) is a communication-efficient method that transmits only the sign of stochastic gradients for parameter updating. Existing literature has demonstrated that signSGD can achieve a convergence rate of $\mathcal{O}(d^{1/2}T^{-1/4})$, where $d$ represents the dimension and $T$ is the iteration number. In this paper, we improve this convergence rate to $\mathcal{O}(d^{1/2}T^{-1/3})$ by introducing the Sign-based Stochastic Variance Reduction (SSVR) method, which employs variance reduction estimators to track gradients and leverages their signs to update. For finite-sum problems, our method can be further enhanced to achieve a convergence rate of $\mathcal{O}(m^{1/4}d^{1/2}T^{-1/2})$, where $m$ denotes the number of component functions. Furthermore, we investigate the heterogeneous majority vote in distributed settings and introduce two novel algorithms that attain improved convergence rates of $\mathcal{O}(d^{1/2}T^{-1/2} + dn^{-1/2})$ and $\mathcal{O}(d^{1/4}T^{-1/4})$ respectively, outperforming the previous results of $\mathcal{O}(dT^{-1/4} + dn^{-1/2})$ and $\mathcal{O}(d^{3/8}T^{-1/8})$, where $n$ represents the number of nodes. Numerical experiments across different tasks validate the effectiveness of our proposed methods.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2406.00489 [cs.LG]
	(or arXiv:2406.00489v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.00489

Submission history

From: Wei Jiang [view email]
[v1] Sat, 1 Jun 2024 16:38:43 UTC (743 KB)
[v2] Wed, 23 Oct 2024 14:42:35 UTC (1,669 KB)
[v3] Fri, 13 Dec 2024 12:11:01 UTC (1,669 KB)

Computer Science > Machine Learning

Title:Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators