MaSS: an Accelerated Stochastic Method for Over-parametrized Learning

Liu, Chaoyue; Belkin, Mikhail

Computer Science > Machine Learning

arXiv:1810.13395v2 (cs)

[Submitted on 31 Oct 2018 (v1), revised 5 Nov 2018 (this version, v2), latest version 27 Sep 2019 (v5)]

Title:MaSS: an Accelerated Stochastic Method for Over-parametrized Learning

Authors:Chaoyue Liu, Mikhail Belkin

View PDF

Abstract:In this paper we introduce MaSS (Momentum-added Stochastic Solver), an accelerated SGD method for optimizing over-parameterized networks. Our method is simple and efficient to implement and does not require changing parameters or computing full gradients in the course of optimization. We provide a detailed theoretical analysis for convergence and parameter selection including their dependence on the mini-batch size in the quadratic case. We also provide theoretical convergence results for a more general convex setting.
We provide an experimental evaluation showing strong performance of our method in comparison to Adam and SGD for several standard architectures of deep networks including ResNet, convolutional and fully connected networks. We also show its performance for convex kernel machines.

Comments:	This is version 2. Find typos in version 1 .Make the corrections
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.13395 [cs.LG]
	(or arXiv:1810.13395v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.13395

Submission history

From: Chaoyue Liu [view email]
[v1] Wed, 31 Oct 2018 16:44:05 UTC (3,202 KB)
[v2] Mon, 5 Nov 2018 21:30:32 UTC (3,202 KB)
[v3] Thu, 14 Feb 2019 16:58:03 UTC (2,862 KB)
[v4] Mon, 18 Feb 2019 18:53:41 UTC (2,862 KB)
[v5] Fri, 27 Sep 2019 17:38:08 UTC (1,447 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chaoyue Liu
Mikhail Belkin

export BibTeX citation

Computer Science > Machine Learning

Title:MaSS: an Accelerated Stochastic Method for Over-parametrized Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:MaSS: an Accelerated Stochastic Method for Over-parametrized Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators