Accelerating Stochastic Gradient Descent For Least Squares Regression

Jain, Prateek; Kakade, Sham M.; Kidambi, Rahul; Netrapalli, Praneeth; Sidford, Aaron

arXiv:1704.08227 (stat)

[Submitted on 26 Apr 2017 (v1), last revised 31 Jul 2018 (this version, v2)]

Abstract: There is widespread sentiment that it is not possible to effectively utilize fast gradient methods (e.g. Nesterov's acceleration, conjugate gradient, heavy ball) for the purposes of stochastic optimization due to their instability and error accumulation, a notion made precise in d'Aspremont 2008 and Devolder, Glineur, and Nesterov 2014. This work considers these issues for the special case of stochastic approximation for the least squares regression problem, and our main result refutes the conventional wisdom by showing that acceleration can be made robust to statistical errors. In particular, this work introduces an accelerated stochastic gradient method that provably achieves the minimax optimal statistical risk faster than stochastic gradient descent. Critical to the analysis is a sharp characterization of accelerated stochastic gradient descent as a stochastic process. We hope this characterization gives insights towards the broader question of designing simple and effective accelerated stochastic methods for more general convex and non-convex optimization problems.

Comments:	54 pages, 3 figures, 1 table; updated acknowledgements, minor title change. Paper appeared in the proceedings of the Conference on Learning Theory (COLT), 2018
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST)
Cite as:	arXiv:1704.08227 [stat.ML]
	(or arXiv:1704.08227v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1704.08227

Submission history

From: Rahul Kidambi
[v1] Wed, 26 Apr 2017 17:30:27 UTC (427 KB)
[v2] Tue, 31 Jul 2018 18:11:32 UTC (407 KB)
