SEGA: Variance Reduction via Gradient Sketching

Hanzely, Filip; Mishchenko, Konstantin; Richtarik, Peter

Mathematics > Optimization and Control

arXiv:1809.03054 (math)

[Submitted on 9 Sep 2018 (v1), last revised 18 Oct 2018 (this version, v2)]

Title:SEGA: Variance Reduction via Gradient Sketching

Authors:Filip Hanzely, Konstantin Mishchenko, Peter Richtarik

View PDF

Abstract:We propose a randomized first order optimization method--SEGA (SkEtched GrAdient method)-- which progressively throughout its iterations builds a variance-reduced estimate of the gradient from random linear measurements (sketches) of the gradient obtained from an oracle. In each iteration, SEGA updates the current estimate of the gradient through a sketch-and-project operation using the information provided by the latest sketch, and this is subsequently used to compute an unbiased estimate of the true gradient through a random relaxation procedure. This unbiased estimate is then used to perform a gradient step. Unlike standard subspace descent methods, such as coordinate descent, SEGA can be used for optimization problems with a non-separable proximal term. We provide a general convergence analysis and prove linear convergence for strongly convex objectives. In the special case of coordinate sketches, SEGA can be enhanced with various techniques such as importance sampling, minibatching and acceleration, and its rate is up to a small constant factor identical to the best-known rate of coordinate descent.

Comments:	Accepted to the NIPS conference
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:1809.03054 [math.OC]
	(or arXiv:1809.03054v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1809.03054

Submission history

From: Filip Hanzely [view email]
[v1] Sun, 9 Sep 2018 22:40:52 UTC (4,907 KB)
[v2] Thu, 18 Oct 2018 07:51:59 UTC (4,457 KB)

Mathematics > Optimization and Control

Title:SEGA: Variance Reduction via Gradient Sketching

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:SEGA: Variance Reduction via Gradient Sketching

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators