A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations

Hu, Wenqing; Li, Chris Junchi

doi:10.3934/dcds.2018216

Mathematics > Probability

arXiv:1709.00515 (math)

[Submitted on 2 Sep 2017 (v1), last revised 25 Jul 2018 (this version, v3)]

Title:A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations

Authors:Wenqing Hu, Chris Junchi Li

View PDF

Abstract:We consider in this work a system of two stochastic differential equations named the perturbed compositional gradient flow. By introducing a separation of fast and slow scales of the two equations, we show that the limit of the slow motion is given by an averaged ordinary differential equation. We then demonstrate that the deviation of the slow motion from the averaged equation, after proper rescaling, converges to a stochastic process with Gaussian inputs. This indicates that the slow motion can be approximated in the weak sense by a standard perturbed gradient flow or the continuous-time stochastic gradient descent algorithm that solves the optimization problem for a composition of two functions. As an application, the perturbed compositional gradient flow corresponds to the diffusion limit of the Stochastic Composite Gradient Descent (SCGD) algorithm for minimizing a composition of two expected-value functions in the optimization literatures. For the strongly convex case, such an analysis implies that the SCGD algorithm has the same convergence time asymptotic as the classical stochastic gradient descent algorithm. Thus it validates, at the level of continuous approximation, the effectiveness of using the SCGD algorithm in the strongly convex case.

Comments:	Final version to appear at DCDS-A
Subjects:	Probability (math.PR); Machine Learning (stat.ML)
MSC classes:	34C29, 60J60, 62L20, 90C30
Cite as:	arXiv:1709.00515 [math.PR]
	(or arXiv:1709.00515v3 [math.PR] for this version)
	https://doi.org/10.48550/arXiv.1709.00515
Journal reference:	Discrete and Continuous Dynamical Systems, Series A, Vol 38, Issue 10, October 2018
Related DOI:	https://doi.org/10.3934/dcds.2018216

Submission history

From: Wenqing Hu [view email]
[v1] Sat, 2 Sep 2017 01:26:13 UTC (23 KB)
[v2] Wed, 27 Sep 2017 21:58:38 UTC (24 KB)
[v3] Wed, 25 Jul 2018 03:05:15 UTC (41 KB)

Mathematics > Probability

Title:A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Probability

Title:A convergence analysis of the perturbed compositional gradient flow: averaging principle and normal deviations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators