Beyond Convexity -- Contraction and Global Convergence of Gradient Descent

Wensing, Patrick M.; Slotine, Jean-Jacques E.

doi:10.1371/journal.pone.0236661

arXiv:1806.06655 (math)

[Submitted on 18 Jun 2018 (v1), last revised 22 Dec 2022 (this version, v7)]

Abstract: This paper considers the analysis of continuous-time gradient-based optimization algorithms through the lens of nonlinear contraction theory. It demonstrates that in the case of a time-invariant objective, most elementary results on gradient descent based on convexity can be replaced by much more general results based on contraction. In particular, gradient descent converges to a unique equilibrium if its dynamics are contracting in any metric, with convexity of the cost corresponding to the special case of contraction in the identity metric. More broadly, contraction analysis provides new insights for the case of geodesically convex optimization, wherein non-convex problems in Euclidean space can be transformed to convex ones posed over a Riemannian manifold. In this case, natural gradient descent converges to a unique equilibrium if it is contracting in any metric, with geodesic convexity of the cost corresponding to contraction in the natural metric. New results using semi-contraction provide additional insights into the topology of the set of optimizers in the case when multiple optima exist. Furthermore, they show how semi-contraction may be combined with specific additional information to reach broad conclusions about a dynamical system. The contraction perspective also easily extends to time-varying optimization settings and allows one to recursively build large optimization structures out of simpler elements. Extensions to natural primal-dual optimization and game-theoretic contexts further illustrate the potential reach of these new perspectives.
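
For readers unfamiliar with contraction analysis, the link between convexity and contraction in the identity metric mentioned in the abstract can be sketched as follows. This is a standard textbook-style statement written in assumed notation (the cost $f$, metric $M(x)$, and rate $\lambda$ are illustrative symbols, not quoted from the paper), and it should be read as an informal summary rather than the authors' exact formulation.

```latex
% Gradient flow on a twice-differentiable cost f (assumed notation):
\dot{x} = -\nabla f(x),
\qquad
\frac{\partial \dot{x}}{\partial x} = -\nabla^2 f(x).

% Contraction with rate \lambda > 0 in a metric M(x) \succ 0 requires
\dot{M} - M\,\nabla^2 f(x) - \nabla^2 f(x)\,M \preceq -2\lambda M,
\qquad
\dot{M} = \sum_i \frac{\partial M}{\partial x_i}\,\dot{x}_i .

% Special case M = I (so \dot{M} = 0):
-2\,\nabla^2 f(x) \preceq -2\lambda I
\quad\Longleftrightarrow\quad
\nabla^2 f(x) \succeq \lambda I .
```

In the identity metric the condition reduces to a uniformly positive definite Hessian, i.e. strong convexity of $f$; a non-identity $M(x)$ can satisfy the inequality even when $\nabla^2 f$ is indefinite somewhere, which is the sense in which contraction generalizes convexity.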

Comments:	author typesetting of extended final version (expanded appendix)
Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:1806.06655 [math.OC]
	(or arXiv:1806.06655v7 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1806.06655
Related DOI:	https://doi.org/10.1371/journal.pone.0236661

Submission history

From: Patrick Wensing
[v1] Mon, 18 Jun 2018 13:34:51 UTC (74 KB)
[v2] Mon, 29 Oct 2018 01:55:21 UTC (82 KB)
[v3] Tue, 10 Mar 2020 20:37:31 UTC (103 KB)
[v4] Sun, 17 May 2020 18:45:47 UTC (50 KB)
[v5] Sat, 30 May 2020 22:04:50 UTC (353 KB)
[v6] Tue, 11 Aug 2020 14:54:56 UTC (344 KB)
[v7] Thu, 22 Dec 2022 02:19:39 UTC (590 KB)
