Multi-Step Model-Agnostic Meta-Learning: Convergence and Improved Algorithms

Ji, Kaiyi; Yang, Junjie; Liang, Yingbin

arXiv:2002.07836v2 (cs)

[Submitted on 18 Feb 2020 (v1), revised 20 Feb 2020 (this version, v2), latest version 13 Jul 2020 (v3)]

Abstract: As a popular meta-learning approach, the model-agnostic meta-learning (MAML) algorithm has been widely used due to its simplicity and effectiveness. However, the convergence of the general multi-step MAML remains unexplored. In this paper, we develop a new theoretical framework under which we characterize the convergence rate and the computational complexity of multi-step MAML. Our results indicate that, under a properly chosen inner stepsize, $N$-step MAML converges with a computational complexity that increases linearly with $N$. We then take a further step and develop a more efficient Hessian-free MAML. We first show that the existing zeroth-order Hessian estimator contains a constant-level estimation error, so the resulting MAML algorithm can be unstable. To address this issue, we propose a novel Hessian estimator based on a gradient-based Gaussian smoothing method, and show that it achieves a much smaller estimation bias and variance and that the resulting algorithm achieves the same performance guarantee as the original MAML under mild conditions. Our experiments validate our theory and demonstrate the effectiveness of the proposed Hessian estimator.
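
The abstract does not state the formula of the proposed estimator, so the following is only a minimal sketch of the general idea of estimating a Hessian from gradient differences under Gaussian perturbations, not the authors' construction. It assumes the generic form $\hat{H} = \frac{1}{m}\sum_{k=1}^{m} \frac{\nabla f(x + \delta u_k) - \nabla f(x)}{\delta} u_k^\top$ with $u_k \sim \mathcal{N}(0, I)$, and a toy quadratic objective $f(x) = \frac{1}{2} x^\top A x$ whose true Hessian is $A$. The names `grad`, `smoothed_hessian`, `delta`, and `num_samples` are hypothetical and chosen for illustration.

```python
import random

def grad(x, A):
    """Gradient of the toy objective f(x) = 0.5 * x^T A x, i.e. A x (A symmetric)."""
    d = len(x)
    return [sum(A[i][j] * x[j] for j in range(d)) for i in range(d)]

def smoothed_hessian(grad_fn, x, delta=1e-3, num_samples=20000, seed=0):
    """Monte Carlo Hessian estimate at x (assumed generic form, not the paper's):
    the average of (grad(x + delta*u) - grad(x)) u^T / delta over u ~ N(0, I)."""
    rng = random.Random(seed)
    d = len(x)
    g0 = grad_fn(x)  # gradient at the unperturbed point, reused for every sample
    H = [[0.0] * d for _ in range(d)]
    for _ in range(num_samples):
        u = [rng.gauss(0.0, 1.0) for _ in range(d)]
        g1 = grad_fn([x[i] + delta * u[i] for i in range(d)])
        for i in range(d):
            diff = (g1[i] - g0[i]) / delta  # finite difference of gradients along u
            for j in range(d):
                H[i][j] += diff * u[j] / num_samples
    return H

# Toy check: for a quadratic, the estimate should approach A as samples grow.
A = [[2.0, 0.5], [0.5, 1.0]]
H_est = smoothed_hessian(lambda x: grad(x, A), [0.3, -0.7])
```

Only gradient evaluations are needed here, which is what makes this family of estimators attractive for a Hessian-free variant of MAML. In this sketch the accuracy depends on the number of samples, and for non-quadratic objectives also on the smoothing radius `delta`.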

Comments: 67 pages, 8 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as: arXiv:2002.07836 [cs.LG] (or arXiv:2002.07836v2 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2002.07836

Submission history

From: Kaiyi Ji
[v1] Tue, 18 Feb 2020 19:17:54 UTC (136 KB)
[v2] Thu, 20 Feb 2020 22:11:20 UTC (146 KB)
[v3] Mon, 13 Jul 2020 04:03:09 UTC (47 KB)
