FORML: A Riemannian Hessian-free Method for Meta-learning on Stiefel Manifolds

Tabealhojeh, Hadi; Roy, Soumava Kumar; Adibi, Peyman; Karshenas, Hossein

Computer Science > Machine Learning

arXiv:2402.18605 (cs)

[Submitted on 28 Feb 2024 (v1), last revised 31 May 2024 (this version, v2)]

Title:FORML: A Riemannian Hessian-free Method for Meta-learning on Stiefel Manifolds

Authors:Hadi Tabealhojeh, Soumava Kumar Roy, Peyman Adibi, Hossein Karshenas

View PDF HTML (experimental)

Abstract:Meta-learning problem is usually formulated as a bi-level optimization in which the task-specific and the meta-parameters are updated in the inner and outer loops of optimization, respectively. However, performing the optimization in the Riemannian space, where the parameters and meta-parameters are located on Riemannian manifolds is computationally intensive. Unlike the Euclidean methods, the Riemannian backpropagation needs computing the second-order derivatives that include backward computations through the Riemannian operators such as retraction and orthogonal projection. This paper introduces a Hessian-free approach that uses a first-order approximation of derivatives on the Stiefel manifold. Our method significantly reduces the computational load and memory footprint. We show how using a Stiefel fully-connected layer that enforces orthogonality constraint on the parameters of the last classification layer as the head of the backbone network, strengthens the representation reuse of the gradient-based meta-learning methods. Our experimental results across various few-shot learning datasets, demonstrate the superiority of our proposed method compared to the state-of-the-art methods, especially MAML, its Euclidean counterpart.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.18605 [cs.LG]
	(or arXiv:2402.18605v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.18605

Submission history

From: Hadi Tabealhojeh [view email]
[v1] Wed, 28 Feb 2024 10:57:30 UTC (1,533 KB)
[v2] Fri, 31 May 2024 21:34:33 UTC (1,771 KB)

Computer Science > Machine Learning

Title:FORML: A Riemannian Hessian-free Method for Meta-learning on Stiefel Manifolds

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:FORML: A Riemannian Hessian-free Method for Meta-learning on Stiefel Manifolds

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators