Comparing the Parameter Complexity of Hypernetworks and the Embedding-Based Alternative

Galanti, Tomer; Wolf, Lior

Computer Science > Machine Learning

arXiv:2002.10006v1 (cs)

[Submitted on 23 Feb 2020 (this version), latest version 2 Nov 2020 (v2)]

Title:Comparing the Parameter Complexity of Hypernetworks and the Embedding-Based Alternative

Authors:Tomer Galanti, Lior Wolf

View PDF

Abstract:In the context of learning to map an input $I$ to a function $h_I:\mathcal{X}\to \mathbb{R}$, we compare two alternative methods: (i) an embedding-based method, which learns a fixed function in which $I$ is encoded as a conditioning signal $e(I)$ and the learned function takes the form $h_I(x) = q(x,e(I))$, and (ii) hypernetworks, in which the weights $\theta_I$ of the function $h_I(x) = g(x;\theta_I)$ are given by a hypernetwork $f$ as $\theta_I=f(I)$.
We extend the theory of~\cite{devore} and provide a lower bound on the complexity of neural networks as function approximators, i.e., the number of trainable parameters. This extension, eliminates the requirements for the approximation method to be robust. Our results are then used to compare the complexities of $q$ and $g$, showing that under certain conditions and when letting the functions $e$ and $f$ be as large as we wish, $g$ can be smaller than $q$ by orders of magnitude. In addition, we show that for typical assumptions on the function to be approximated, the overall number of trainable parameters in a hypernetwork is smaller by orders of magnitude than the number of trainable parameters of a standard neural network and an embedding method.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.10006 [cs.LG]
	(or arXiv:2002.10006v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.10006

Submission history

From: Tomer Galanti [view email]
[v1] Sun, 23 Feb 2020 22:51:52 UTC (565 KB)
[v2] Mon, 2 Nov 2020 12:22:00 UTC (1,205 KB)

Computer Science > Machine Learning

Title:Comparing the Parameter Complexity of Hypernetworks and the Embedding-Based Alternative

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Comparing the Parameter Complexity of Hypernetworks and the Embedding-Based Alternative

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators