einspace: Searching for Neural Architectures from Fundamental Operations

Ericsson, Linus; Espinosa, Miguel; Yang, Chenhongyi; Antoniou, Antreas; Storkey, Amos; Cohen, Shay B.; McDonagh, Steven; Crowley, Elliot J.

Computer Science > Machine Learning

arXiv:2405.20838 (cs)

[Submitted on 31 May 2024 (v1), last revised 30 Oct 2024 (this version, v2)]

Title:einspace: Searching for Neural Architectures from Fundamental Operations

Authors:Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

View PDF HTML (experimental)

Abstract:Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shifts, we need a novel expressive search space design which is built from more fundamental operations. To this end, we introduce einspace, a search space based on a parameterised probabilistic context-free grammar. Our space is versatile, supporting architectures of various sizes and complexities, while also containing diverse network operations which allow it to model convolutions, attention components and more. It contains many existing competitive architectures, and provides flexibility for discovering new ones. Using this search space, we perform experiments to find novel architectures as well as improvements on existing ones on the diverse Unseen NAS datasets. We show that competitive architectures can be obtained by searching from scratch, and we consistently find large improvements when initialising the search with strong baselines. We believe that this work is an important advancement towards a transformative NAS paradigm where search space expressivity and strategic search initialisation play key roles.

Comments:	NeurIPS 2024. Project page at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2405.20838 [cs.LG]
	(or arXiv:2405.20838v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.20838

Submission history

From: Linus Ericsson [view email]
[v1] Fri, 31 May 2024 14:25:45 UTC (2,004 KB)
[v2] Wed, 30 Oct 2024 12:35:56 UTC (2,023 KB)

Computer Science > Machine Learning

Title:einspace: Searching for Neural Architectures from Fundamental Operations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:einspace: Searching for Neural Architectures from Fundamental Operations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators