A Transformer-based Neural Architecture Search Method

Wang, Shang; Tang, Huanrong; Ouyang, Jianquan

Computer Science > Computation and Language

arXiv:2505.01314 (cs)

[Submitted on 2 May 2025]

Title:A Transformer-based Neural Architecture Search Method

Authors:Shang Wang, Huanrong Tang, Jianquan Ouyang

View PDF HTML (experimental)

Abstract:This paper presents a neural architecture search method based on Transformer architecture, searching cross multihead attention computation ways for different number of encoder and decoder combinations. In order to search for neural network structures with better translation results, we considered perplexity as an auxiliary evaluation metric for the algorithm in addition to BLEU scores and iteratively improved each individual neural network within the population by a multi-objective genetic algorithm. Experimental results show that the neural network structures searched by the algorithm outperform all the baseline models, and that the introduction of the auxiliary evaluation metric can find better models than considering only the BLEU score as an evaluation metric.

Comments:	GECCO 2023
Subjects:	Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2505.01314 [cs.CL]
	(or arXiv:2505.01314v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.01314

Submission history

From: Shang Wang [view email]
[v1] Fri, 2 May 2025 14:40:16 UTC (1,522 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2025-05

Change to browse by:

cs
cs.NE

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:A Transformer-based Neural Architecture Search Method

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Transformer-based Neural Architecture Search Method

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators