AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis

Song, Zichen; Wu, Yuxin; Huang, Sitan; Kang, Zhongfeng

Computer Science > Computation and Language

arXiv:2411.02117 (cs)

[Submitted on 4 Nov 2024]

Title:AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis

Authors:Zichen Song, Yuxin Wu, Sitan Huang, Zhongfeng Kang

View PDF HTML (experimental)

Abstract:The evaluation of layer importance in deep learning has been an active area of research, with significant implications for model optimization and interpretability. Recently, large language models (LLMs) have gained prominence across various domains, yet limited studies have explored the functional importance and performance contributions of individual layers within LLMs, especially from the perspective of activation distribution. In this work, we propose the Activation Variance-Sparsity Score (AVSS), a novel metric combining normalized activation variance and sparsity to assess each layer's contribution to model performance. By identifying and removing approximately the lowest 25% of layers based on AVSS, we achieve over 90% of original model performance across tasks such as question answering, language modeling, and sentiment classification, indicating that these layers may be non-essential. Our approach provides a systematic method for identifying less critical layers, contributing to efficient large language model architectures.

Comments:	4 pages, 1 figure
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2411.02117 [cs.CL]
	(or arXiv:2411.02117v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.02117

Submission history

From: Zichen Song [view email]
[v1] Mon, 4 Nov 2024 14:29:49 UTC (1,179 KB)

Computer Science > Computation and Language

Title:AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators