Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

Zaitova, Iuliia; Hirak, Vitalii; Abdullah, Badr M.; Klakow, Dietrich; Möbius, Bernd; Avgustinova, Tania

Computer Science > Computation and Language

arXiv:2505.06062 (cs)

[Submitted on 9 May 2025]

Title:Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

Authors:Iuliia Zaitova, Vitalii Hirak, Badr M. Abdullah, Dietrich Klakow, Bernd Möbius, Tania Avgustinova

View PDF HTML (experimental)

Abstract:This study analyzes the attention patterns of fine-tuned encoder-only models based on the BERT architecture (BERT-based models) towards two distinct types of Multiword Expressions (MWEs): idioms and microsyntactic units (MSUs). Idioms present challenges in semantic non-compositionality, whereas MSUs demonstrate unconventional syntactic behavior that does not conform to standard grammatical categorizations. We aim to understand whether fine-tuning BERT-based models on specific tasks influences their attention to MWEs, and how this attention differs between semantic and syntactic tasks. We examine attention scores to MWEs in both pre-trained and fine-tuned BERT-based models. We utilize monolingual models and datasets in six Indo-European languages - English, German, Dutch, Polish, Russian, and Ukrainian. Our results show that fine-tuning significantly influences how models allocate attention to MWEs. Specifically, models fine-tuned on semantic tasks tend to distribute attention to idiomatic expressions more evenly across layers. Models fine-tuned on syntactic tasks show an increase in attention to MSUs in the lower layers, corresponding with syntactic processing requirements.

Comments:	10 pages, 3 figures. Findings 2025
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2505.06062 [cs.CL]
	(or arXiv:2505.06062v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2505.06062
Journal reference:	In Findings of the Association for Computational Linguistics: NAACL 2025, pages 4083â€“4092, Albuquerque, New Mexico https://aclanthology.org/2025.findings-naacl.228/

Submission history

From: Iuliia Zaitova [view email]
[v1] Fri, 9 May 2025 13:57:56 UTC (8,882 KB)

Computer Science > Computation and Language

Title:Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators