Subword models struggle with word learning, but surprisal hides it

Bunzeck, Bastian; Zarrieß, Sina

Computer Science > Computation and Language

arXiv:2502.12835 (cs)

[Submitted on 18 Feb 2025 (v1), last revised 2 Jun 2025 (this version, v2)]

Title:Subword models struggle with word learning, but surprisal hides it

Authors:Bastian Bunzeck, Sina Zarrieß

View PDF HTML (experimental)

Abstract:We study word learning in subword and character language models with the psycholinguistic lexical decision task. While subword LMs struggle to discern words and non-words with high accuracy, character LMs solve this task easily and consistently. Only when supplied with further contexts do subword LMs perform similarly to character models. Additionally, when looking at word-level and syntactic learning trajectories, we find that both processes are separable in character LMs. Word learning happens before syntactic learning, whereas both occur simultaneously in subword LMs. This raises questions about the adequacy of subword LMs for modeling language acquisition and positions character LMs as a viable alternative to study processes below the syntactic level.

Comments:	Accepted to ACL 2025 (Main)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.12835 [cs.CL]
	(or arXiv:2502.12835v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.12835

Submission history

From: Bastian Bunzeck [view email]
[v1] Tue, 18 Feb 2025 13:09:16 UTC (2,328 KB)
[v2] Mon, 2 Jun 2025 08:05:04 UTC (1,318 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2025-02

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Subword models struggle with word learning, but surprisal hides it

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Subword models struggle with word learning, but surprisal hides it

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators