A Study of Continual Learning Under Language Shift

Gogoulou, Evangelia; Lesort, Timothée; Boman, Magnus; Nivre, Joakim

arXiv:2311.01200v1 (cs)

[Submitted on 2 Nov 2023 (this version), latest version 27 Jun 2024 (v4)]

Abstract: The recent increase in data and model scale for language model pre-training has led to huge training costs. In scenarios where new data become available over time, updating a model instead of fully retraining it would therefore provide significant gains. In this paper, we study the benefits and downsides of updating a language model when new data come from new languages: the case of continual learning under language shift. Starting from a monolingual English language model, we incrementally add data from Norwegian and Icelandic to investigate how forward and backward transfer effects depend on the pre-training order and characteristics of languages, for different model sizes and learning rate schedulers. Our results show that, while forward transfer is largely positive and independent of language order, backward transfer can be either positive or negative depending on the order and characteristics of new languages. To explain these patterns, we explore several language similarity metrics and find that syntactic similarity appears to have the best correlation with our results.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2311.01200 [cs.CL]
	(or arXiv:2311.01200v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.01200

Submission history

From: Evangelia Gogoulou
[v1] Thu, 2 Nov 2023 12:54:50 UTC (383 KB)
[v2] Wed, 21 Feb 2024 13:21:45 UTC (728 KB)
[v3] Mon, 26 Feb 2024 08:20:03 UTC (728 KB)
[v4] Thu, 27 Jun 2024 08:35:53 UTC (622 KB)
