Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Park, Core Francisco; Lubana, Ekdeep Singh; Pres, Itamar; Tanaka, Hidenori

Computer Science > Machine Learning

arXiv:2412.01003 (cs)

[Submitted on 1 Dec 2024 (v1), last revised 2 May 2025 (this version, v4)]

Title:Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Authors:Core Francisco Park, Ekdeep Singh Lubana, Itamar Pres, Hidenori Tanaka

View PDF HTML (experimental)

Abstract:In-Context Learning (ICL) has significantly expanded the general-purpose nature of large language models, allowing them to adapt to novel tasks using merely the inputted context. This has motivated a series of papers that analyze tractable synthetic domains and postulate precise mechanisms that may underlie ICL. However, the use of relatively distinct setups that often lack a sequence modeling nature to them makes it unclear how general the reported insights from such studies are. Motivated by this, we propose a synthetic sequence modeling task that involves learning to simulate a finite mixture of Markov chains. As we show, models trained on this task reproduce most well-known results on ICL, hence offering a unified setting for studying the concept. Building on this setup, we demonstrate we can explain a model's behavior by decomposing it into four broad algorithms that combine a fuzzy retrieval vs. inference approach with either unigram or bigram statistics of the context. These algorithms engage in a competition dynamics to dominate model behavior, with the precise experimental conditions dictating which algorithm ends up superseding others: e.g., we find merely varying context size or amount of training yields (at times sharp) transitions between which algorithm dictates the model behavior, revealing a mechanism that explains the transient nature of ICL. In this sense, we argue ICL is best thought of as a mixture of different algorithms, each with its own peculiarities, instead of a monolithic capability. This also implies that making general claims about ICL that hold universally across all settings may be infeasible.

Comments:	ICLR 2025 Spotlight
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2412.01003 [cs.LG]
	(or arXiv:2412.01003v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.01003
Journal reference:	International Conference on Learning Representations, 2025

Submission history

From: Core Francisco Park [view email]
[v1] Sun, 1 Dec 2024 23:35:53 UTC (27,227 KB)
[v2] Fri, 20 Dec 2024 07:53:56 UTC (27,405 KB)
[v3] Sat, 28 Dec 2024 21:01:43 UTC (33,328 KB)
[v4] Fri, 2 May 2025 05:25:53 UTC (33,329 KB)

Computer Science > Machine Learning

Title:Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators