Meta-in-context learning in large language models

Coda-Forno, Julian; Binz, Marcel; Akata, Zeynep; Botvinick, Matthew; Wang, Jane X.; Schulz, Eric

Computer Science > Computation and Language

arXiv:2305.12907 (cs)

[Submitted on 22 May 2023]

Title:Meta-in-context learning in large language models

Authors:Julian Coda-Forno, Marcel Binz, Zeynep Akata, Matthew Botvinick, Jane X. Wang, Eric Schulz

View PDF

Abstract:Large language models have shown tremendous performance in a variety of tasks. In-context learning -- the ability to improve at a task after being provided with a number of demonstrations -- is seen as one of the main contributors to their success. In the present paper, we demonstrate that the in-context learning abilities of large language models can be recursively improved via in-context learning itself. We coin this phenomenon meta-in-context learning. Looking at two idealized domains, a one-dimensional regression task and a two-armed bandit task, we show that meta-in-context learning adaptively reshapes a large language model's priors over expected tasks. Furthermore, we find that meta-in-context learning modifies the in-context learning strategies of such models. Finally, we extend our approach to a benchmark of real-world regression problems where we observe competitive performance to traditional learning algorithms. Taken together, our work improves our understanding of in-context learning and paves the way toward adapting large language models to the environment they are applied purely through meta-in-context learning rather than traditional finetuning.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2305.12907 [cs.CL]
	(or arXiv:2305.12907v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.12907

Submission history

From: Julian Coda-Forno [view email]
[v1] Mon, 22 May 2023 10:40:36 UTC (6,582 KB)

Computer Science > Computation and Language

Title:Meta-in-context learning in large language models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Meta-in-context learning in large language models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators