Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning

Rytting, Christopher Michael; Wingate, David

arXiv:2110.02370 (cs)

[Submitted on 5 Oct 2021]

Abstract: Large natural language models (such as GPT-3 or T5) demonstrate impressive abilities across a range of general NLP tasks. Here, we show that the knowledge embedded in such models provides a useful inductive bias, not just on traditional NLP tasks, but also in the nontraditional task of training a symbolic reasoning engine. We observe that these engines learn quickly and generalize in a natural way that reflects human intuition. For example, training such a system to model block-stacking might naturally generalize to stacking other types of objects because of structure in the real world that has been partially captured by the language describing it. We study several abstract textual reasoning tasks, such as object manipulation and navigation, and demonstrate multiple types of generalization to novel scenarios and the symbols that comprise them. We also demonstrate the surprising utility of compositional learning, where a learner dedicated to mastering a complicated task gains an advantage by training on relevant simpler tasks instead of jumping straight to the complicated task.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2110.02370 [cs.CL]
	(or arXiv:2110.02370v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.02370

Submission history

From: Christopher Michael Rytting
[v1] Tue, 5 Oct 2021 21:40:46 UTC (6,281 KB)
