Benchmarking Neural Network Generalization for Grammar Induction

Lan, Nur; Chemla, Emmanuel; Katzir, Roni

arXiv:2308.08253 (cs)

[Submitted on 16 Aug 2023 (v1), last revised 25 Aug 2023 (this version, v2)]

Abstract: How well do neural networks generalize? Even for grammar induction tasks, where the target generalization is fully known, previous work has left the question open, testing only very limited ranges beyond the training set and using differing success criteria. We provide a measure of neural network generalization based on fully specified formal languages. Given a model and a formal grammar, the method assigns a generalization score representing how well the model generalizes to unseen samples, in inverse relation to the amount of data it was trained on. The benchmark includes languages such as $a^nb^n$, $a^nb^nc^n$, $a^nb^mc^{n+m}$, Dyck-1, and Dyck-2. We evaluate selected architectures using the benchmark and find that networks trained with a Minimum Description Length (MDL) objective generalize better, and with less data, than networks trained using standard loss functions. The benchmark is available at this https URL.
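
To make the formal languages named in the abstract concrete, the following is a minimal Python sketch of membership checks for each of them. It is not taken from the benchmark itself: the function names, the choice of `()` and `[]` as the Dyck-2 bracket pairs, and the decision to accept the empty string (n = 0) are all assumptions made for illustration, and the paper's generalization score is not reproduced here.

```python
import re


def is_anbn(s: str) -> bool:
    """a^n b^n: n a's followed by exactly n b's (n = 0 accepted by assumption)."""
    n, rem = divmod(len(s), 2)
    return rem == 0 and s == "a" * n + "b" * n


def is_anbncn(s: str) -> bool:
    """a^n b^n c^n: equal-length blocks of a's, b's and c's, in that order."""
    n, rem = divmod(len(s), 3)
    return rem == 0 and s == "a" * n + "b" * n + "c" * n


def is_anbmcnm(s: str) -> bool:
    """a^n b^m c^(n+m): the number of c's equals the a's plus the b's."""
    match = re.fullmatch(r"(a*)(b*)(c*)", s)
    if match is None:
        return False
    a_block, b_block, c_block = match.groups()
    return len(c_block) == len(a_block) + len(b_block)


def is_dyck(s: str, pairs: dict[str, str]) -> bool:
    """Well-nested brackets; `pairs` maps each opening symbol to its closer."""
    closers = {close: open_ for open_, close in pairs.items()}
    stack = []
    for ch in s:
        if ch in pairs:
            stack.append(ch)
        elif ch in closers:
            # A closer must match the most recently opened bracket.
            if not stack or stack.pop() != closers[ch]:
                return False
        else:
            return False  # symbol outside the assumed alphabet
    return not stack


# Hypothetical alphabets: the source does not specify the bracket symbols.
DYCK1 = {"(": ")"}
DYCK2 = {"(": ")", "[": "]"}
```

For example, `is_anbmcnm("abbccc")` holds (n = 1, m = 2), while `is_dyck("([)]", DYCK2)` does not, because the brackets cross instead of nesting. Checks like these give a fully specified ground truth against which a trained model's behaviour on unseen strings can be compared.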

Comments:	10 pages, 4 figures, 2 tables. Conference: Learning with Small Data 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2308.08253 [cs.CL]
	(or arXiv:2308.08253v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.08253

Submission history

From: Nur Lan
[v1] Wed, 16 Aug 2023 09:45:06 UTC (302 KB)
[v2] Fri, 25 Aug 2023 13:40:31 UTC (302 KB)
