Showing 1–1 of 1 results for author: Yedetore, A

Search v0.5.6 released 2020-02-24

arXiv:2301.11462 [pdf, other]

cs.CL

How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Authors: Aditya Yedetore, Tal Linzen, Robert Frank, R. Thomas McCoy

Abstract: When acquiring syntax, children consistently choose hierarchical rules over competing non-hierarchical possibilities. Is this preference due to a learning bias for hierarchical structure, or due to more general biases that interact with hierarchical cues in children's linguistic input? We explore these possibilities by training LSTMs and Transformers - two types of neural networks without a hierar… ▽ More When acquiring syntax, children consistently choose hierarchical rules over competing non-hierarchical possibilities. Is this preference due to a learning bias for hierarchical structure, or due to more general biases that interact with hierarchical cues in children's linguistic input? We explore these possibilities by training LSTMs and Transformers - two types of neural networks without a hierarchical bias - on data similar in quantity and content to children's linguistic input: text from the CHILDES corpus. We then evaluate what these models have learned about English yes/no questions, a phenomenon for which hierarchical structure is crucial. We find that, though they perform well at capturing the surface statistics of child-directed speech (as measured by perplexity), both model types generalize in a way more consistent with an incorrect linear rule than the correct hierarchical rule. These results suggest that human-like generalization from text alone requires stronger biases than the general sequence-processing biases of standard neural network architectures. △ Less

Submitted 6 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

Comments: 10 pages plus references and appendices; accepted to ACL

ACM Class: J.4; I.2.7

Search v0.5.6 released 2020-02-24