Linguistic dependencies and statistical dependence

Hoover, Jacob Louis; Sordoni, Alessandro; Du, Wenyu; O'Donnell, Timothy J.

Computer Science > Computation and Language

arXiv:2104.08685v1 (cs)

[Submitted on 18 Apr 2021 (this version), latest version 29 Apr 2022 (v3)]

Title:Linguistic dependencies and statistical dependence

Authors:Jacob Louis Hoover, Alessandro Sordoni, Wenyu Du, Timothy J. O'Donnell

View PDF

Abstract:What is the relationship between linguistic dependencies and statistical dependence? Building on earlier work in NLP and cognitive science, we study this question. We introduce a contextualized version of pointwise mutual information (CPMI), using pretrained language models to estimate probabilities of words in context. Extracting dependency trees which maximize CPMI, we compare the resulting structures against gold dependencies. Overall, we find that these maximum-CPMI trees correspond to linguistic dependencies more often than trees extracted from non-contextual PMI estimate, but only roughly as often as a simple baseline formed by connecting adjacent words. We also provide evidence that the extent to which the two kinds of dependency align cannot be explained by the distance between words or by the category of the dependency relation. Finally, our analysis sheds some light on the differences between large pretrained language models, specifically in the kinds of inductive biases they encode.

Comments:	8 pages, plus references and appendices
Subjects:	Computation and Language (cs.CL); Information Theory (cs.IT)
Cite as:	arXiv:2104.08685 [cs.CL]
	(or arXiv:2104.08685v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.08685

Submission history

From: Jacob Louis Hoover [view email]
[v1] Sun, 18 Apr 2021 02:43:37 UTC (3,070 KB)
[v2] Fri, 10 Sep 2021 14:32:15 UTC (1,323 KB)
[v3] Fri, 29 Apr 2022 16:00:27 UTC (1,302 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-04

Change to browse by:

cs
cs.IT
math
math.IT

References & Citations

DBLP - CS Bibliography

listing | bibtex

Alessandro Sordoni
Wenyu Du
Timothy J. O'Donnell

export BibTeX citation

Computer Science > Computation and Language

Title:Linguistic dependencies and statistical dependence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Linguistic dependencies and statistical dependence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators