Pre-training Language Models with Deterministic Factual Knowledge

Li, Shaobo; Li, Xiaoguang; Shang, Lifeng; Sun, Chengjie; Liu, Bingquan; Ji, Zhenzhou; Jiang, Xin; Liu, Qun

Computer Science > Computation and Language

arXiv:2210.11165 (cs)

[Submitted on 20 Oct 2022]

Title:Pre-training Language Models with Deterministic Factual Knowledge

Authors:Shaobo Li, Xiaoguang Li, Lifeng Shang, Chengjie Sun, Bingquan Liu, Zhenzhou Ji, Xin Jiang, Qun Liu

View PDF

Abstract:Previous works show that Pre-trained Language Models (PLMs) can capture factual knowledge. However, some analyses reveal that PLMs fail to perform it robustly, e.g., being sensitive to the changes of prompts when extracting factual knowledge. To mitigate this issue, we propose to let PLMs learn the deterministic relationship between the remaining context and the masked content. The deterministic relationship ensures that the masked factual content can be deterministically inferable based on the existing clues in the context. That would provide more stable patterns for PLMs to capture factual knowledge than randomly masking. Two pre-training tasks are further introduced to motivate PLMs to rely on the deterministic relationship when filling masks. Specifically, we use an external Knowledge Base (KB) to identify deterministic relationships and continuously pre-train PLMs with the proposed methods. The factual knowledge probing experiments indicate that the continuously pre-trained PLMs achieve better robustness in factual knowledge capturing. Further experiments on question-answering datasets show that trying to learn a deterministic relationship with the proposed methods can also help other knowledge-intensive tasks.

Comments:	Accepted at EMNLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.11165 [cs.CL]
	(or arXiv:2210.11165v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.11165

Submission history

From: Shaobo Li [view email]
[v1] Thu, 20 Oct 2022 11:04:09 UTC (1,116 KB)

Computer Science > Computation and Language

Title:Pre-training Language Models with Deterministic Factual Knowledge

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Pre-training Language Models with Deterministic Factual Knowledge

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators