Language Model Pre-Training with Sparse Latent Typing

Ren, Liliang; Zhang, Zixuan; Wang, Han; Voss, Clare R.; Zhai, Chengxiang; Ji, Heng

arXiv:2210.12582 (cs)

[Submitted on 23 Oct 2022 (v1), last revised 26 Oct 2022 (this version, v2)]

Abstract: Modern large-scale Pre-trained Language Models (PLMs) have achieved tremendous success on a wide range of downstream tasks. However, most of the LM pre-training objectives only focus on text reconstruction, but have not sought to learn latent-level interpretable representations of sentences. In this paper, we manage to push the language models to obtain a deeper understanding of sentences by proposing a new pre-training objective, Sparse Latent Typing, which enables the model to sparsely extract sentence-level keywords with diverse latent types. Experimental results show that our model is able to learn interpretable latent type categories in a self-supervised manner without using any external knowledge. Besides, the language model pre-trained with such an objective also significantly improves Information Extraction related downstream tasks in both supervised and few-shot settings. Our code is publicly available at: this https URL.

Comments:	EMNLP 2022 (Oral)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.12582 [cs.CL]
	(or arXiv:2210.12582v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.12582

Submission history

From: Liliang Ren
[v1] Sun, 23 Oct 2022 00:37:08 UTC (1,144 KB)
[v2] Wed, 26 Oct 2022 22:41:30 UTC (1,144 KB)
