Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

Xu, Jingjing; Sun, Xu; Li, Sujian; Cai, Xiaoyan; Wei, Bingzhen

Computer Science > Computation and Language

arXiv:1711.01427 (cs)

[Submitted on 4 Nov 2017]

Title:Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

Authors:Jingjing Xu, Xu Sun, Sujian Li, Xiaoyan Cai, Bingzhen Wei

View PDF

Abstract:In recent years, neural networks have proven to be effective in Chinese word segmentation. However, this promising performance relies on large-scale training data. Neural networks with conventional architectures cannot achieve the desired results in low-resource datasets due to the lack of labelled training data. In this paper, we propose a deep stacking framework to improve the performance on word segmentation tasks with insufficient data by integrating datasets from diverse domains. Our framework consists of two parts, domain-based models and deep stacking networks. The domain-based models are used to learn knowledge from different datasets. The deep stacking networks are designed to integrate domain-based models. To reduce model conflicts, we innovatively add communication paths among models and design various structures of deep stacking networks, including Gaussian-based Stacking Networks, Concatenate-based Stacking Networks, Sequence-based Stacking Networks and Tree-based Stacking Networks. We conduct experiments on six low-resource datasets from various domains. Our proposed framework shows significant performance improvements on all datasets compared with several strong baselines.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1711.01427 [cs.CL]
	(or arXiv:1711.01427v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1711.01427

Submission history

From: Jingjing Xu [view email]
[v1] Sat, 4 Nov 2017 12:24:26 UTC (2,277 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jingjing Xu
Xu Sun
Sujian Li
Xiaoyan Cai
Bingzhen Wei

export BibTeX citation

Computer Science > Computation and Language

Title:Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators