A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Hashimoto, Kazuma; Xiong, Caiming; Tsuruoka, Yoshimasa; Socher, Richard

Computer Science > Computation and Language

arXiv:1611.01587 (cs)

[Submitted on 5 Nov 2016 (v1), last revised 24 Jul 2017 (this version, v5)]

Title:A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Authors:Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher

View PDF

Abstract:Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term to allow for optimizing all model weights to improve one task's loss without exhibiting catastrophic interference of the other tasks. Our single end-to-end model obtains state-of-the-art or competitive results on five different tasks from tagging, parsing, relatedness, and entailment tasks.

Comments:	Accepted as a full paper at the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1611.01587 [cs.CL]
	(or arXiv:1611.01587v5 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1611.01587

Submission history

From: Kazuma Hashimoto [view email]
[v1] Sat, 5 Nov 2016 01:59:29 UTC (729 KB)
[v2] Fri, 11 Nov 2016 01:16:07 UTC (731 KB)
[v3] Sat, 19 Nov 2016 00:20:12 UTC (731 KB)
[v4] Sun, 16 Apr 2017 22:38:21 UTC (73 KB)
[v5] Mon, 24 Jul 2017 14:41:16 UTC (79 KB)

Computer Science > Computation and Language

Title:A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators