Neural Machine Translation with Recurrent Highway Networks

Parmar, Maulik; Devi, V. Susheela

doi:10.1007/978-3-030-05918-7_27

Computer Science > Computation and Language

arXiv:1905.01996 (cs)

[Submitted on 28 Apr 2019]

Title:Neural Machine Translation with Recurrent Highway Networks

Authors:Maulik Parmar, V.Susheela Devi

View PDF

Abstract:Recurrent Neural Networks have lately gained a lot of popularity in language modelling tasks, especially in neural machine translation(NMT). Very recent NMT models are based on Encoder-Decoder, where a deep LSTM based encoder is used to project the source sentence to a fixed dimensional vector and then another deep LSTM decodes the target sentence from the vector. However there has been very little work on exploring architectures that have more than one layer in space(i.e. in each time step). This paper examines the effectiveness of the simple Recurrent Highway Networks(RHN) in NMT tasks. The model uses Recurrent Highway Neural Network in encoder and decoder, with attention .We also explore the reconstructor model to improve adequacy. We demonstrate the effectiveness of all three approaches on the IWSLT English-Vietnamese dataset. We see that RHN performs on par with LSTM based models and even better in some this http URL see that deep RHN models are easy to train compared to deep LSTM based models because of highway connections. The paper also investigates the effects of increasing recurrent depth in each time step.

Comments:	International Conference on Mining Intelligence and Knowledge Exploration
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1905.01996 [cs.CL]
	(or arXiv:1905.01996v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1905.01996
Journal reference:	In: Groza A., Prasath R. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2018. Lecture Notes in Computer Science, vol 11308. Springer, Cham
Related DOI:	https://doi.org/10.1007/978-3-030-05918-7_27

Submission history

From: Maulik Parmar [view email]
[v1] Sun, 28 Apr 2019 08:27:55 UTC (561 KB)

Computer Science > Computation and Language

Title:Neural Machine Translation with Recurrent Highway Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Neural Machine Translation with Recurrent Highway Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators