An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Aras, Gizem; Makaroglu, Didem; Demir, Seniz; Cakir, Altan

Computer Science > Computation and Language

arXiv:2005.07692 (cs)

[Submitted on 14 May 2020 (v1), last revised 18 May 2020 (this version, v2)]

Title:An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Authors:Gizem Aras, Didem Makaroglu, Seniz Demir, Altan Cakir

View PDF

Abstract:Named entity recognition (NER) is an extensively studied task that extracts and classifies named entities in a text. NER is crucial not only in downstream language processing applications such as relation extraction and question answering but also in large scale big data operations such as real-time analysis of online digital media content. Recent research efforts on Turkish, a less studied language with morphologically rich nature, have demonstrated the effectiveness of neural architectures on well-formed texts and yielded state-of-the art results by formulating the task as a sequence tagging problem. In this work, we empirically investigate the use of recent neural architectures (Bidirectional long short-term memory and Transformer-based networks) proposed for Turkish NER tagging in the same setting. Our results demonstrate that transformer-based networks which can model long-range context overcome the limitations of BiLSTM networks where different input features at the character, subword, and word levels are utilized. We also propose a transformer-based network with a conditional random field (CRF) layer that leads to the state-of-the-art result (95.95\% f-measure) on a common dataset. Our study contributes to the literature that quantifies the impact of transfer learning on processing morphologically rich languages.

Comments:	Submitted to Expert Systems with Applications
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Report number:	ITUAI08
Cite as:	arXiv:2005.07692 [cs.CL]
	(or arXiv:2005.07692v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.07692

Submission history

From: Altan Cakir [view email]
[v1] Thu, 14 May 2020 06:54:07 UTC (531 KB)
[v2] Mon, 18 May 2020 05:53:16 UTC (531 KB)

Computer Science > Computation and Language

Title:An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators