Skip to main content

Showing 1–1 of 1 results for author: Samaraweeraa, M

.
  1. arXiv:2412.02056  [pdf, other

    cs.CL

    A Multi-way Parallel Named Entity Annotated Corpus for English, Tamil and Sinhala

    Authors: Surangika Ranathunga, Asanka Ranasinghea, Janaka Shamala, Ayodya Dandeniyaa, Rashmi Galappaththia, Malithi Samaraweeraa

    Abstract: This paper presents a multi-way parallel English-Tamil-Sinhala corpus annotated with Named Entities (NEs), where Sinhala and Tamil are low-resource languages. Using pre-trained multilingual Language Models (mLMs), we establish new benchmark Named Entity Recognition (NER) results on this dataset for Sinhala and Tamil. We also carry out a detailed investigation on the NER capabilities of different t… ▽ More

    Submitted 14 January, 2025; v1 submitted 2 December, 2024; originally announced December 2024.