-
Improving the quality of Gujarati-Hindi Machine Translation through part-of-speech tagging and stemmer-assisted transliteration
Abstract: Machine translation for Indian languages is an emerging research area. Transliteration is one of the modules we design when building a translation system. Transliteration means mapping source-language text into the target language. Simple mapping decreases the efficiency of the overall translation system. We propose the use of stemming and part-of-speech tagging for transliteration. The effecti…
Submitted 11 July, 2013; originally announced July 2013.
Comments: 6 pages; June 2013; URL: http://airccse.org/journal/ijnlc/papers/2313ijnlc05.pdf
-
A Lightweight Stemmer for Gujarati
Abstract: Gujarati is a resource-poor language with almost no language processing tools available. In this paper we present an implementation of a rule-based stemmer for Gujarati. We describe the creation of the stemming rules and the morphological richness of Gujarati. We also evaluate our results by having them verified by a human expert.
Submitted 11 November, 2012; v1 submitted 19 October, 2012; originally announced October 2012.
Comments: In Proceedings of 46th Annual Convention of Computer Society of India