Skip to main content

Showing 1–23 of 23 results for author: Mathur, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:1507.03223  [pdf

    cs.CL

    Classifier-Based Text Simplification for Improved Machine Translation

    Authors: Shruti Tyagi, Deepti Chopra, Iti Mathur, Nisheeth Joshi

    Abstract: Machine Translation is one of the research fields of Computational Linguistics. The objective of many MT Researchers is to develop an MT System that produce good quality and high accuracy output translations and which also covers maximum language pairs. As internet and Globalization is increasing day by day, we need a way that improves the quality of translation. For this reason, we have developed… ▽ More

    Submitted 12 July, 2015; originally announced July 2015.

    Comments: In Proceedings of International Conference on Advances in Computer Engineering and Applications 2015

  2. arXiv:1407.2694  [pdf

    cs.CL

    Quality Estimation Of Machine Translation Outputs Through Stemming

    Authors: Pooja Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: Machine Translation is the challenging problem for Indian languages. Every day we can see some machine translators being developed, but getting a high quality automatic translation is still a very distant dream . The correct translated sentence for Hindi language is rarely found. In this paper, we are emphasizing on English-Hindi language pair, so in order to preserve the correct MT output we pres… ▽ More

    Submitted 10 July, 2014; originally announced July 2014.

    Journal ref: International Journal on Computational Sciences & Applications (IJCSA) Vol.4, No.3, June 2014

  3. Shiva++: An Enhanced Graph based Ontology Matcher

    Authors: Iti Mathur, Nisheeth Joshi, Hemant Darbari, Ajai Kumar

    Abstract: With the web getting bigger and assimilating knowledge about different concepts and domains, it is becoming very difficult for simple database driven applications to capture the data for a domain. Thus developers have come out with ontology based systems which can store large amount of information and can apply reasoning and produce timely information. Thus facilitating effective knowledge managem… ▽ More

    Submitted 19 April, 2014; originally announced April 2014.

    Comments: arXiv admin note: text overlap with arXiv:1403.7465

    Journal ref: International Journal of Computer Applications 92(16):30-34, April 2014

  4. Shiva: A Framework for Graph Based Ontology Matching

    Authors: Iti Mathur, Nisheeth Joshi, Hemant Darbari, Ajai Kumar

    Abstract: Since long, corporations are looking for knowledge sources which can provide structured description of data and can focus on meaning and shared understanding. Structures which can facilitate open world assumptions and can be flexible enough to incorporate and recognize more than one name for an entity. A source whose major purpose is to facilitate human communication and interoperability. Clearly,… ▽ More

    Submitted 28 March, 2014; originally announced March 2014.

    Journal ref: International Journal of Computer Applications 89(11):30-34, March 2014

  5. arXiv:1312.7223  [pdf

    cs.CL

    Quality Estimation of English-Hindi Outputs using Naive Bayes Classifier

    Authors: Rashmi Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: In this paper we present an approach for estimating the quality of machine translation system. There are various methods for estimating the quality of output sentences, but in this paper we focus on Naïve Bayes classifier to build model using features which are extracted from the input sentences. These features are used for finding the likelihood of each of the sentences of the training data which… ▽ More

    Submitted 27 December, 2013; originally announced December 2013.

    Comments: In Proceedings of 2013 International Conference on Advances in Computing, Communications and Informatics

  6. Automatic Ranking of MT Outputs using Approximations

    Authors: Pooja Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: Since long, research on machine translation has been ongoing. Still, we do not get good translations from MT engines so developed. Manual ranking of these outputs tends to be very time consuming and expensive. Identifying which one is better or worse than the others is a very taxing task. In this paper, we show an approach which can provide automatic ranks to MT outputs (translations) taken from d… ▽ More

    Submitted 22 November, 2013; originally announced November 2013.

    Journal ref: International Journal of Computer Applications 81(17):27-31, November 2013

  7. HEVAL: Yet Another Human Evaluation Metric

    Authors: Nisheeth Joshi, Iti Mathur, Hemant Darbari, Ajai Kumar

    Abstract: Machine translation evaluation is a very important activity in machine translation development. Automatic evaluation metrics proposed in literature are inadequate as they require one or more human reference translations to compare them with output produced by machine translation. This does not always give accurate results as a text can have several different translations. Human evaluation metrics,… ▽ More

    Submitted 15 November, 2013; originally announced November 2013.

    Journal ref: International Journal on Natural Language Computing Vol. 2, No.5, November 2013

  8. arXiv:1310.0581  [pdf

    cs.CL

    Rule Based Stemmer in Urdu

    Authors: Vaishali Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: Urdu is a combination of several languages like Arabic, Hindi, English, Turkish, Sanskrit etc. It has a complex and rich morphology. This is the reason why not much work has been done in Urdu language processing. Stemming is used to convert a word into its respective root form. In stemming, we separate the suffix and prefix from the word. It is useful in search engines, natural language processing… ▽ More

    Submitted 2 October, 2013; originally announced October 2013.

    Comments: In Proceedings of 4th International Conference on Computer and Communication Technology

  9. arXiv:1310.0578  [pdf

    cs.CL

    Subjective and Objective Evaluation of English to Urdu Machine Translation

    Authors: Vaishali Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: Machine translation is research based area where evaluation is very important phenomenon for checking the quality of MT output. The work is based on the evaluation of English to Urdu Machine translation. In this research work we have evaluated the translation quality of Urdu language which has been translated by using different Machine Translation systems like Google, Babylon and Ijunoon. The eval… ▽ More

    Submitted 2 October, 2013; originally announced October 2013.

    Comments: In Proceedings of 2013 International Conference on Advances in Computing, Communications and Informatics

  10. arXiv:1310.0575  [pdf

    cs.CL

    Development of Marathi Part of Speech Tagger Using Statistical Approach

    Authors: Jyoti Singh, Nisheeth Joshi, Iti Mathur

    Abstract: Part-of-speech (POS) tagging is a process of assigning the words in a text corresponding to a particular part of speech. A fundamental version of POS tagging is the identification of words as nouns, verbs, adjectives etc. For processing natural languages, Part of Speech tagging is a prominent tool. It is one of the simplest as well as most constant and statistical model for many NLP applications.… ▽ More

    Submitted 9 October, 2013; v1 submitted 2 October, 2013; originally announced October 2013.

    Comments: In Proceedings of 2013 International Conference on Advances in Computing, Communications and Informatics

  11. arXiv:1310.0573  [pdf

    cs.CL

    Improving the Quality of MT Output using Novel Name Entity Translation Scheme

    Authors: Deepti Bhalla, Nisheeth Joshi, Iti Mathur

    Abstract: This paper presents a novel approach to machine translation by combining the state of art name entity translation scheme. Improper translation of name entities lapse the quality of machine translated output. In this work, name entities are transliterated by using statistical rule based approach. This paper describes the translation and transliteration of name entities from English to Punjabi. We h… ▽ More

    Submitted 2 October, 2013; originally announced October 2013.

    Comments: In Proceedings of 2013 International Conference on Advances in Computing, Communications and Informatics

  12. Analysing Quality of English-Hindi Machine Translation Engine Outputs Using Bayesian Classification

    Authors: Rashmi Gupta, Nisheeth Joshi, Iti Mathur

    Abstract: This paper considers the problem for estimating the quality of machine translation outputs which are independent of human intervention and are generally addressed using machine learning techniques.There are various measures through which a machine learns translations quality. Automatic Evaluation metrics produce good co-relation at corpus level but cannot produce the same results at the same segme… ▽ More

    Submitted 4 September, 2013; originally announced September 2013.

    Journal ref: International Journal of Artificial Intelligence & Applications (IJAIA), Vol. 4, No. 4, July 2013

  13. arXiv:1307.6163  [pdf

    cs.CL

    Human and Automatic Evaluation of English-Hindi Machine Translation

    Authors: Nisheeth Joshi, Hemant Darbari, Iti Mathur

    Abstract: For the past 60 years, Research in machine translation is going on. For the development in this field, a lot of new techniques are being developed each day. As a result, we have witnessed development of many automatic machine translators. A manager of machine translation development project needs to know the performance increase/decrease, after changes have been done in his system. Due to this rea… ▽ More

    Submitted 24 July, 2013; v1 submitted 23 July, 2013; originally announced July 2013.

    Comments: in Hindi, International Joint Rajbhasha Conference on Science and Technology, Oct 2013

  14. Rule Based Transliteration Scheme for English to Punjabi

    Authors: Deepti Bhalla, Nisheeth Joshi, Iti Mathur

    Abstract: Machine Transliteration has come out to be an emerging and a very important research area in the field of machine translation. Transliteration basically aims to preserve the phonological structure of words. Proper transliteration of name entities plays a very significant role in improving the quality of machine translation. In this paper we are doing machine transliteration for English-Punjabi lan… ▽ More

    Submitted 15 July, 2013; originally announced July 2013.

    Comments: International Journal on Natural Language Computing (IJNLC) Vol. 2, No.2, April 2013

  15. Part of Speech Tagging of Marathi Text Using Trigram Method

    Authors: Jyoti Singh, Nisheeth Joshi, Iti Mathur

    Abstract: In this paper we present a Marathi part of speech tagger. It is a morphologically rich language. It is spoken by the native people of Maharashtra. The general approach used for development of tagger is statistical using trigram Method. The main concept of trigram is to explore the most likely POS for a token based on given information of previous two tags by calculating probabilities to determine… ▽ More

    Submitted 15 July, 2013; originally announced July 2013.

    Comments: International Journal of Advanced Information Technology (IJAIT) Vol. 3, No.2, April2013

  16. Improving the quality of Gujarati-Hindi Machine Translation through part-of-speech tagging and stemmer-assisted transliteration

    Authors: Juhi Ameta, Nisheeth Joshi, Iti Mathur

    Abstract: Machine Translation for Indian languages is an emerging research area. Transliteration is one such module that we design while designing a translation system. Transliteration means mapping of source language text into the target language. Simple mapping decreases the efficiency of overall translation system. We propose the use of stemming and part-of-speech tagging for transliteration. The effecti… ▽ More

    Submitted 11 July, 2013; originally announced July 2013.

    Comments: 6 pages; June 2013, url-http://airccse.org/journal/ijnlc/papers/2313ijnlc05.pdf

  17. arXiv:1305.6211  [pdf

    cs.CL

    Development of a Hindi Lemmatizer

    Authors: Snigdha Paul, Nisheeth Joshi, Iti Mathur

    Abstract: We live in a translingual society, in order to communicate with people from different parts of the world we need to have an expertise in their respective languages. Learning all these languages is not at all possible; therefore we need a mechanism which can do this task for us. Machine translators have emerged as a tool which can perform this task. In order to develop a machine translator we need… ▽ More

    Submitted 15 July, 2013; v1 submitted 24 May, 2013; originally announced May 2013.

    Comments: International Journal of Computational Linguistics and Natural Language Processing, Vol 2, Issue 5, 2013

    Journal ref: International Journal of Computational Linguistics and Natural Language Processing, Vol 2, Issue 5, 2013

  18. arXiv:1210.7678  [pdf

    cs.OH

    Plagiarism Detection: Keeping Check on Misuse of Intellectual Property

    Authors: Iti Mathur, Nisheeth Joshi

    Abstract: Today, Plagiarism has become a menace. Every journal editor or conference organizers has to deal with this problem. Simply Copying or rephrasing of text without giving due credit to the original author has become more common. This is considered to be an Intellectual Property Theft. We are developing a Plagiarism Detection Tool which would deal with this problem. In this paper we discuss the common… ▽ More

    Submitted 19 October, 2012; originally announced October 2012.

    Comments: Proceedings of National Conference on Recent Advances in Computer Engineering, 2011

  19. arXiv:1210.5517  [pdf

    cs.CL

    Design of English-Hindi Translation Memory for Efficient Translation

    Authors: Nisheeth Joshi, Iti Mathur

    Abstract: Developing parallel corpora is an important and a difficult activity for Machine Translation. This requires manual annotation by Human Translators. Translating same text again is a useless activity. There are tools available to implement this for European Languages, but no such tool is available for Indian Languages. In this paper we present a tool for Indian Languages which not only provides auto… ▽ More

    Submitted 19 October, 2012; originally announced October 2012.

    Comments: Proceedings of National Conference in Recent Advances in Computer Engineering, 2011

  20. arXiv:1210.5486  [pdf

    cs.CL

    A Lightweight Stemmer for Gujarati

    Authors: Juhi Ameta, Nisheeth Joshi, Iti Mathur

    Abstract: Gujarati is a resource poor language with almost no language processing tools being available. In this paper we have shown an implementation of a rule based stemmer of Gujarati. We have shown the creation of rules for stemming and the richness in morphology that Gujarati possesses. We have also evaluated our results by verifying it with a human expert.

    Submitted 11 November, 2012; v1 submitted 19 October, 2012; originally announced October 2012.

    Comments: In Proceedings of 46th Annual Convention of Computer Society of India

  21. arXiv:1209.1301  [pdf

    cs.CL

    Evaluation of Computational Grammar Formalisms for Indian Languages

    Authors: Nisheeth Joshi, Iti Mathur

    Abstract: Natural Language Parsing has been the most prominent research area since the genesis of Natural Language Processing. Probabilistic Parsers are being developed to make the process of parser development much easier, accurate and fast. In Indian context, identification of which Computational Grammar Formalism is to be used is still a question which needs to be answered. In this paper we focus on this… ▽ More

    Submitted 18 August, 2012; originally announced September 2012.

    Comments: Proc. of International Conference in Computer Engineering and Technology, 2012, Organized by Jodhpur Institute of Engineering and Technology, Jodhpur. Sponsored by IEEE, USA and Institution of Engineers (India), Kolkatta

  22. arXiv:1209.1300  [pdf

    cs.CL

    Input Scheme for Hindi Using Phonetic Mapping

    Authors: Nisheeth Joshi, Iti Mathur

    Abstract: Written Communication on Computers requires knowledge of writing text for the desired language using Computer. Mostly people do not use any other language besides English. This creates a barrier. To resolve this issue we have developed a scheme to input text in Hindi using phonetic mapping scheme. Using this scheme we generate intermediate code strings and match them with pronunciations of input t… ▽ More

    Submitted 18 August, 2012; originally announced September 2012.

    Comments: Proceedings of National Conference on ICT: Theory, Practice and Applications. SPSU Press. Organized by Sir Padampat Singhania University, Udaipur. Sponsored by CSIR, New Delhi. March, 2010

  23. arXiv:1208.3802  [pdf

    cs.AI

    OntoAna: Domain Ontology for Human Anatomy

    Authors: Archana Vashisth, Iti Mathur, Nisheeth Joshi

    Abstract: Today, we can find many search engines which provide us with information which is more operational in nature. None of the search engines provide domain specific information. This becomes very troublesome to a novice user who wishes to have information in a particular domain. In this paper, we have developed an ontology which can be used by a domain specific search engine. We have developed an onto… ▽ More

    Submitted 18 August, 2012; originally announced August 2012.

    Comments: Proceedings of 5th CSI National Conference on Education and Research. Organized by Lingayay University, Faridabad. Sponsored by Computer Society of India and IEEE Delhi Chapter. Proceedings published by Lingayay University Press