Skip to main content

Showing 1–1 of 1 results for author: Hammarström, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.03071  [pdf

    cs.CL

    Urdu Morphology, Orthography and Lexicon Extraction

    Authors: Muhammad Humayoun, Harald Hammarström, Aarne Ranta

    Abstract: Urdu is a challenging language because of, first, its Perso-Arabic script and second, its morphological system having inherent grammatical forms and vocabulary of Arabic, Persian and the native languages of South Asia. This paper describes an implementation of the Urdu language as a software API, and we deal with orthography, morphology and the extraction of the lexicon. The morphology is implemen… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Published in CAASL-2: The Second Workshop on Computational Approaches to Arabic Script-based Languages, July 21-22, 2007, LSA 2007 Linguistic Institute, Stanford University