Skip to main content

Showing 1–15 of 15 results for author: Oflazer, K

.
  1. arXiv:2407.01360  [pdf, other

    cs.CL

    Nullpointer at ArAIEval Shared Task: Arabic Propagandist Technique Detection with Token-to-Word Mapping in Sequence Tagging

    Authors: Abrar Abir, Kemal Oflazer

    Abstract: This paper investigates the optimization of propaganda technique detection in Arabic text, including tweets \& news paragraphs, from ArAIEval shared task 1. Our approach involves fine-tuning the AraBERT v2 model with a neural network classifier for sequence tagging. Experimental results show relying on the first token of the word for technique prediction produces the best performance. In addition,… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: To appear in proceedings of 2024 Arabic NLP Conference

  2. arXiv:2310.15113  [pdf

    cs.CL

    Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

    Authors: Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schütze, Kemal Oflazer, David R. Mortensen

    Abstract: Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities of the latest generation of LLMs, and those studies that do exist (i) ignore the remarkable ability of humans to generalize, (ii) focus only on English, and (i… ▽ More

    Submitted 26 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  3. arXiv:1810.04216  [pdf, other

    cs.CL

    Event Coreference Resolution Using Neural Network Classifiers

    Authors: Arun Pandian, Lamana Mulaffer, Kemal Oflazer, Amna AlZeyara

    Abstract: This paper presents a neural network classifier approach to detecting both within- and cross- document event coreference effectively using only event mention based features. Our approach does not (yet) rely on any event argument features such as semantic roles or spatiotemporal arguments. Experimental results on the ECB+ dataset show that our approach produces F1 scores that significantly outperfo… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

  4. arXiv:1808.08392  [pdf, other

    cs.CL

    MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction

    Authors: Ossama Obeid, Salam Khalifa, Nizar Habash, Houda Bouamor, Wajdi Zaghouani, Kemal Oflazer

    Abstract: In this paper, we introduce MADARi, a joint morphological annotation and spelling correction system for texts in Standard and Dialectal Arabic. The MADARi framework provides intuitive interfaces for annotating text and managing the annotation process of a large number of sizable documents. Morphological annotation includes indicating, for a word, in context, its baseword, clitics, part-of-speech,… ▽ More

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

  5. Morphological Disambiguation by Voting Constraints

    Authors: Kemal Oflazer, Gokhan Tur

    Abstract: We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses, and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence, and hence relieves the… ▽ More

    Submitted 25 April, 1997; originally announced April 1997.

    Comments: 8 pages, Latex source. To appear in Proceedings of ACL/EACL'97 Compressed postscript also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/acl97.ps.z

  6. arXiv:cmp-lg/9605008  [pdf, ps

    cs.CL

    Tactical Generation in a Free Constituent Order Language

    Authors: Dilek Zeynep Hakkani, Kemal Oflazer, Ilyas Cicekli

    Abstract: This paper describes tactical generation in Turkish, a free constituent order language, in which the order of the constituents may change according to the information structure of the sentences to be generated. In the absence of any information regarding the information structure of a sentence (i.e., topic, focus, background, etc.), the constituents of the sentence obey a default order, but the… ▽ More

    Submitted 5 May, 1996; originally announced May 1996.

    Comments: gzipped, uuencoded postscript file

    Journal ref: Proceedings of 1996 International Workshop on Natural Language Generation

  7. arXiv:cmp-lg/9604003  [pdf, ps

    cs.CL

    Error-tolerant Tree Matching

    Authors: Kemal Oflazer

    Abstract: This paper presents an efficient algorithm for retrieving from a database of trees, all trees that match a given query tree approximately, that is, within a certain error tolerance. It has natural language processing applications in searching for matches in example-based translation systems, and retrieval from lexical databases containing entries of complex feature structures. The algorithm has… ▽ More

    Submitted 17 April, 1996; v1 submitted 11 April, 1996; originally announced April 1996.

    Comments: gzipped and uuencoded postscript, 5 pages. Minor fix in one of the figures. Also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/coling96-ettm.ps.z

  8. arXiv:cmp-lg/9604002  [pdf, ps

    cs.CL

    A Constraint-based Case Frame Lexicon

    Authors: Kemal Oflazer, Okan Yilmaz

    Abstract: We present a constraint-based case frame lexicon architecture for bi-directional mapping between a syntactic case frame and a semantic frame. The lexicon uses a semantic sense as the basic unit and employs a multi-tiered constraint structure for the resolution of syntactic information into the appropriate senses and/or idiomatic usage. Valency changing transformations such as morphologically mar… ▽ More

    Submitted 11 April, 1996; originally announced April 1996.

    Comments: gzipped, uuencoded postscript, 6 pages. Also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/coling96-ccl.ps.z ; To Appear in Proceedings of COLING 96, Copenhaged, Denmark, August 1996

  9. arXiv:cmp-lg/9604001  [pdf, ps

    cs.CL

    Combining Hand-crafted Rules and Unsupervised Learning in Constraint-based Morphological Disambiguation

    Authors: Kemal Oflazer, Gokhan Tur

    Abstract: This paper presents a constraint-based morphological disambiguation approach that is applicable languages with complex morphology--specifically agglutinative languages with productive inflectional and derivational morphological phenomena. In certain respects, our approach has been motivated by Brill's recent work, but with the observation that his transformational approach is not directly applic… ▽ More

    Submitted 12 April, 1996; v1 submitted 11 April, 1996; originally announced April 1996.

    Comments: gzipped and uuencoded postscript, 13 pages. Also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/emnlp.ps.z

  10. arXiv:cmp-lg/9507008  [pdf, ps

    cs.CL

    A Constraint-based Case Frame Lexicon Architecture

    Authors: Kemal Oflazer, Okan Yilmaz

    Abstract: In Turkish, (and possibly in many other languages) verbs often convey several meanings (some totally unrelated) when they are used with subjects, objects, oblique objects, adverbial adjuncts, with certain lexical, morphological, and semantic features, and co-occurrence restrictions. In addition to the usual sense variations due to selectional restrictions on verbal arguments, in most cases, the… ▽ More

    Submitted 21 July, 1995; originally announced July 1995.

    Comments: gzipped, uuencoded postscipt file, 11 pages. To be presented at the ESSLLI Workshop -- The Computational Lexicon. Also available as ftp://ftp.cs.bilkent.edu.tr/pub/tech-reports/1995/BU-CEIS-9511.ps.z

  11. arXiv:cmp-lg/9504031  [pdf, ps

    cs.CL

    Error-tolerant Finite State Recognition with Applications to Morphological Analysis and Spelling Correction

    Authors: Kemal Oflazer

    Abstract: Error-tolerant recognition enables the recognition of strings that deviate mildly from any string in the regular set recognized by the underlying finite state recognizer. Such recognition has applications in error-tolerant morphological processing, spelling correction, and approximate string matching in information retrieval. After a description of the concepts and algorithms involved, we give e… ▽ More

    Submitted 21 July, 1995; v1 submitted 28 April, 1995; originally announced April 1995.

    Comments: Replaces 9504031. gzipped, uuencoded postscript file. To appear in Computational Linguistics Volume 22 No:1, 1996, Also available as ftp://ftp.cs.bilkent.edu.tr/pub/ko/clpaper9512.ps.z

  12. arXiv:cmp-lg/9503001  [pdf, ps

    cs.CL

    Using a Corpus for Teaching Turkish Morphology

    Authors: H. Altay Guvenir, Kemal Oflazer

    Abstract: This paper reports on the preliminary phase of our ongoing research towards developing an intelligent tutoring environment for Turkish grammar. One of the components of this environment is a corpus search tool which, among other aspects of the language, will be used to present the learner sample sentences along with their morphological analyses. Following a brief introduction to the Turkish langua… ▽ More

    Submitted 1 March, 1995; originally announced March 1995.

    Comments: uuencoded gzip'ed postscript file. Appeared in Proceedings of TWLT-7, University of Twente, The Netherlands, June 1994. Software described is available at ftp://ftp.cs.bilkent.edu.tr/pub/Turklang/corpus-search/

    Report number: Bilkent University CS Dept Tech Report BU-CEIS-9423

  13. arXiv:cmp-lg/9410004  [pdf, ps

    cs.CL

    Spelling Correction in Agglutinative Languages

    Authors: Kemal Oflazer

    Abstract: This paper presents an approach to spelling correction in agglutinative languages that is based on two-level morphology and a dynamic programming based search algorithm. Spelling correction in agglutinative languages is significantly different than in languages like English. The concept of a word in such languages is much wider that the entries found in a dictionary, owing to {}~productive word… ▽ More

    Submitted 7 October, 1994; v1 submitted 6 October, 1994; originally announced October 1994.

    Comments: uuencoded postscript file, poster version to appear in ANLP proceedings. (Abstract now fixed)

  14. arXiv:cmp-lg/9407026  [pdf, ps

    cs.CL

    Tagging and Morphological Disambiguation of Turkish Text

    Authors: Kemal Oflazer, Ilker Kuruoz

    Abstract: Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes a… ▽ More

    Submitted 29 July, 1994; originally announced July 1994.

    Comments: To appear in Proceedings of 4th ACL-ANLP Conf. uuencoded gzip'ed postscript file, 6 pages

    Report number: Bilkent University CS Dept. Tech Report NO: BU-CEIS-9416

  15. arXiv:cmp-lg/9406008  [pdf, ps

    cs.CL

    Parsing Turkish with the Lexical Functional Grammar Formalism

    Authors: Zelal Gungordu, Kemal Oflazer

    Abstract: This paper describes our work on parsing Turkish using the lexical-functional grammar formalism. This work represents the first significant effort for parsing Turkish. Our implementation is based on Tomita's parser developed at Carnegie-Mellon University Center for Machine Translation. The grammar covers a substantial subset of Turkish including simple and complex sentences, and deals with a rea… ▽ More

    Submitted 2 June, 1994; originally announced June 1994.

    Comments: 7 pages, Postscript (compressed (gzip) and uuencoded)

    Report number: (BU-CEIS-9402 Bilkent University CS Dept Tech Report)

    Journal ref: Proceedings of COLING'94