Skip to main content

Showing 1–4 of 4 results for author: Tuffaha, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.05612  [pdf, other

    cs.CL cs.LG

    Multi-Dialect Arabic BERT for Country-Level Dialect Identification

    Authors: Bashar Talafha, Mohammad Ali, Muhy Eddin Za'ter, Haitham Seelawi, Ibraheem Tuffaha, Mostafa Samir, Wael Farhan, Hussein T. Al-Natsheh

    Abstract: Arabic dialect identification is a complex problem for a number of inherent properties of the language itself. In this paper, we present the experiments conducted, and the models developed by our competing team, Mawdoo3 AI, along the way to achieving our winning solution to subtask 1 of the Nuanced Arabic Dialect Identification (NADI) shared task. The dialect identification subtask provides 21,000… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted at the Fifth Arabic Natural Language Processing Workshop (WANLP2020) co-located with the 28th International Conference on Computational Linguistics (COLING'2020), Barcelona, Spain, 12 Dec. 2020

  2. arXiv:1912.12514  [pdf, other

    cs.CL cs.LG

    Tha3aroon at NSURL-2019 Task 8: Semantic Question Similarity in Arabic

    Authors: Ali Fadel, Ibraheem Tuffaha, Mahmoud Al-Ayyoub

    Abstract: In this paper, we describe our team's effort on the semantic text question similarity task of NSURL 2019. Our top performing system utilizes several innovative data augmentation techniques to enlarge the training data. Then, it takes ELMo pre-trained contextual embeddings of the data and feeds them into an ON-LSTM network with self-attention. This results in sequence representation vectors that ar… ▽ More

    Submitted 28 December, 2019; originally announced December 2019.

    Comments: 8 pages, 8 figures, 4 tables

  3. Neural Arabic Text Diacritization: State of the Art Results and a Novel Approach for Machine Translation

    Authors: Ali Fadel, Ibraheem Tuffaha, Bara' Al-Jawarneh, Mahmoud Al-Ayyoub

    Abstract: In this work, we present several deep learning models for the automatic diacritization of Arabic text. Our models are built using two main approaches, viz. Feed-Forward Neural Network (FFNN) and Recurrent Neural Network (RNN), with several enhancements such as 100-hot encoding, embeddings, Conditional Random Field (CRF) and Block-Normalized Gradient (BNG). The models are tested on the only freely… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 18 pages, 17 figures, 14 tables

  4. arXiv:1905.01965  [pdf, other

    cs.CL cs.LG

    Arabic Text Diacritization Using Deep Neural Networks

    Authors: Ali Fadel, Ibraheem Tuffaha, Bara' Al-Jawarneh, Mahmoud Al-Ayyoub

    Abstract: Diacritization of Arabic text is both an interesting and a challenging problem at the same time with various applications ranging from speech synthesis to helping students learning the Arabic language. Like many other tasks or problems in Arabic language processing, the weak efforts invested into this problem and the lack of available (open-source) resources hinder the progress towards solving thi… ▽ More

    Submitted 25 April, 2019; originally announced May 2019.

    Comments: 7 pages, 4 figures, 15 tables