Skip to main content

Showing 1–3 of 3 results for author: Oo, T M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06753  [pdf

    cs.CL cs.AI

    KAConvText: Novel Approach to Burmese Sentence Classification using Kolmogorov-Arnold Convolution

    Authors: Ye Kyaw Thu, Thura Aung, Thazin Myint Oo, Thepchai Supnithi

    Abstract: This paper presents the first application of Kolmogorov-Arnold Convolution for Text (KAConvText) in sentence classification, addressing three tasks: imbalanced binary hate speech detection, balanced multiclass news classification, and imbalanced multiclass ethnic language identification. We investigate various embedding configurations, comparing random to fastText embeddings in both static and fin… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 10 pages, 3 figures, 4 tables

    ACM Class: I.2.7; I.2.6

  2. arXiv:2505.11008  [pdf

    cs.CL cs.LG

    Reconstructing Syllable Sequences in Abugida Scripts with Incomplete Inputs

    Authors: Ye Kyaw Thu, Thazin Myint Oo

    Abstract: This paper explores syllable sequence prediction in Abugida languages using Transformer-based models, focusing on six languages: Bengali, Hindi, Khmer, Lao, Myanmar, and Thai, from the Asian Language Treebank (ALT) dataset. We investigate the reconstruction of complete syllable sequences from various incomplete input types, including consonant sequences, vowel sequences, partial syllables (with ra… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 14 pages, 2 figures, 6 tables, 1 listing

    ACM Class: I.2.7

  3. arXiv:2504.04038  [pdf

    cs.CL

    myNER: Contextualized Burmese Named Entity Recognition with Bidirectional LSTM and fastText Embeddings via Joint Training with POS Tagging

    Authors: Kaung Lwin Thant, Kwankamol Nongpong, Ye Kyaw Thu, Thura Aung, Khaing Hsu Wai, Thazin Myint Oo

    Abstract: Named Entity Recognition (NER) involves identifying and categorizing named entities within textual data. Despite its significance, NER research has often overlooked low-resource languages like Myanmar (Burmese), primarily due to the lack of publicly available annotated datasets. To address this, we introduce myNER, a novel word-level NER corpus featuring a 7-tag annotation scheme, enriched with Pa… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 7 pages, 2 figures, 5 tables, to be published in the proceedings of IEEE ICCI-2025

    ACM Class: I.2.7