Skip to main content

Showing 1–6 of 6 results for author: Chollampatt, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01853  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

    Authors: Sathish Reddy Indurthi, Wenxuan Zhou, Shamil Chollampatt, Ravi Agrawal, Kaiqiang Song, Lingxiao Zhao, Chenguang Zhu

    Abstract: Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages. Traditional methods for creating multilingual IFT datasets such as translating existing English IFT datasets or converting existing NLP datasets into IFT dataset… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2009.14395  [pdf, other

    cs.CL

    Can Automatic Post-Editing Improve NMT?

    Authors: Shamil Chollampatt, Raymond Hendy Susanto, Liling Tan, Ewa Szymanska

    Abstract: Automatic post-editing (APE) aims to improve machine translations, thereby reducing human post-editing effort. APE has had notable success when used with statistical machine translation (SMT) systems but has not been as successful over neural machine translation (NMT) systems. This has raised questions on the relevance of APE task in the current scenario. However, the training of APE models has be… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: In EMNLP 2020

  3. arXiv:2004.12681  [pdf, other

    cs.CL

    Lexically Constrained Neural Machine Translation with Levenshtein Transformer

    Authors: Raymond Hendy Susanto, Shamil Chollampatt, Liling Tan

    Abstract: This paper proposes a simple and effective algorithm for incorporating lexical constraints in neural machine translation. Previous work either required re-training existing models with the lexical constraints or incorporating them during beam search decoding with significantly higher computational overheads. Leveraging the flexibility and speed of a recently proposed Levenshtein Transformer model… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 8 pages, In Proceedings of ACL 2020

  4. arXiv:1801.08831  [pdf, other

    cs.CL

    A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

    Authors: Shamil Chollampatt, Hwee Tou Ng

    Abstract: We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural… ▽ More

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: 8 pages, 3 figures, In Proceedings of AAAI 2018

  5. arXiv:1606.00210  [pdf, other

    cs.CL

    Exploiting N-Best Hypotheses to Improve an SMT Approach to Grammatical Error Correction

    Authors: Duc Tam Hoang, Shamil Chollampatt, Hwee Tou Ng

    Abstract: Grammatical error correction (GEC) is the task of detecting and correcting grammatical errors in texts written by second language learners. The statistical machine translation (SMT) approach to GEC, in which sentences written by second language learners are translated to grammatically correct sentences, has achieved state-of-the-art accuracy. However, the SMT approach is unable to utilize global c… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: Accepted for presentation at IJCAI-16

  6. arXiv:1606.00189  [pdf, other

    cs.CL

    Neural Network Translation Models for Grammatical Error Correction

    Authors: Shamil Chollampatt, Kaveh Taghipour, Hwee Tou Ng

    Abstract: Phrase-based statistical machine translation (SMT) systems have previously been used for the task of grammatical error correction (GEC) to achieve state-of-the-art accuracy. The superiority of SMT systems comes from their ability to learn text transformations from erroneous to corrected text, without explicitly modeling error types. However, phrase-based SMT systems suffer from limitations of disc… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: Accepted for presentation at IJCAI-16