Skip to main content

Showing 1–1 of 1 results for author: Hanmatheekuna, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.17145  [pdf, other

    cs.CL cs.AI cs.LG

    Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?

    Authors: Jirat Chiaranaipanich, Naiyarat Hanmatheekuna, Jitkapat Sawatphol, Krittamate Tiankanon, Jiramet Kinchagawat, Amrest Chinkamol, Parinthapat Pengpun, Piyalitt Ittichaiwong, Peerat Limkonchotiwat

    Abstract: Large language models (LLMs) perform well on common tasks but struggle with generalization in low-resource and low-computation settings. We examine this limitation by testing various LLMs and specialized translation models on English-Thai machine translation and code-switching datasets. Our findings reveal that under more strict computational constraints, such as 4-bit quantization, LLMs fail to t… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted in GenBench EMNLP 2024