Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

Kwon, Sang Yun; Bhatia, Gagan; Nagoudi, El Moatez Billah; Abdul-Mageed, Muhammad

Computer Science > Computation and Language

arXiv:2312.08400 (cs)

[Submitted on 13 Dec 2023]

Title:Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

Authors:Sang Yun Kwon, Gagan Bhatia, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

View PDF HTML (experimental)

Abstract:Large language models (LLMs) finetuned to follow human instruction have recently exhibited significant capabilities in various English NLP tasks. However, their performance in grammatical error correction (GEC), especially on languages other than English, remains significantly unexplored. In this work, we evaluate the abilities of instruction finetuned LLMs in Arabic GEC, a complex task due to Arabic's rich morphology. Our findings suggest that various prompting methods, coupled with (in-context) few-shot learning, demonstrate considerable effectiveness, with GPT-4 achieving up to $65.49$ F$_{1}$ score under expert prompting (approximately $5$ points higher than our established baseline). Despite these positive results, we find that instruction finetuned models, regardless of their size, are still outperformed by fully finetuned ones, even if they are significantly smaller in size. This disparity highlights substantial room for improvements for LLMs. Inspired by methods used in low-resource machine translation, we also develop a method exploiting synthetic data that significantly outperforms previous models on two standard Arabic benchmarks. Our best model achieves a new SOTA on Arabic GEC, with $73.29$ and $73.26$ F$_{1}$ on the 2014 and 2015 QALB datasets, respectively, compared to peer-reviewed published baselines.

Comments:	arXiv admin note: text overlap with arXiv:2308.04492
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2312.08400 [cs.CL]
	(or arXiv:2312.08400v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.08400

Submission history

From: Sang Yun Kwon [view email]
[v1] Wed, 13 Dec 2023 05:33:25 UTC (11,474 KB)

Computer Science > Computation and Language

Title:Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators