JASMINE: Arabic GPT Models for Few-Shot Learning

Nagoudi, El Moatez Billah; Abdul-Mageed, Muhammad; Elmadany, AbdelRahim; Inciarte, Alcides Alcoba; Khondaker, Md Tawkat Islam

Computer Science > Computation and Language

arXiv:2212.10755 (cs)

[Submitted on 21 Dec 2022 (v1), last revised 24 Oct 2023 (this version, v2)]

Title:JASMINE: Arabic GPT Models for Few-Shot Learning

Authors:El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, AbdelRahim Elmadany, Alcides Alcoba Inciarte, Md Tawkat Islam Khondaker

View PDF

Abstract:Scholarship on generative pretraining (GPT) remains acutely Anglocentric, leaving serious gaps in our understanding of the whole class of autoregressive models. For example, we have little knowledge about the potential of these models and their societal impacts in diverse linguistic and cultural settings. We alleviate this issue for Arabic, a wide collection of languages and dialectal varieties with more than 400 million population, by introducing JASMINE. JASMINE is a suite of powerful Arabic autoregressive Transformer language models ranging in size between 300 million-6.7 billion parameters pretrained on a large and diverse dataset (~ 235 GB of text). We also carefully design and release a comprehensive benchmark for both automated and human evaluation of Arabic autoregressive models, with coverage of potential social biases, harms, and toxicity. Using our novel benchmark, we evaluate JASMINE extensively showing powerful performance intrinsically as well as in few-shot learning on a wide range of NLP tasks. We aim to responsibly release our models and evaluation benchmark with interested researchers, along with code for experimenting with them.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.10755 [cs.CL]
	(or arXiv:2212.10755v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10755

Submission history

From: Abdelrahim Elmadany [view email]
[v1] Wed, 21 Dec 2022 04:21:46 UTC (848 KB)
[v2] Tue, 24 Oct 2023 21:03:30 UTC (1,019 KB)

Computer Science > Computation and Language

Title:JASMINE: Arabic GPT Models for Few-Shot Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:JASMINE: Arabic GPT Models for Few-Shot Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators