Showing 1–1 of 1 results for author: Mathebula, M

  1. arXiv:2409.00626

    cs.CL

    Correcting FLORES Evaluation Dataset for Four African Languages

    Authors: Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S. Mbooi, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Neo Putini, Miehleketo Mathebula, Matimba Shingange, Tajuddeen Gwadabe, Vukosi Marivate

    Abstract: This paper describes the corrections made to the FLORES evaluation (dev and devtest) dataset for four African languages, namely Hausa, Northern Sotho (Sepedi), Xitsonga, and isiZulu. The original dataset, though groundbreaking in its coverage of low-resource languages, exhibited various inconsistencies and inaccuracies in the reviewed languages that could potentially hinder the integrity of the ev…

    Submitted 5 October, 2024; v1 submitted 1 September, 2024; originally announced September 2024.