Showing 1–1 of 1 results for author: Nekoto, W N

  1. arXiv:2502.15916

    cs.CL

    The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages

    Authors: Jenalea Rajab, Anuoluwapo Aremu, Everlyn Asiko Chimoto, Dale Dunbar, Graham Morrissey, Fadel Thior, Luandrie Potgieter, Jessico Ojo, Atnafu Lambebo Tonja, Maushami Chetty, Wilhelmina NdapewaOnyothi Nekoto, Pelonomi Moiloa, Jade Abbott, Vukosi Marivate, Benjamin Rosman

    Abstract: This paper presents the Esethu Framework, a sustainable data curation framework specifically designed to empower local communities and ensure equitable benefit-sharing from their linguistic resource. This framework is supported by the Esethu license, a novel community-centric data license. As a proof of concept, we introduce the Vuk'uzenzele isiXhosa Speech Dataset (ViXSD), an open-source corpus d…

    Submitted 12 June, 2025; v1 submitted 21 February, 2025; originally announced February 2025.