Skip to main content

Showing 1–1 of 1 results for author: Ümütlü, E E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.09714  [pdf, other

    cs.CL cs.AI

    Evaluating the Quality of Benchmark Datasets for Low-Resource Languages: A Case Study on Turkish

    Authors: Ayşe Aysu Cengiz, Ahmet Kaan Sever, Elif Ecem Ümütlü, Naime Şeyma Erdem, Burak Aytan, Büşra Tufan, Abdullah Topraksoy, Esra Darıcı, Cagri Toraman

    Abstract: The reliance on translated or adapted datasets from English or multilingual resources introduces challenges regarding linguistic and cultural suitability. This study addresses the need for robust and culturally appropriate benchmarks by evaluating the quality of 17 commonly used Turkish benchmark datasets. Using a comprehensive framework that assesses six criteria, both human and LLM-judge annotat… ▽ More

    Submitted 26 April, 2025; v1 submitted 13 April, 2025; originally announced April 2025.