Skip to main content

Showing 1–1 of 1 results for author: Khassib, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.21979  [pdf, ps, other

    cs.CL

    Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset

    Authors: Fakhraddin Alwajih, Samar Mohamed Magdy, Abdellah El Mekki, Omer Nacar, Youssef Nafea, Safaa Taher Abdelfadil, Abdulfattah Mohammed Yahya, Hamzah Luqman, Nada Almarwani, Samah Aloufi, Baraah Qawasmeh, Houdaifa Atou, Serry Sibaee, Hamzah A. Alsayadi, Walid Al-Dhabyani, Maged S. Al-shaibani, Aya El aatar, Nour Qandos, Rahaf Alhamouri, Samar Ahmad, Razan Khassib, Lina Hamad, Mohammed Anwar AL-Ghrawi, Fatimah Alshamari, Cheikh Malainine , et al. (20 additional authors not shown)

    Abstract: Mainstream large vision-language models (LVLMs) inherently encode cultural biases, highlighting the need for diverse multimodal datasets. To address this gap, we introduce Pearl, a large-scale Arabic multimodal dataset and benchmark explicitly designed for cultural understanding. Constructed through advanced agentic workflows and extensive human-in-the-loop annotations by 45 annotators from across… ▽ More

    Submitted 22 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: https://github.com/UBC-NLP/pearl