Skip to main content

Showing 1–5 of 5 results for author: Alnuhait, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.02899  [pdf, ps, other

    cs.CL

    FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs

    Authors: Deema Alnuhait, Neeraja Kirtane, Muhammad Khalifa, Hao Peng

    Abstract: Language models (LMs) hallucinate. We inquire: Can we detect and mitigate hallucinations before they happen? This work answers this research question in the positive, by showing that the internal representations of LMs provide rich signals that can be used for this purpose. We introduce FactCheckmate, which preemptively detects hallucinations by learning a classifier that predicts whether the LM w… ▽ More

    Submitted 24 June, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

  2. arXiv:2403.09017  [pdf, other

    cs.CL

    AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic

    Authors: Emad A. Alghamdi, Reem I. Masoud, Deema Alnuhait, Afnan Y. Alomairi, Ahmed Ashraf, Mohamed Zaytoon

    Abstract: The swift progress and widespread acceptance of artificial intelligence (AI) systems highlight a pressing requirement to comprehend both the capabilities and potential risks associated with AI. Given the linguistic complexity, cultural richness, and underrepresented status of Arabic in AI research, there is a pressing need to focus on Large Language Models (LLMs) performance and safety for Arabic-… ▽ More

    Submitted 4 November, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  3. arXiv:2402.03177  [pdf, other

    cs.CL cs.LG

    CIDAR: Culturally Relevant Instruction Dataset For Arabic

    Authors: Zaid Alyafeai, Khalid Almubarak, Ahmed Ashraf, Deema Alnuhait, Saied Alshahrani, Gubran A. Q. Abdulrahman, Gamil Ahmed, Qais Gawah, Zead Saleh, Mustafa Ghaleb, Yousef Ali, Maged S. Al-Shaibani

    Abstract: Instruction tuning has emerged as a prominent methodology for teaching Large Language Models (LLMs) to follow instructions. However, current instruction datasets predominantly cater to English or are derived from English-dominated LLMs, resulting in inherent biases toward Western culture. This bias significantly impacts the linguistic structures of non-English languages such as Arabic, which has a… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2305.13710  [pdf, other

    cs.CL

    Using Textual Interface to Align External Knowledge for End-to-End Task-Oriented Dialogue Systems

    Authors: Qingyang Wu, Deema Alnuhait, Derek Chen, Zhou Yu

    Abstract: Traditional end-to-end task-oriented dialogue systems have been built with a modularized design. However, such design often causes misalignment between the agent response and external knowledge, due to inadequate representation of information. Furthermore, its evaluation metrics emphasize assessing the agent's pre-lexicalization response, neglecting the quality of the completed response. In this w… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2303.07316  [pdf, other

    cs.CL cs.AI

    FaceChat: An Emotion-Aware Face-to-face Dialogue Framework

    Authors: Deema Alnuhait, Qingyang Wu, Zhou Yu

    Abstract: While current dialogue systems like ChatGPT have made significant advancements in text-based interactions, they often overlook the potential of other modalities in enhancing the overall user experience. We present FaceChat, a web-based dialogue framework that enables emotionally-sensitive and face-to-face conversations. By seamlessly integrating cutting-edge technologies in natural language proces… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.