Skip to main content

Showing 1–2 of 2 results for author: Alzahrani, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.15145  [pdf, other

    cs.CR

    PromptShield: Deployable Detection for Prompt Injection Attacks

    Authors: Dennis Jacob, Hend Alzahrani, Zhanhao Hu, Basel Alomair, David Wagner

    Abstract: Application designers have moved to integrate large language models (LLMs) into their products. However, many LLM-integrated applications are vulnerable to prompt injections. While attempts have been made to address this problem by building prompt injection detectors, many are not yet suitable for practical deployment. To support research in this area, we introduce PromptShield, a benchmark for tr… ▽ More

    Submitted 11 April, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: ACM CODASPY 2025; extended technical report

  2. arXiv:2501.03491  [pdf, ps, other

    cs.CL cs.AI

    Can LLMs Ask Good Questions?

    Authors: Yueheng Zhang, Xiaoyuan Liu, Yiyou Sun, Atheer Alharbi, Hend Alzahrani, Tianneng Shi, Basel Alomair, Dawn Song

    Abstract: We evaluate questions generated by large language models (LLMs) from context, comparing them to human-authored questions across six dimensions: question type, question length, context coverage, answerability, uncommonness, and required answer length. Our study spans two open-source and two proprietary state-of-the-art models. Results reveal that LLM-generated questions tend to demand longer descri… ▽ More

    Submitted 17 June, 2025; v1 submitted 6 January, 2025; originally announced January 2025.