Skip to main content

Showing 1–2 of 2 results for author: Mavi, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.06391  [pdf, ps, other

    cs.CY cs.AI cs.CL

    From Rogue to Safe AI: The Role of Explicit Refusals in Aligning LLMs with International Humanitarian Law

    Authors: John Mavi, Diana Teodora Găitan, Sergio Coronado

    Abstract: Large Language Models (LLMs) are widely used across sectors, yet their alignment with International Humanitarian Law (IHL) is not well understood. This study evaluates eight leading LLMs on their ability to refuse prompts that explicitly violate these legal frameworks, focusing also on helpfulness - how clearly and constructively refusals are communicated. While most models rejected unlawful reque… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2410.16285  [pdf, other

    cs.CY cs.AI cs.CL

    Assessing the Performance of Human-Capable LLMs -- Are LLMs Coming for Your Job?

    Authors: John Mavi, Nathan Summers, Sergio Coronado

    Abstract: The current paper presents the development and validation of SelfScore, a novel benchmark designed to assess the performance of automated Large Language Model (LLM) agents on help desk and professional consultation tasks. Given the increasing integration of AI in industries, particularly within customer service, SelfScore fills a crucial gap by enabling the comparison of automated agents and human… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.