Skip to main content

Showing 1–1 of 1 results for author: Flores, J A N

.
  1. arXiv:2409.11445  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Jailbreaking Large Language Models with Symbolic Mathematics

    Authors: Emet Bethany, Mazal Bethany, Juan Arturo Nolazco Flores, Sumit Kumar Jha, Peyman Najafirad

    Abstract: Recent advancements in AI safety have led to increased efforts in training and red-teaming large language models (LLMs) to mitigate unsafe content generation. However, these safety mechanisms may not be comprehensive, leaving potential vulnerabilities unexplored. This paper introduces MathPrompt, a novel jailbreaking technique that exploits LLMs' advanced capabilities in symbolic mathematics to by… ▽ More

    Submitted 5 November, 2024; v1 submitted 16 September, 2024; originally announced September 2024.