Skip to main content

Showing 1–4 of 4 results for author: Nakash, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.09600  [pdf, ps, other

    cs.MA cs.AI cs.CL cs.CR

    Effective Red-Teaming of Policy-Adherent Agents

    Authors: Itay Nakash, George Kour, Koren Lazar, Matan Vetzler, Guy Uziel, Ateret Anaby-Tavor

    Abstract: Task-oriented LLM-based agents are increasingly used in domains with strict policies, such as refund eligibility or cancellation rules. The challenge lies in ensuring that the agent consistently adheres to these rules and policies, appropriately refusing any request that would violate them, while still maintaining a helpful and natural interaction. This calls for the development of tailored design… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2505.19621  [pdf, ps, other

    cs.AI cs.CL

    Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models

    Authors: George Kour, Itay Nakash, Ateret Anaby-Tavor, Michal Shmueli-Scheuer

    Abstract: As Large Language Models (LLMs) become deeply integrated into human life and increasingly influence decision-making, it's crucial to evaluate whether and to what extent they exhibit subjective preferences, opinions, and beliefs. These tendencies may stem from biases within the models, which may shape their behavior, influence the advice and recommendations they offer to users, and potentially rein… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2503.19693  [pdf, other

    cs.CL

    AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

    Authors: Itay Nakash, Nitay Calderon, Eyal Ben David, Elad Hoffer, Roi Reichart

    Abstract: Large Language Models (LLMs) have shown impressive versatility as general purpose models. However, their broad applicability comes at a high-cost computational overhead, particularly in auto-regressive decoding where each step requires a forward pass. In domain-specific settings, general-purpose capabilities are unnecessary and can be exchanged for efficiency. In this work, we take a novel perspec… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  4. arXiv:2410.16950  [pdf, other

    cs.CR cs.AI

    Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In

    Authors: Itay Nakash, George Kour, Guy Uziel, Ateret Anaby-Tavor

    Abstract: Following the advancement of large language models (LLMs), the development of LLM-based autonomous agents has become increasingly prevalent. As a result, the need to understand the security vulnerabilities of these agents has become a critical task. We examine how ReAct agents can be exploited using a straightforward yet effective method we refer to as the foot-in-the-door attack. Our experiments… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.