Skip to main content

Showing 1–8 of 8 results for author: Beurer-Kellner, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.08837  [pdf, ps, other

    cs.LG cs.CR

    Design Patterns for Securing LLM Agents against Prompt Injections

    Authors: Luca Beurer-Kellner, Beat Buesser, Ana-Maria Creţu, Edoardo Debenedetti, Daniel Dobos, Daniel Fabian, Marc Fischer, David Froelicher, Kathrin Grosse, Daniel Naeff, Ezinwanne Ozoani, Andrew Paverd, Florian Tramèr, Václav Volhejn

    Abstract: As AI agents powered by Large Language Models (LLMs) become increasingly versatile and capable of addressing a broad spectrum of tasks, ensuring their security has become a critical challenge. Among the most pressing threats are prompt injection attacks, which exploit the agent's resilience on natural language inputs -- an especially dangerous threat when agents are granted tool access or handle s… ▽ More

    Submitted 27 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2406.13352  [pdf, other

    cs.CR cs.LG

    AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

    Authors: Edoardo Debenedetti, Jie Zhang, Mislav Balunović, Luca Beurer-Kellner, Marc Fischer, Florian Tramèr

    Abstract: AI agents aim to solve complex tasks by combining text-based reasoning with external tool calls. Unfortunately, AI agents are vulnerable to prompt injection attacks where data returned by external tools hijacks the agent to execute malicious tasks. To measure the adversarial robustness of AI agents, we introduce AgentDojo, an evaluation framework for agents that execute tools over untrusted data.… ▽ More

    Submitted 24 November, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Updated version after fixing a bug in the Llama implementation and updating the travel suite

  3. arXiv:2403.06988  [pdf, other

    cs.LG cs.CL

    Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation

    Authors: Luca Beurer-Kellner, Marc Fischer, Martin Vechev

    Abstract: To ensure that text generated by large language models (LLMs) is in an expected format, constrained decoding proposes to enforce strict formal language constraints during generation. However, as we show in this work, not only do such methods incur performance overhead during generation, but many of them also significantly impair task accuracy, if they do not correctly align the underlying LLM sub-… ▽ More

    Submitted 7 February, 2024; originally announced March 2024.

  4. arXiv:2311.14479  [pdf, other

    cs.CL

    Controlled Text Generation via Language Model Arithmetic

    Authors: Jasper Dekoninck, Marc Fischer, Luca Beurer-Kellner, Martin Vechev

    Abstract: As Large Language Models (LLMs) are deployed more widely, customization with respect to vocabulary, style, and character becomes more important. In this work, we introduce model arithmetic, a novel inference framework for composing and biasing LLMs without the need for model (re)training or highly specific datasets. In addition, the framework allows for more precise control of generated text than… ▽ More

    Submitted 6 March, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  5. arXiv:2311.04954  [pdf, other

    cs.CL cs.AI

    Prompt Sketching for Large Language Models

    Authors: Luca Beurer-Kellner, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Many recent prompting strategies for large language models (LLMs) query the model multiple times sequentially -- first to produce intermediate results and then the final answer. However, using these methods, both decoder and model are unaware of potential follow-up prompts, leading to disconnected and undesirably wordy intermediate responses. In this work, we address this issue by proposing prompt… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  6. arXiv:2212.06094  [pdf, other

    cs.CL cs.AI

    Prompting Is Programming: A Query Language for Large Language Models

    Authors: Luca Beurer-Kellner, Marc Fischer, Martin Vechev

    Abstract: Large language models have demonstrated outstanding performance on a wide range of tasks such as question answering and code generation. On a high level, given an input, a language model can be used to automatically complete the sequence in a statistically-likely way. Based on this, users prompt these models with language instructions or examples, to implement a variety of downstream tasks. Advanc… ▽ More

    Submitted 30 May, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: To be published at PLDI'23: 44th ACM SIGPLAN International Conference on Programming Language Design and Implementation

  7. arXiv:2211.01980  [pdf, other

    cs.NI cs.LG

    Learning to Configure Computer Networks with Neural Algorithmic Reasoning

    Authors: Luca Beurer-Kellner, Martin Vechev, Laurent Vanbever, Petar Veličković

    Abstract: We present a new method for scaling automatic configuration of computer networks. The key idea is to relax the computationally hard search problem of finding a configuration that satisfies a given specification into an approximate objective amenable to learning-based techniques. Based on this idea, we train a neural algorithmic model which learns to generate configurations likely to (fully or part… ▽ More

    Submitted 26 October, 2022; originally announced November 2022.

  8. arXiv:2204.10049  [pdf, other

    cs.LG cs.PL cs.SE

    On Distribution Shift in Learning-based Bug Detectors

    Authors: Jingxuan He, Luca Beurer-Kellner, Martin Vechev

    Abstract: Deep learning has recently achieved initial success in program analysis tasks such as bug detection. Lacking real bugs, most existing works construct training and test data by injecting synthetic bugs into correct programs. Despite achieving high test accuracy (e.g., 90%), the resulting bug detectors are found to be surprisingly unusable in practice, i.e., <10% precision when used to scan real sof… ▽ More

    Submitted 19 June, 2022; v1 submitted 21 April, 2022; originally announced April 2022.