Skip to main content

Showing 1–8 of 8 results for author: Ruoss, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.17010  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Understanding Prompt Tuning and In-Context Learning via Meta-Learning

    Authors: Tim Genewein, Kevin Wenliang Li, Jordi Grau-Moya, Anian Ruoss, Laurent Orseau, Marcus Hutter

    Abstract: Prompting is one of the main ways to adapt a pretrained model to target tasks. Besides manually constructing prompts, many prompt optimization methods have been proposed in the literature. Method development is mainly empirically driven, with less emphasis on a conceptual understanding of prompting. In this paper we discuss how optimal prompting can be understood through a Bayesian view, which als… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  2. arXiv:2502.10760  [pdf, other

    cs.CL cs.LG stat.ML

    Why is prompting hard? Understanding prompts on binary sequence predictors

    Authors: Li Kevin Wenliang, Anian Ruoss, Jordi Grau-Moya, Marcus Hutter, Tim Genewein

    Abstract: Large language models (LLMs) can be prompted to do many tasks, but finding good prompts is not always easy, nor is understanding some performant prompts. We explore these issues by viewing prompting as conditioning a near-optimal sequence predictor (LLM) pretrained on diverse data sources. Through numerous prompt search experiments, we show that the unintuitive patterns in optimal prompts can be b… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  3. arXiv:2402.04494  [pdf, other

    cs.LG cs.AI stat.ML

    Amortized Planning with Large-Scale Transformers: A Case Study on Chess

    Authors: Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Cannada A. Lewis, Joel Veness, Tim Genewein

    Abstract: This paper uses chess, a landmark planning problem in AI, to assess transformers' performance on a planning task where memorization is futile $\unicode{x2013}$ even at a large scale. To this end, we release ChessBench, a large-scale benchmark dataset of 10 million chess games with legal move and value annotations (15 billion data points) provided by Stockfish 16, the state-of-the-art chess engine.… ▽ More

    Submitted 21 October, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2312.07358  [pdf, other

    stat.ML cs.LG

    Distributional Bellman Operators over Mean Embeddings

    Authors: Li Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland

    Abstract: We propose a novel algorithmic framework for distributional reinforcement learning, based on learning finite-dimensional mean embeddings of return distributions. We derive several new algorithms for dynamic programming and temporal-difference learning based on this framework, provide asymptotic convergence theory, and examine the empirical performance of the algorithms on a suite of tabular tasks.… ▽ More

    Submitted 4 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  5. arXiv:2305.16843  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Randomized Positional Encodings Boost Length Generalization of Transformers

    Authors: Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness

    Abstract: Transformers have impressive generalization capabilities on tasks with a fixed context length. However, they fail to generalize to sequences of arbitrary length, even for seemingly simple tasks such as duplicating a string. Moreover, simply training on longer sequences is inefficient due to the quadratic computation complexity of the global attention mechanism. In this work, we demonstrate that th… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  6. arXiv:2302.03067  [pdf, other

    cs.LG cs.AI stat.ML

    Memory-Based Meta-Learning on Non-Stationary Distributions

    Authors: Tim Genewein, Grégoire Delétang, Anian Ruoss, Li Kevin Wenliang, Elliot Catt, Vincent Dutordoir, Jordi Grau-Moya, Laurent Orseau, Marcus Hutter, Joel Veness

    Abstract: Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work is to investigate how far this interpretation can be realized by current sequence prediction models and training regimes. The focus is on piecewise stationary s… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  7. arXiv:2009.09318  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    Efficient Certification of Spatial Robustness

    Authors: Anian Ruoss, Maximilian Baader, Mislav Balunović, Martin Vechev

    Abstract: Recent work has exposed the vulnerability of computer vision models to vector field attacks. Due to the widespread usage of such models in safety-critical applications, it is crucial to quantify their robustness against such spatial transformations. However, existing work only provides empirical robustness quantification against vector field deformations via adversarial attacks, which lack provabl… ▽ More

    Submitted 30 January, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: Conference Paper at AAAI 2021

  8. arXiv:2002.10312  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Learning Certified Individually Fair Representations

    Authors: Anian Ruoss, Mislav Balunović, Marc Fischer, Martin Vechev

    Abstract: Fair representation learning provides an effective way of enforcing fairness constraints without compromising utility for downstream users. A desirable family of such fairness constraints, each requiring similar treatment for similar individuals, is known as individual fairness. In this work, we introduce the first method that enables data consumers to obtain certificates of individual fairness fo… ▽ More

    Submitted 28 November, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Conference Paper at NeurIPS 2020