Showing 1–6 of 6 results for author: von Rütte, D

  1. arXiv:2505.19122  [pdf, ps, other]

    cs.CV cs.LG

    Exploring Magnitude Preservation and Rotation Modulation in Diffusion Transformers

    Authors: Eric Tillman Bill, Cristian Perez Jensen, Sotiris Anagnostidis, Dimitri von Rütte

    Abstract: Denoising diffusion models exhibit remarkable generative capabilities, but remain challenging to train due to their inherent stochasticity, where high-variance gradient estimates lead to slow convergence. Previous works have shown that magnitude preservation helps with stabilizing training in the U-net architecture. This work explores whether this effect extends to the Diffusion Transformer (DiT)…

    Submitted 25 May, 2025; originally announced May 2025.

  2. arXiv:2503.04482  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Generalized Interpolating Discrete Diffusion

    Authors: Dimitri von Rütte, Janis Fluri, Yuhui Ding, Antonio Orvieto, Bernhard Schölkopf, Thomas Hofmann

    Abstract: While state-of-the-art language models achieve impressive results through next-token prediction, they have inherent limitations such as the inability to revise already generated tokens. This has prompted exploration of alternative approaches such as discrete diffusion. However, masked diffusion, which has emerged as a popular choice due to its simplicity and effectiveness, reintroduces this inabil…

    Submitted 9 June, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: Published at ICML 2025; Code available at https://github.com/dvruette/gidd

  3. arXiv:2402.14433  [pdf, other]

    cs.CL cs.AI

    A Language Model's Guide Through Latent Space

    Authors: Dimitri von Rütte, Sotiris Anagnostidis, Gregor Bachmann, Thomas Hofmann

    Abstract: Concept guidance has emerged as a cheap and simple way to control the behavior of language models by probing their hidden representations for concept vectors and using them to perturb activations at inference time. While the focus of previous work has largely been on truthfulness, in this paper we extend this framework to a richer set of concepts such as appropriateness, humor, creativity and qual…

    Submitted 22 February, 2024; originally announced February 2024.

    ACM Class: I.2

  4. arXiv:2307.10159  [pdf, other]

    cs.CV

    FABRIC: Personalizing Diffusion Models with Iterative Feedback

    Authors: Dimitri von Rütte, Elisabetta Fedele, Jonathan Thomm, Lukas Wolf

    Abstract: In an era where visual content generation is increasingly driven by machine learning, the integration of human feedback into generative models presents significant opportunities for enhancing user experience and output quality. This study explores strategies for incorporating iterative human feedback into the generative process of diffusion-based text-to-image models. We propose FABRIC, a training…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 14 pages, 7 figures

    MSC Class: I.2.10

  5. arXiv:2304.07327  [pdf, other]

    cs.CL cs.AI

    OpenAssistant Conversations -- Democratizing Large Language Model Alignment

    Authors: Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick

    Abstract: Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their acce…

    Submitted 31 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Published in NeurIPS 2023 Datasets and Benchmarks

    Report number: V-02; ACM Class: I.2

  6. arXiv:2201.10936  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

    Authors: Dimitri von Rütte, Luca Biggio, Yannic Kilcher, Thomas Hofmann

    Abstract: Generating music with deep neural networks has been an area of active research in recent years. While the quality of generated samples has been steadily increasing, most methods are only able to exert minimal control over the generated sequence, if any. We propose the self-supervised description-to-sequence task, which allows for fine-grained controllable generation on a global level. We do so by…

    Submitted 22 February, 2024; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: Published in ICLR 2023