Skip to main content

Showing 1–1 of 1 results for author: Rohr, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.11096  [pdf, other

    cs.AI cs.CL

    Mixture of Tunable Experts -- Behavior Modification of DeepSeek-R1 at Inference Time

    Authors: Robert Dahlke, Henrik Klagges, Dan Zecha, Benjamin Merkel, Sven Rohr, Fabian Klemm

    Abstract: We present the Mixture-of-Tunable-Experts (MoTE), a method that extends the Mixture-of-Experts architecture of Large Language Models (LLMs). Without additional training, MoTE enables meaningful and focused behavior changes in LLMs on-the-fly during inference time. By analyzing the digital LLM brain of DeepSeek-R1 using a technique we dub 'functional Token Resonance Imaging' (fTRI) -- inspired by f… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.