Showing 1–1 of 1 results for author: Eberhard, H

  1. arXiv:2501.11223  [pdf, ps, other]

    cs.AI cs.CL

    Reasoning Language Models: A Blueprint

    Authors: Maciej Besta, Julia Barth, Eric Schreiber, Ales Kubicek, Afonso Catarino, Robert Gerstenberger, Piotr Nyczyk, Patrick Iff, Yueling Li, Sam Houliston, Tomasz Sternal, Marcin Copik, Grzegorz Kwaśniewski, Jürgen Müller, Łukasz Flis, Hannes Eberhard, Zixuan Chen, Hubert Niewiadomski, Torsten Hoefler

    Abstract: Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning mechanisms. Yet, their high costs, proprietary nature, and complex architectures - uniquely combining reinforcement learning (RL), search heuristics, and LLMs - present a…

    Submitted 11 June, 2025; v1 submitted 19 January, 2025; originally announced January 2025.