Skip to main content

Showing 1–1 of 1 results for author: Prime Intellect Team

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.07291  [pdf, ps, other

    cs.LG cs.DC

    INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

    Authors: Prime Intellect Team, Sami Jaghouar, Justus Mattern, Jack Min Ong, Jannik Straube, Manveer Basra, Aaron Pazdera, Kushal Thaman, Matthew Di Ferrante, Felix Gabriel, Fares Obeid, Kemal Erdem, Michael Keiblinger, Johannes Hagemann

    Abstract: We introduce INTELLECT-2, the first globally distributed reinforcement learning (RL) training run of a 32 billion parameter language model. Unlike traditional centralized training efforts, INTELLECT-2 trains a reasoning model using fully asynchronous RL across a dynamic, heterogeneous swarm of permissionless compute contributors. To enable a training run with this unique infrastructure, we built… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 26 pages, 12 figures