Skip to main content

Showing 1–1 of 1 results for author: Ripplinger, S

.
  1. arXiv:2405.06694  [pdf, other

    cs.CL cs.AI

    SUTRA: Scalable Multilingual Language Model Architecture

    Authors: Abhijit Bendale, Michael Sapienza, Steven Ripplinger, Simon Gibbs, Jaewon Lee, Pranav Mistry

    Abstract: In this paper, we introduce SUTRA, multilingual Large Language Model architecture capable of understanding, reasoning, and generating text in over 50 languages. SUTRA's design uniquely decouples core conceptual understanding from language-specific processing, which facilitates scalable and efficient multilingual alignment and learning. Employing a Mixture of Experts framework both in language and… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.