Skip to main content

Showing 1–2 of 2 results for author: Culver, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.19584  [pdf, other

    cs.CL

    SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain

    Authors: Pierre Colombo, Telmo Pires, Malik Boudiaf, Rui Melo, Dominic Culver, Sofia Morgado, Etienne Malaboeuf, Gabriel Hautreux, Johanne Charpentier, Michael Desa

    Abstract: In this paper, we introduce SaulLM-54B and SaulLM-141B, two large language models (LLMs) tailored for the legal sector. These models, which feature architectures of 54 billion and 141 billion parameters, respectively, are based on the Mixtral architecture. The development of SaulLM-54B and SaulLM-141B is guided by large-scale domain adaptation, divided into three strategies: (1) the exploitation o… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  2. arXiv:2403.03883  [pdf, other

    cs.CL

    SaulLM-7B: A pioneering Large Language Model for Law

    Authors: Pierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Dominic Culver, Rui Melo, Caio Corro, Andre F. T. Martins, Fabrizio Esposito, Vera LĂșcia Raposo, Sofia Morgado, Michael Desa

    Abstract: In this paper, we introduce SaulLM-7B, a large language model (LLM) tailored for the legal domain. With 7 billion parameters, SaulLM-7B is the first LLM designed explicitly for legal text comprehension and generation. Leveraging the Mistral 7B architecture as its foundation, SaulLM-7B is trained on an English legal corpus of over 30 billion tokens. SaulLM-7B exhibits state-of-the-art proficiency i… ▽ More

    Submitted 7 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.