Skip to main content

Showing 1–2 of 2 results for author: Mina, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.08489  [pdf, other

    cs.CL

    Salamandra Technical Report

    Authors: Aitor Gonzalez-Agirre, Marc Pàmies, Joan Llop, Irene Baucells, Severino Da Dalt, Daniel Tamayo, José Javier Saiz, Ferran Espuña, Jaume Prats, Javier Aula-Blasco, Mario Mina, Iñigo Pikabea, Adrián Rubio, Alexander Shvets, Anna Sallés, Iñaki Lacunza, Jorge Palomar, Júlia Falcão, Lucía Tormo, Luis Vasquez-Reina, Montserrat Marimon, Oriol Pareras, Valle Ruiz-Fernández, Marta Villegas

    Abstract: This work introduces Salamandra, a suite of open-source decoder-only large language models available in three different sizes: 2, 7, and 40 billion parameters. The models were trained from scratch on highly multilingual data that comprises text in 35 European languages and code. Our carefully curated corpus is made exclusively from open-access data compiled from a wide variety of sources. Along wi… ▽ More

    Submitted 13 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  2. arXiv:2009.05863  [pdf, other

    stat.ME cs.AI

    Tracking disease outbreaks from sparse data with Bayesian inference

    Authors: Bryan Wilder, Michael J. Mina, Milind Tambe

    Abstract: The COVID-19 pandemic provides new motivation for a classic problem in epidemiology: estimating the empirical rate of transmission during an outbreak (formally, the time-varying reproduction number) from case counts. While standard methods exist, they work best at coarse-grained national or state scales with abundant data, and struggle to accommodate the partial observability and sparse data commo… ▽ More

    Submitted 12 September, 2020; originally announced September 2020.

    Report number: Accepted at AAAI 2021