Skip to main content

Showing 1–2 of 2 results for author: Zubiaga, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00999  [pdf, ps, other

    cs.CL

    La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

    Authors: María Grandury, Javier Aula-Blasco, Júlia Falcão, Clémentine Fourrier, Miguel González, Gonzalo Martínez, Gonzalo Santamaría, Rodrigo Agerri, Nuria Aldama, Luis Chiruzzo, Javier Conde, Helena Gómez, Marta Guerrero, Guido Ivetta, Natalia López, Flor Miriam Plaza-del-Arco, María Teresa Martín-Valdivia, Helena Montoro, Carmen Muñoz, Pedro Reviriego, Leire Rosado, Alejandro Vaca, María Estrella Vallecillo-Rodríguez, Jorge Vallego, Irune Zubiaga

    Abstract: Leaderboards showcase the current capabilities and limitations of Large Language Models (LLMs). To motivate the development of LLMs that represent the linguistic and cultural diversity of the Spanish-speaking community, we present La Leaderboard, the first open-source leaderboard to evaluate generative LLMs in languages and language varieties of Spain and Latin America. La Leaderboard is a communi… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Accepted at ACL 2025 Main

  2. arXiv:2406.15227  [pdf, other

    cs.CL

    A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation

    Authors: Irune Zubiaga, Aitor Soroa, Rodrigo Agerri

    Abstract: This paper proposes a novel approach to evaluate Counter Narrative (CN) generation using a Large Language Model (LLM) as an evaluator. We show that traditional automatic metrics correlate poorly with human judgements and fail to capture the nuanced relationship between generated CNs and human perception. To alleviate this, we introduce a model ranking pipeline based on pairwise comparisons of gene… ▽ More

    Submitted 4 November, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted for Findings of the Association for Computational Linguistics: EMNLP 2024