Skip to main content

Showing 1–5 of 5 results for author: Malkin, D

.
  1. arXiv:2503.19786  [pdf, other

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  2. arXiv:2502.21228  [pdf, other

    cs.CL cs.AI

    ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer

    Authors: Omer Goldman, Uri Shaham, Dan Malkin, Sivan Eiger, Avinatan Hassidim, Yossi Matias, Joshua Maynez, Adi Mayrav Gilady, Jason Riesa, Shruti Rijhwani, Laura Rimell, Idan Szpektor, Reut Tsarfaty, Matan Eyal

    Abstract: To achieve equitable performance across languages, multilingual large language models (LLMs) must be able to abstract knowledge beyond the language in which it was acquired. However, the current literature lacks reliable ways to measure LLMs' capability of cross-lingual knowledge transfer. To that end, we present ECLeKTic, a multilingual closed-book QA (CBQA) dataset that Evaluates Cross-Lingual K… ▽ More

    Submitted 3 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  3. arXiv:2401.00595  [pdf, other

    cs.CL

    State of What Art? A Call for Multi-Prompt LLM Evaluation

    Authors: Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky

    Abstract: Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks. These benchmarks typically rely on a single instruction template for evaluating all LLMs on a specific task. In this paper, we comprehensively analyze the brittleness of results obtained via single-prompt evaluations across 6.5M instances, involving 20 different LLMs and 39 tasks from 3 be… ▽ More

    Submitted 6 May, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted at TACL; pre-MIT Press publication version

  4. arXiv:2210.07135  [pdf, other

    cs.CL

    You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models

    Authors: Tomasz Limisiewicz, Dan Malkin, Gabriel Stanovsky

    Abstract: Multilingual models have been widely used for cross-lingual transfer to low-resource languages. However, the performance on these languages is hindered by their underrepresentation in the pretraining data. To alleviate this problem, we propose a novel multilingual training technique based on teacher-student knowledge distillation. In this setting, we utilize monolingual teacher models optimized fo… ▽ More

    Submitted 26 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: SIGTYP 2023

  5. arXiv:2205.04086  [pdf, other

    cs.CL cs.AI

    A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank

    Authors: Dan Malkin, Tomasz Limisiewicz, Gabriel Stanovsky

    Abstract: We show that the choice of pretraining languages affects downstream cross-lingual transfer for BERT-based models. We inspect zero-shot performance in balanced data conditions to mitigate data size confounds, classifying pretraining languages that improve downstream performance as donors, and languages that are improved in zero-shot performance as recipients. We develop a method of quadratic time c… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022