Showing 1–7 of 7 results for author: Paparas, D

  1. arXiv:2503.19786  [pdf, other]

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin, et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie…

    Submitted 25 March, 2025; originally announced March 2025.

  2. arXiv:2410.08336  [pdf, other]

    cs.LG cs.AI

    Kernel Banzhaf: A Fast and Robust Estimator for Banzhaf Values

    Authors: Yurong Liu, R. Teal Witter, Flip Korn, Tarfah Alrashed, Dimitris Paparas, Christopher Musco, Juliana Freire

    Abstract: Banzhaf values provide a popular, interpretable alternative to the widely-used Shapley values for quantifying the importance of features in machine learning models. Like Shapley values, computing Banzhaf values exactly requires time exponential in the number of features, necessitating the use of efficient estimators. Existing estimators, however, are limited to Monte Carlo sampling methods. In thi…

    Submitted 17 February, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

  3. arXiv:2402.04177  [pdf, other]

    cs.CL cs.LG stat.ML

    Scaling Laws for Downstream Task Performance in Machine Translation

    Authors: Berivan Isik, Natalia Ponomareva, Hussein Hazimeh, Dimitris Paparas, Sergei Vassilvitskii, Sanmi Koyejo

    Abstract: Scaling laws provide important insights that can guide the design of large language models (LLMs). Existing work has primarily focused on studying scaling laws for pretraining (upstream) loss. However, in transfer learning settings, in which LLMs are pretrained on an unsupervised dataset and then finetuned on a downstream task, we often also care about the downstream performance. In this work, we…

    Submitted 20 February, 2025; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2025. Previous title: "Scaling Laws for Downstream Task Performance of Large Language Models"

  4. arXiv:2012.00738  [pdf, other]

    cs.DS cs.GT

    Searching, Sorting, and Cake Cutting in Rounds

    Authors: Simina Brânzei, Dimitris Paparas, Nicholas Recker

    Abstract: We study searching and sorting in rounds motivated by a fair division question: given a cake cutting problem with $n$ players, compute a fair allocation in at most $k$ rounds of interaction with the players. Rounds interpolate between the simultaneous and the fully adaptive settings, also capturing parallel complexity. We find that proportional cake cutting in rounds is equivalent to sorting with…

    Submitted 19 November, 2023; v1 submitted 1 December, 2020; originally announced December 2020.

  5. arXiv:1702.07032  [pdf, ps, other]

    cs.GT cs.CC cs.DS

    On the Complexity of Simple and Optimal Deterministic Mechanisms for an Additive Buyer

    Authors: Xi Chen, George Matikas, Dimitris Paparas, Mihalis Yannakakis

    Abstract: We show that the Revenue-Optimal Deterministic Mechanism Design problem for a single additive buyer is #P-hard, even when the distributions have support size 2 for each item and, more importantly, even when the optimal solution is guaranteed to be of a very simple kind: the seller picks a price for each individual item and a price for the grand bundle of all the items; the buyer can purchase eithe…

    Submitted 14 July, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

  6. arXiv:1311.2138  [pdf, ps, other]

    cs.GT cs.CC

    The Complexity of Optimal Multidimensional Pricing

    Authors: Xi Chen, Ilias Diakonikolas, Dimitris Paparas, Xiaorui Sun, Mihalis Yannakakis

    Abstract: We resolve the complexity of revenue-optimal deterministic auctions in the unit-demand single-buyer Bayesian setting, i.e., the optimal item pricing problem, when the buyer's values for the items are independent. We show that the problem of computing a revenue-optimal pricing can be solved in polynomial time for distributions of support size 2, and its decision version is NP-complete for distribut…

    Submitted 9 November, 2013; originally announced November 2013.

  7. arXiv:1211.4918  [pdf, other]

    cs.CC cs.GT

    The Complexity of Non-Monotone Markets

    Authors: Xi Chen, Dimitris Paparas, Mihalis Yannakakis

    Abstract: We introduce the notion of non-monotone utilities, which covers a wide variety of utility functions in economic theory. We then prove that it is PPAD-hard to compute an approximate Arrow-Debreu market equilibrium in markets with linear and non-monotone utilities. Building on this result, we settle the long-standing open problem regarding the computation of an approximate Arrow-Debreu market equili…

    Submitted 20 November, 2012; originally announced November 2012.