Skip to main content

Showing 1–9 of 9 results for author: Karmon, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.03276  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    ComMer: a Framework for Compressing and Merging User Data for Personalization

    Authors: Yoel Zeldes, Amir Zait, Ilia Labzovsky, Danny Karmon, Efrat Farkash

    Abstract: Large Language Models (LLMs) excel at a wide range of tasks, but adapting them to new data, particularly for personalized applications, poses significant challenges due to resource and computational constraints. Existing methods either rely on exposing fresh data to the model through the prompt, which is limited by context size and computationally expensive at inference time, or fine-tuning, which… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 13 pages, 7 figures

  2. arXiv:2410.11756  [pdf, other

    cs.AI

    Evidence of Cognitive Deficits andDevelopmental Advances in Generative AI: A Clock Drawing Test Analysis

    Authors: Isaac R. Galatzer-Levy, Jed McGiffin, David Munday, Xin Liu, Danny Karmon, Ilia Labzovsky, Rivka Moroshko, Amir Zait, Daniel McDuff

    Abstract: Generative AI's rapid advancement sparks interest in its cognitive abilities, especially given its capacity for tasks like language understanding and code generation. This study explores how several recent GenAI models perform on the Clock Drawing Test (CDT), a neuropsychological assessment of visuospatial planning and organization. While models create clock-like drawings, they struggle with accur… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  3. arXiv:2410.07391  [pdf, other

    cs.AI

    The Cognitive Capabilities of Generative AI: A Comparative Analysis with Human Benchmarks

    Authors: Isaac R. Galatzer-Levy, David Munday, Jed McGiffin, Xin Liu, Danny Karmon, Ilia Labzovsky, Rivka Moroshko, Amir Zait, Daniel McDuff

    Abstract: There is increasing interest in tracking the capabilities of general intelligence foundation models. This study benchmarks leading large language models and vision language models against human performance on the Wechsler Adult Intelligence Scale (WAIS-IV), a comprehensive, population-normed assessment of underlying human cognition and intellectual abilities, with a focus on the domains of VerbalC… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  4. arXiv:2409.14371  [pdf, other

    cs.CL

    The Ability of Large Language Models to Evaluate Constraint-satisfaction in Agent Responses to Open-ended Requests

    Authors: Lior Madmoni, Amir Zait, Ilia Labzovsky, Danny Karmon

    Abstract: Generative AI agents are often expected to respond to complex user requests that have No One Right Answer (NORA), e.g., "design a vegetarian meal plan below 1800 calories". Such requests may entail a set of constraints that the agent should adhere to. To successfully develop agents for NORA scenarios, an accurate automatic evaluation framework is essential, and specifically - one capable of valida… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  5. arXiv:2312.03664  [pdf, other

    cs.AI cs.CL

    Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia

    Authors: Alexander Sasha Vezhnevets, John P. Agapiou, Avia Aharon, Ron Ziv, Jayd Matyas, Edgar A. Duéñez-Guzmán, William A. Cunningham, Simon Osindero, Danny Karmon, Joel Z. Leibo

    Abstract: Agent-based modeling has been around for decades, and applied widely across the social and natural sciences. The scope of this research method is now poised to grow dramatically as it absorbs the new affordances provided by Large Language Models (LLM)s. Generative Agent-Based Models (GABM) are not just classic Agent-Based Models (ABM)s where the agents talk to one another. Rather, GABMs are constr… ▽ More

    Submitted 13 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: 32 pages, 5 figures

  6. arXiv:2211.09722  [pdf, other

    cs.CL cs.LG

    Federated Multilingual Models for Medical Transcript Analysis

    Authors: Andre Manoel, Mirian Hipolito Garcia, Tal Baumel, Shize Su, Jialei Chen, Dan Miller, Danny Karmon, Robert Sim, Dimitrios Dimitriadis

    Abstract: Federated Learning (FL) is a novel machine learning approach that allows the model trainer to access more data samples, by training the model across multiple decentralized data sources, while data access constraints are in place. Such trained models can achieve significantly higher performance beyond what can be done when trained on a single data source. As part of FL's promises, none of the train… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  7. arXiv:1910.12274  [pdf, other

    cs.IR cs.AI cs.CL

    Algorithmic Copywriting: Automated Generation of Health-Related Advertisements to Improve their Performance

    Authors: Brit Youngmann, Ran Gilad-Bachrach, Danny Karmon, Elad Yom-Tov

    Abstract: Search advertising, a popular method for online marketing, has been employed to improve health by eliciting positive behavioral change. However, writing effective advertisements requires expertise and experimentation, which may not be available to health authorities wishing to elicit such changes, especially when dealing with public health crises such as epidemic outbreaks. Here we develop a fra… ▽ More

    Submitted 12 July, 2020; v1 submitted 27 October, 2019; originally announced October 2019.

  8. arXiv:1801.02608  [pdf, other

    cs.CV cs.LG

    LaVAN: Localized and Visible Adversarial Noise

    Authors: Danny Karmon, Daniel Zoran, Yoav Goldberg

    Abstract: Most works on adversarial examples for deep-learning based image classifiers use noise that, while small, covers the entire image. We explore the case where the noise is allowed to be visible but confined to a small, localized patch of the image, without covering any of the main object(s) in the image. We show that it is possible to generate localized adversarial noises that cover only 2% of the p… ▽ More

    Submitted 1 March, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

  9. arXiv:1512.02033  [pdf, ps, other

    cs.LG

    Risk Minimization in Structured Prediction using Orbit Loss

    Authors: Danny Karmon, Joseph Keshet

    Abstract: We introduce a new surrogate loss function called orbit loss in the structured prediction framework, which has good theoretical and practical advantages. While the orbit loss is not convex, it has a simple analytical gradient and a simple perceptron-like learning rule. We analyze the new loss theoretically and state a PAC-Bayesian generalization bound. We also prove that the new loss is consistent… ▽ More

    Submitted 9 December, 2015; v1 submitted 7 December, 2015; originally announced December 2015.