Showing 1–4 of 4 results for author: Labzovsky, I

Searching in archive cs.
  1. arXiv:2501.03276  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    ComMer: a Framework for Compressing and Merging User Data for Personalization

    Authors: Yoel Zeldes, Amir Zait, Ilia Labzovsky, Danny Karmon, Efrat Farkash

    Abstract: Large Language Models (LLMs) excel at a wide range of tasks, but adapting them to new data, particularly for personalized applications, poses significant challenges due to resource and computational constraints. Existing methods either rely on exposing fresh data to the model through the prompt, which is limited by context size and computationally expensive at inference time, or fine-tuning, which…

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 13 pages, 7 figures

  2. arXiv:2410.11756  [pdf, other]

    cs.AI

    Evidence of Cognitive Deficits and Developmental Advances in Generative AI: A Clock Drawing Test Analysis

    Authors: Isaac R. Galatzer-Levy, Jed McGiffin, David Munday, Xin Liu, Danny Karmon, Ilia Labzovsky, Rivka Moroshko, Amir Zait, Daniel McDuff

    Abstract: Generative AI's rapid advancement sparks interest in its cognitive abilities, especially given its capacity for tasks like language understanding and code generation. This study explores how several recent GenAI models perform on the Clock Drawing Test (CDT), a neuropsychological assessment of visuospatial planning and organization. While models create clock-like drawings, they struggle with accur…

    Submitted 15 October, 2024; originally announced October 2024.

  3. arXiv:2410.07391  [pdf, other]

    cs.AI

    The Cognitive Capabilities of Generative AI: A Comparative Analysis with Human Benchmarks

    Authors: Isaac R. Galatzer-Levy, David Munday, Jed McGiffin, Xin Liu, Danny Karmon, Ilia Labzovsky, Rivka Moroshko, Amir Zait, Daniel McDuff

    Abstract: There is increasing interest in tracking the capabilities of general intelligence foundation models. This study benchmarks leading large language models and vision language models against human performance on the Wechsler Adult Intelligence Scale (WAIS-IV), a comprehensive, population-normed assessment of underlying human cognition and intellectual abilities, with a focus on the domains of VerbalC…

    Submitted 9 October, 2024; originally announced October 2024.

  4. arXiv:2409.14371  [pdf, other]

    cs.CL

    The Ability of Large Language Models to Evaluate Constraint-satisfaction in Agent Responses to Open-ended Requests

    Authors: Lior Madmoni, Amir Zait, Ilia Labzovsky, Danny Karmon

    Abstract: Generative AI agents are often expected to respond to complex user requests that have No One Right Answer (NORA), e.g., "design a vegetarian meal plan below 1800 calories". Such requests may entail a set of constraints that the agent should adhere to. To successfully develop agents for NORA scenarios, an accurate automatic evaluation framework is essential, and specifically - one capable of valida…

    Submitted 22 September, 2024; originally announced September 2024.