Skip to main content

Showing 1–4 of 4 results for author: Akata, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.15678  [pdf, ps, other

    cs.LG

    Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models

    Authors: Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata, Matthias Bethge, Joshua B. Tenenbaum, Eric Schulz

    Abstract: Pre-trained vision language models still fall short of human visual cognition. In an effort to improve visual cognition and align models with human behavior, we introduce visual stimuli and human judgments on visual cognition tasks, allowing us to systematically evaluate performance across cognitive domains under a consistent environment. We fine-tune models on ground truth data for intuitive phys… ▽ More

    Submitted 30 May, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  2. arXiv:2410.20268  [pdf, other

    cs.LG

    Centaur: a foundation model of human cognition

    Authors: Marcel Binz, Elif Akata, Matthias Bethge, Franziska Brändle, Fred Callaway, Julian Coda-Forno, Peter Dayan, Can Demircan, Maria K. Eckstein, Noémi Éltető, Thomas L. Griffiths, Susanne Haridi, Akshay K. Jagadish, Li Ji-An, Alexander Kipnis, Sreejan Kumar, Tobias Ludwig, Marvin Mathony, Marcelo Mattar, Alireza Modirshanechi, Surabhi S. Nath, Joshua C. Peterson, Milena Rmus, Evan M. Russek, Tankred Saanum , et al. (15 additional authors not shown)

    Abstract: Establishing a unified theory of cognition has been a major goal of psychology. While there have been previous attempts to instantiate such theories by building computational models, we currently do not have one model that captures the human mind in its entirety. A first step in this direction is to create a model that can predict human behavior in a wide range of settings. Here we introduce Centa… ▽ More

    Submitted 28 April, 2025; v1 submitted 26 October, 2024; originally announced October 2024.

  3. arXiv:2311.16093  [pdf, other

    cs.LG

    Visual cognition in multimodal large language models

    Authors: Luca M. Schulze Buschoff, Elif Akata, Matthias Bethge, Eric Schulz

    Abstract: A chief goal of artificial intelligence is to build machines that think like people. Yet it has been argued that deep neural network architectures fail to accomplish this. Researchers have asserted these models' limitations in the domains of causal reasoning, intuitive physics, and intuitive psychology. Yet recent advancements, namely the rise of large language models, particularly those designed… ▽ More

    Submitted 8 August, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Updated manuscript

  4. Playing repeated games with Large Language Models

    Authors: Elif Akata, Lion Schulz, Julian Coda-Forno, Seong Joon Oh, Matthias Bethge, Eric Schulz

    Abstract: LLMs are increasingly used in applications where they interact with humans and other agents. We propose to use behavioural game theory to study LLM's cooperation and coordination behaviour. We let different LLMs play finitely repeated $2\times2$ games with each other, with human-like strategies, and actual human players. Our results show that LLMs perform particularly well at self-interested games… ▽ More

    Submitted 7 May, 2025; v1 submitted 26 May, 2023; originally announced May 2023.