
Showing 1–6 of 6 results for author: Gureckis, T

  1. arXiv:2505.12075  [pdf, ps, other]

    cs.CL cs.LG

    Do different prompting methods yield a common task representation in language models?

    Authors: Guy Davidson, Todd M. Gureckis, Brenden M. Lake, Adina Williams

    Abstract: Demonstrations and instructions are two primary approaches for prompting language models to perform in-context learning (ICL) tasks. Do identical tasks elicited in different ways result in similar representations of the task? An improved understanding of task representation mechanisms would offer interpretability insights and may aid in steering models. We study this through \textit{function vecto…

    Submitted 21 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

    Comments: 9 pages, 4 figures; under review

  2. arXiv:2409.01374  [pdf, other]

    cs.AI

    H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark

    Authors: Solim LeGris, Wai Keen Vong, Brenden M. Lake, Todd M. Gureckis

    Abstract: The Abstraction and Reasoning Corpus (ARC) is a visual program synthesis benchmark designed to test challenging out-of-distribution generalization in humans and machines. Since 2019, limited progress has been observed on the challenge using existing artificial intelligence methods. Comparing human and machine performance is important for the validity of the benchmark. While previous work explored…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 12 pages, 7 figures

  3. Goals as Reward-Producing Programs

    Authors: Guy Davidson, Graham Todd, Julian Togelius, Todd M. Gureckis, Brenden M. Lake

    Abstract: People are remarkably capable of generating their own goals, beginning with child's play and continuing into adulthood. Despite considerable empirical and computational work on goals and goal-oriented behavior, models are still far from capturing the richness of everyday human goals. Here, we bridge this gap by collecting a dataset of human-generated playful goals (in the form of scorable, single-…

    Submitted 10 September, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Project website and goal program viewer: https://exps.gureckislab.org/guydav/goal_programs_viewer/main/

  4. arXiv:2401.06005  [pdf, other]

    q-bio.NC cs.AI cs.CV cs.LG

    How does the primate brain combine generative and discriminative computations in vision?

    Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

    Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remo…

    Submitted 11 January, 2024; originally announced January 2024.

  5. arXiv:2103.05823  [pdf, other]

    cs.HC cs.AI cs.LG

    Fast and flexible: Human program induction in abstract reasoning tasks

    Authors: Aysja Johnson, Wai Keen Vong, Brenden M. Lake, Todd M. Gureckis

    Abstract: The Abstraction and Reasoning Corpus (ARC) is a challenging program induction dataset that was recently proposed by Chollet (2019). Here, we report the first set of results collected from a behavioral study of humans solving a subset of tasks from ARC (40 out of 1000). Although this subset of tasks contains considerable variation, our results showed that humans were able to infer the underlying pr…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 7 pages, 7 figures, 1 table

  6. arXiv:1711.06351  [pdf, other]

    cs.CL cs.AI cs.LG

    Question Asking as Program Generation

    Authors: Anselm Rothe, Brenden M. Lake, Todd M. Gureckis

    Abstract: A hallmark of human intelligence is the ability to ask rich, creative, and revealing questions. Here we introduce a cognitive model capable of constructing human-like questions. Our approach treats questions as formal programs that, when executed on the state of the world, output an answer. The model specifies a probability distribution over a complex, compositional space of programs, favoring con…

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: Published in Advances in Neural Information Processing Systems (NIPS) 30, December 2017

    Journal ref: Rothe, A., Lake, B. M., and Gureckis, T. M. (2017). Question asking as program generation. Advances in Neural Information Processing Systems 30.