Showing 1–8 of 8 results for author: Russin, J

  1. arXiv:2405.15164  [pdf, other]

    cs.NE cs.AI cs.LG

    From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

    Authors: Jacob Russin, Sam Whitman McGrath, Danielle J. Williams, Lotem Elber-Dorozko

    Abstract: Compositionality has long been considered a key explanatory property underlying human intelligence: arbitrary concepts can be composed into novel complex combinations, permitting the acquisition of an open ended, potentially infinite expressive capacity from finite learning experiences. Influential arguments have held that neural networks fail to explain this aspect of behavior, leading many to di…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 32 pages (50 pages including references), 8 figures

  2. arXiv:2405.13231  [pdf]

    cs.AI

    Multiple Realizability and the Rise of Deep Learning

    Authors: Sam Whitman McGrath, Jacob Russin

    Abstract: The multiple realizability thesis holds that psychological states may be implemented in a diversity of physical systems. The deep learning revolution seems to be bringing this possibility to life, offering the most plausible examples of man-made realizations of sophisticated cognitive functions to date. This paper explores the implications of deep learning models for the multiple realizability the…

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2402.08674  [pdf, other]

    cs.NE cs.LG q-bio.NC

    The dynamic interplay between in-context and in-weight learning in humans and neural networks

    Authors: Jacob Russin, Ellie Pavlick, Michael J. Frank

    Abstract: Human learning embodies a striking duality: sometimes, we appear capable of following logical, compositional rules and benefit from structured curricula (e.g., in formal education), while other times, we rely on an incremental approach or trial-and-error, learning better from curricula that are randomly interleaved. Influential psychological theories explain this seemingly disparate behavioral evi…

    Submitted 25 April, 2025; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 15 pages (excluding appendix and references), 10 pages of appendix, 14 figures, 7 tables. Previous version accepted as a talk + full paper at CogSci 2024

  4. arXiv:2309.06629  [pdf, other]

    cs.AI cs.NE

    The Relational Bottleneck as an Inductive Bias for Efficient Abstraction

    Authors: Taylor W. Webb, Steven M. Frankland, Awni Altabaa, Simon Segert, Kamesh Krishnamurthy, Declan Campbell, Jacob Russin, Tyler Giallanza, Zack Dulberg, Randall O'Reilly, John Lafferty, Jonathan D. Cohen

    Abstract: A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck…

    Submitted 1 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  5. arXiv:2202.04773  [pdf, other]

    q-bio.NC cs.LG cs.NE

    A Neural Network Model of Continual Learning with Cognitive Control

    Authors: Jacob Russin, Maryam Zolfaghar, Seongmin A. Park, Erie Boorman, Randall C. O'Reilly

    Abstract: Neural networks struggle in continual learning settings from catastrophic forgetting: when trials are blocked, new learning can overwrite the learning from previous blocks. Humans learn effectively in these settings, in some cases even showing an advantage of blocking, suggesting the brain contains mechanisms to overcome this problem. Here, we build on previous work and show that neural networks e…

    Submitted 3 November, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 7 pages, 5 figures, paper accepted as a talk to CogSci 2022 (https://escholarship.org/uc/item/3gn3w58z)

    Journal ref: CogSci 2022, 44

  6. arXiv:2105.08961  [pdf, other]

    cs.LG cs.AI cs.CL

    Compositional Processing Emerges in Neural Networks Solving Math Problems

    Authors: Jacob Russin, Roland Fernandez, Hamid Palangi, Eric Rosen, Nebojsa Jojic, Paul Smolensky, Jianfeng Gao

    Abstract: A longstanding question in cognitive science concerns the learning mechanisms underlying compositionality in human cognition. Humans can infer the structured relationships (e.g., grammatical rules) implicit in their sensory observations (e.g., auditory speech), and use this knowledge to guide the composition of simpler meanings into complex wholes. Recent progress in artificial neural networks has…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 7 pages, 2 figures, Accepted to CogSci 2021 for poster presentation

  7. arXiv:2105.08944  [pdf, other]

    q-bio.NC cs.LG

    Complementary Structure-Learning Neural Networks for Relational Reasoning

    Authors: Jacob Russin, Maryam Zolfaghar, Seongmin A. Park, Erie Boorman, Randall C. O'Reilly

    Abstract: The neural mechanisms supporting flexible relational inferences, especially in novel situations, are a major focus of current research. In the complementary learning systems framework, pattern separation in the hippocampus allows rapid learning in novel environments, while slower learning in neocortex accumulates small weight changes to extract systematic structure from well-learned environments.…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 7 pages, 4 figures, Accepted to CogSci 2021 for poster presentation

  8. arXiv:1904.09708  [pdf, other]

    cs.LG cs.CL stat.ML

    Compositional generalization in a deep seq2seq model by separating syntax and semantics

    Authors: Jake Russin, Jason Jo, Randall C. O'Reilly, Yoshua Bengio

    Abstract: Standard methods in deep learning for natural language processing fail to capture the compositional structure of human language that allows for systematic generalization outside of the training distribution. However, human learners readily generalize in this way, e.g. by applying known grammatical rules to novel words. Inspired by work in neuroscience suggesting separate brain systems for syntacti…

    Submitted 23 May, 2019; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: 18 pages, 15 figures, preprint version of a submission to NeurIPS 2019, under review