Showing 1–2 of 2 results for author: Frank, M J

Search v0.5.6 released 2020-02-24

arXiv:2402.08674 [pdf, other]

cs.NE cs.LG q-bio.NC

The dynamic interplay between in-context and in-weight learning in humans and neural networks

Authors: Jacob Russin, Ellie Pavlick, Michael J. Frank

Abstract: Human learning embodies a striking duality: sometimes, we appear capable of following logical, compositional rules and benefit from structured curricula (e.g., in formal education), while other times, we rely on an incremental approach or trial-and-error, learning better from curricula that are randomly interleaved. Influential psychological theories explain this seemingly disparate behavioral evi… ▽ More Human learning embodies a striking duality: sometimes, we appear capable of following logical, compositional rules and benefit from structured curricula (e.g., in formal education), while other times, we rely on an incremental approach or trial-and-error, learning better from curricula that are randomly interleaved. Influential psychological theories explain this seemingly disparate behavioral evidence by positing two qualitatively different learning systems -- one for rapid, rule-based inferences and another for slow, incremental adaptation. It remains unclear how to reconcile such theories with neural networks, which learn via incremental weight updates and are thus a natural model for the latter type of learning, but are not obviously compatible with the former. However, recent evidence suggests that metalearning neural networks and large language models are capable of "in-context learning" (ICL) -- the ability to flexibly grasp the structure of a new task from a few examples. Here, we show that the dynamic interplay between ICL and default in-weight learning (IWL) naturally captures a broad range of learning phenomena observed in humans, reproducing curriculum effects on category-learning and compositional tasks, and recapitulating a tradeoff between flexibility and retention. Our work shows how emergent ICL can equip neural networks with fundamentally different learning properties that can coexist with their native IWL, thus offering a novel perspective on dual-process theories and human cognitive flexibility. △ Less

Submitted 25 April, 2025; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: 15 pages (excluding appendix and references), 10 pages of appendix, 14 figures, 7 tables. Previous version accepted as a talk + full paper at CogSci 2024
arXiv:1112.0778 [pdf, other]

q-bio.NC

A computational model of inhibitory control in frontal cortex and basal ganglia

Authors: Thomas V. Wiecki, Michael J. Frank

Abstract: Planning and executing volitional actions in the face of conflicting habitual responses is a critical aspect of human behavior. At the core of the interplay between these two control systems lies an override mechanism that can suppress the habitual action selection process and allow executive control to take over. Here, we construct a neural circuit model informed by behavioral and electrophysiolo… ▽ More Planning and executing volitional actions in the face of conflicting habitual responses is a critical aspect of human behavior. At the core of the interplay between these two control systems lies an override mechanism that can suppress the habitual action selection process and allow executive control to take over. Here, we construct a neural circuit model informed by behavioral and electrophysiological data collected on various response inhibition paradigms. This model extends a well established model of action selection in the basal ganglia by including a frontal executive control network which integrates information about sensory input and task rules to facilitate well-informed decision making via the oculomotor system. Our simulations of the antisaccade, Simon and saccade-override task ensue in conflict between a prepotent and controlled response which causes the network to pause action selection via projections to the subthalamic nucleus. Our model reproduces key behavioral and electrophysiological patterns and their sensitivity to lesions and pharmacological manipulations. Finally, we show how this network can be extended to include the inferior frontal cortex to simulate key qualitative patterns of global response inhibition demands as required in the stop-signal task. △ Less

Submitted 3 December, 2012; v1 submitted 4 December, 2011; originally announced December 2011.

Comments: 3rd submission (now accepted at Psychological Review). Removed switch-DDM and some other data points, restructured some graphics. Added systematic accuracy-RT analysis of speed-accuracy trade-off

Search v0.5.6 released 2020-02-24