Skip to main content

Showing 1–4 of 4 results for author: Chaudhry, H T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14957  [pdf, ps, other

    q-bio.NC cs.LG

    POCO: Scalable Neural Forecasting through Population Conditioning

    Authors: Yu Duan, Hamza Tahir Chaudhry, Misha B. Ahrens, Christopher D Harvey, Matthew G Perich, Karl Deisseroth, Kanaka Rajan

    Abstract: Predicting future neural activity is a core challenge in modeling brain dynamics, with applications ranging from scientific investigation to closed-loop neurotechnology. While recent models of population activity emphasize interpretability and behavioral decoding, neural forecasting-particularly across multi-session, spontaneous recordings-remains underexplored. We introduce POCO, a unified foreca… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  2. arXiv:2412.05418  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    No Free Lunch From Random Feature Ensembles

    Authors: Benjamin S. Ruben, William L. Tong, Hamza Tahir Chaudhry, Cengiz Pehlevan

    Abstract: Given a budget on total model size, one must decide whether to train a single, large neural network or to combine the predictions of many smaller networks. We study this trade-off for ensembles of random-feature ridge regression models. We prove that when a fixed number of trainable parameters are partitioned among $K$ independently trained models, $K=1$ achieves optimal performance, provided the… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  3. arXiv:2405.15712  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Infinite Limits of Multi-head Transformer Dynamics

    Authors: Blake Bordelon, Hamza Tahir Chaudhry, Cengiz Pehlevan

    Abstract: In this work, we analyze various scaling limits of the training dynamics of transformer models in the feature learning regime. We identify the set of parameterizations that admit well-defined infinite width and depth limits, allowing the attention layers to update throughout training--a relevant notion of feature learning in these models. We then use tools from dynamical mean field theory (DMFT) t… ▽ More

    Submitted 4 October, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Updating for Neurips 2024

  4. arXiv:2306.04532  [pdf, other

    cs.NE cond-mat.dis-nn cs.LG q-bio.NC stat.ML

    Long Sequence Hopfield Memory

    Authors: Hamza Tahir Chaudhry, Jacob A. Zavatone-Veth, Dmitry Krotov, Cengiz Pehlevan

    Abstract: Sequence memory is an essential attribute of natural and artificial intelligence that enables agents to encode, store, and retrieve complex sequences of stimuli and actions. Computational models of sequence memory have been proposed where recurrent Hopfield-like neural networks are trained with temporally asymmetric Hebbian rules. However, these networks suffer from limited sequence capacity (maxi… ▽ More

    Submitted 2 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Camera-Ready, 41 pages

    Journal ref: Advances in Neural Information Processing Systems 36 (2023)