Showing 1–12 of 12 results for author: Sompolinsky, H

Searching in archive stat.
  1. arXiv:2406.16689

    cs.LG cond-mat.dis-nn cond-mat.stat-mech stat.ML

    Coding schemes in neural networks learning classification tasks

    Authors: Alexander van Meegen, Haim Sompolinsky

    Abstract: Neural networks possess the crucial ability to generate meaningful representations of task-dependent features. Indeed, with appropriate scaling, supervised learning in neural networks can result in strong, task-dependent feature learning. However, the nature of the emergent representations, which we call the 'coding scheme', is still unclear. To understand the emergent coding scheme, we investigate…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2405.15926

    cs.LG cond-mat.dis-nn cond-mat.stat-mech stat.ML

    Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers

    Authors: Lorenzo Tiberi, Francesca Mignacco, Kazuki Irie, Haim Sompolinsky

    Abstract: Despite the remarkable empirical performance of Transformers, their theoretical understanding remains elusive. Here, we consider a deep multi-head self-attention network, that is closely related to Transformers yet analytically tractable. We develop a statistical mechanics theory of Bayesian learning in this model, deriving exact equations for the network's predictor statistics under the finite-wi…

    Submitted 7 December, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2206.08933

    q-bio.NC cond-mat.dis-nn cond-mat.stat-mech cs.LG stat.ML

    A theory of learning with constrained weight-distribution

    Authors: Weishun Zhong, Ben Sorscher, Daniel D Lee, Haim Sompolinsky

    Abstract: A central question in computational neuroscience is how structure determines function in neural networks. The emerging high-quality large-scale connectomic datasets raise the question of what general functional principles can be gleaned from structural information such as the distribution of excitatory/inhibitory synapse types and the distribution of synaptic weights. Motivated by this question, w…

    Submitted 24 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 38 pages, 13 figures. Updated introduction part and fixed several typos

  4. arXiv:2205.14544

    q-bio.NC cond-mat.dis-nn stat.ML

    Temporal support vectors for spiking neuronal networks

    Authors: Ran Rubin, Haim Sompolinsky

    Abstract: When neural circuits learn to perform a task, it is often the case that there are many sets of synaptic connections that are consistent with the task. However, only a small number of possible solutions are robust to noise in the input and are capable of generalizing their performance of the task to new inputs. Finding such good solutions is an important goal of learning systems in general and neur…

    Submitted 28 May, 2022; originally announced May 2022.

  5. arXiv:2203.07040

    cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC stat.ML

    Soft-margin classification of object manifolds

    Authors: Uri Cohen, Haim Sompolinsky

    Abstract: A neural population responding to multiple appearances of a single object defines a manifold in the neural response space. The ability to classify such manifolds is of interest, as object recognition and other computational tasks require a response that is insensitive to variability within a manifold. Linear classification of object manifolds was previously studied for max-margin classifiers. Soft…

    Submitted 14 March, 2022; originally announced March 2022.

    Journal ref: Phys. Rev. E 106, 024126 (2022)

  6. arXiv:2008.08653

    cond-mat.dis-nn cond-mat.stat-mech cs.LG q-bio.NC stat.ML

    A new role for circuit expansion for learning in neural networks

    Authors: Julia Steinberg, Madhu Advani, Haim Sompolinsky

    Abstract: Many sensory pathways in the brain rely on sparsely active populations of neurons downstream from the input stimuli. The biological reason for the occurrence of expanded structure in the brain is unclear, but may be because expansion can increase the expressive power of a neural network. In this work, we show that expanding a neural network can improve its generalization performance even in cases…

    Submitted 21 December, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: 13+10 pages, 13 figures

    Journal ref: Phys. Rev. E 103, 022404 (2021)

  7. arXiv:2004.01190

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG cs.NE

    Predicting the outputs of finite deep neural networks trained with noisy gradients

    Authors: Gadi Naveh, Oded Ben-David, Haim Sompolinsky, Zohar Ringel

    Abstract: A recent line of works studied wide deep neural networks (DNNs) by approximating them as Gaussian Processes (GPs). A DNN trained with gradient flow was shown to map to a GP governed by the Neural Tangent Kernel (NTK), whereas earlier works showed that a DNN with an i.i.d. prior over its weights maps to the so-called Neural Network Gaussian Process (NNGP). Here we consider a DNN training protocol,…

    Submitted 30 September, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: 8 pages + appendix, 7 figures overall

  8. arXiv:1710.06487

    cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC stat.ML

    Classification and Geometry of General Perceptual Manifolds

    Authors: SueYeon Chung, Daniel D. Lee, Haim Sompolinsky

    Abstract: Perceptual manifolds arise when a neural population responds to an ensemble of sensory signals associated with different physical features (e.g., orientation, pose, scale, location, and intensity) of the same perceptual object. Object recognition and discrimination requires classifying the manifolds in a manner that is insensitive to variability within a manifold. How neuronal systems give rise to…

    Submitted 24 June, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: 24 pages, 12 figures, Supplementary Materials

    Journal ref: Phys. Rev. X 8, 031003 (2018)

  9. Learning Data Manifolds with a Cutting Plane Method

    Authors: SueYeon Chung, Uri Cohen, Haim Sompolinsky, Daniel D. Lee

    Abstract: We consider the problem of classifying data manifolds where each manifold represents invariances that are parameterized by continuous degrees of freedom. Conventional data augmentation methods rely upon sampling large numbers of training examples from these manifolds; instead, we propose an iterative algorithm called M_{CP} based upon a cutting-plane approach that efficiently solves a quadratic se…

    Submitted 28 May, 2017; originally announced May 2017.

    Journal ref: Neural Computation 30(10), 2593-2615 (2018)

  10. arXiv:1705.01502

    q-bio.NC cond-mat.dis-nn cs.LG cs.NE stat.ML

    Balanced Excitation and Inhibition are Required for High-Capacity, Noise-Robust Neuronal Selectivity

    Authors: Ran Rubin, L. F. Abbott, Haim Sompolinsky

    Abstract: Neurons and networks in the cerebral cortex must operate reliably despite multiple sources of noise. To evaluate the impact of both input and output noise, we determine the robustness of single-neuron stimulus selective responses, as well as the robustness of attractor states of networks of neurons performing memory tasks. We find that robustness to output noise requires synaptic connections to be…

    Submitted 3 May, 2017; originally announced May 2017.

    Comments: Article and supplementary information

    Journal ref: Proceedings of the National Academy of Sciences of the United States of America, 114(41), 2017

  11. arXiv:1512.01834

    cond-mat.dis-nn cond-mat.stat-mech cs.NE q-bio.NC stat.ML

    Linear Readout of Object Manifolds

    Authors: SueYeon Chung, Daniel D. Lee, Haim Sompolinsky

    Abstract: Objects are represented in sensory systems by continuous manifolds due to sensitivity of neuronal responses to changes in physical features such as location, orientation, and intensity. What makes certain sensory representations better suited for invariant decoding of objects by downstream networks? We present a theory that characterizes the ability of a linear readout network, the perceptron, to…

    Submitted 21 August, 2016; v1 submitted 6 December, 2015; originally announced December 2015.

    Comments: 5 pages, 3 figures, accepted in Physical Review E as a Rapid Communication on 14 May 2016

    Journal ref: Phys. Rev. E 93, 060301 (R) (2016)

  12. Theory of spike timing based neural classifiers

    Authors: Ran Rubin, Remi Monasson, Haim Sompolinsky

    Abstract: We study the computational capacity of a model neuron, the Tempotron, which classifies sequences of spikes by linear-threshold operations. We use statistical mechanics and extreme value theory to derive the capacity of the system in random classification tasks. In contrast to its static analog, the Perceptron, the Tempotron's solutions space consists of a large number of small clusters of weight v…

    Submitted 26 October, 2010; originally announced October 2010.

    Comments: 4 pages, 4 figures, accepted to Physical Review Letters on 19 Oct. 2010

    Journal ref: Phys. Rev. Lett. 105, 218102 (2010)